
Missing data imputation, classification, prediction and average treatment effect estimation via Random Recursive Partitioning
Stefano Iacus, Department of Economics, Business and Statistics, University of Milan, IT
Giuseppe Porro, Department of Economics and Statistics, University of Trieste, Italy
Download the Paper (PDF format) - February 21, 2006
Tell a colleague about it.
Printing Tips: Select 'print as image' in the Acrobat print dialog if you have trouble printing.
ABSTRACT:
In this paper we describe some applications of the Random Recursive Partitioning (RRP) method. This method generates a proximity matrix which can be used in non parametric hot-deck missing data imputation, classification, prediction, average treatment effect estimation and, more generally, in matching problems. RRP is a Monte Carlo procedure that randomly generates non-empty recursive partitions of the data and evaluates the proximity between observations as the empirical frequency they fall in the same cell of these random partitions over all the replications.
RRP works also in the presence of missing data and is invariant under monotonic transformations of the data. No other formal properties of the method are known yet, therefore Monte Carlo experiments are provided in order to explore the performance of the method. A companion software is available in the form of a package for the R statistical environment.
SUGGESTED CITATION:
Stefano Iacus and Giuseppe Porro,
"Missing data imputation, classification, prediction and average treatment effect estimation via Random Recursive Partitioning"
(February 2006).
UNIMI - Research Papers in Economics, Business, and Statistics.
Statistics and Mathematics.
Working Paper 7.
http://services.bepress.com/unimi/statistics/art7
Paper presented by Pier Alda Ferrari.
|