RfEmpImp: Multiple Imputation using Chained Random Forests

An R package for multiple imputation using chained random forests. The implemented methods handle missing data in variables of mixed types by using prediction-based or node-based conditional distributions constructed with random forests. For prediction-based imputation, two methods are provided for continuous variables: one based on the empirical distribution of out-of-bag prediction errors of random forests, and one based on a normality assumption for the prediction errors of random forests. For categorical variables, a method based on predicted probabilities is provided. For node-based imputation, two methods are provided: one based on the conditional distribution formed by the predicting nodes of random forests, and one based on proximity measures of random forests. More details of the statistical methods can be found in Hong et al. (2020) <arXiv:2004.14823>.
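
To make the difference between the two prediction-based approaches for continuous variables more concrete, here is a minimal base-R sketch. It is not the package's implementation: `lm()` stands in for a random forest, in-sample residuals stand in for out-of-bag prediction errors, and the simulated data and all variable names are hypothetical.

```r
# Illustrative sketch only: lm() stands in for a random forest, and its
# in-sample residuals stand in for out-of-bag prediction errors.
set.seed(42)

# Simulated data with skewed errors (hypothetical example data)
n <- 200
x <- rnorm(n)
y <- 1 + 2 * x + (rexp(n) - 1)

# Make 40 values of y missing completely at random
y_mis <- y
y_mis[sample(n, 40)] <- NA
dat <- data.frame(x = x, y = y_mis)
is_mis <- is.na(dat$y)

# Fit the stand-in prediction model on the observed cases
fit <- lm(y ~ x, data = dat[!is_mis, ])
errs <- residuals(fit)
pred <- predict(fit, newdata = dat[is_mis, ])

# Empirical-error draw: prediction plus a resampled observed error
imp_emp <- pred + sample(errs, length(pred), replace = TRUE)

# Normal-error draw: prediction plus Gaussian noise with matching spread
imp_norm <- pred + rnorm(length(pred), mean = 0, sd = sqrt(mean(errs^2)))

# Repeating the draws m times gives m completed data sets
m <- 5
completed <- lapply(seq_len(m), function(i) {
  out <- dat
  out$y[is_mis] <- pred + sample(errs, length(pred), replace = TRUE)
  out
})
```

Because the simulated errors are skewed, the empirical draws keep that skew while the normal draws are symmetric around the prediction. The sketch handles a single incomplete variable and leaves out the chaining across variables that the package performs.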

Package details

Author: Shangzhi Hong [aut, cre], Henry S. Lynn [ths]
Maintainer: Shangzhi Hong <shangzhi-hong@hotmail.com>
License: GPL-3
Version: 2.1.8
URL: https://github.com/shangzhi-hong/RfEmpImp
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
install.packages("RfEmpImp")
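
Once the package is installed, a typical workflow might look like the sketch below. It assumes that the exported functions `conv.factor()`, `imp.rfemp()` and `reg.ests()` behave as described in the package README, and it uses the `nhanes` data set from the `mice` package; check the reference manual of your installed version for the exact arguments.

```r
library(mice)      # provides nhanes, with() for imputed data, and pool()
library(RfEmpImp)

# Convert the categorical variables to factors
# (assumes conv.factor(data, column_names) as in the package README)
nhanes.fix <- conv.factor(nhanes, c("age", "hyp"))

# Impute with the empirical-error method, using the function's defaults
imp <- imp.rfemp(nhanes.fix)

# Fit the analysis model on each imputed data set
regObj <- with(imp, lm(chl ~ bmi + hyp))

# Pool the results across imputations
poolObj <- pool(regObj)

# Extract the pooled estimates
res <- reg.ests(poolObj)
print(res)
```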


RfEmpImp documentation built on Oct. 20, 2022, 9:06 a.m.