miselect: Variable Selection for Multiply Imputed Data

Penalized regression methods, such as lasso and elastic net, are used in many biomedical applications when simultaneous regression coefficient estimation and variable selection is desired. However, missing data complicates the implementation of these methods, particularly when missingness is handled using multiple imputation. Applying a variable selection algorithm on each imputed dataset will likely lead to different sets of selected predictors, making it difficult to ascertain a final active set without resorting to ad hoc combination rules. 'miselect' presents Stacked Adaptive Elastic Net (saenet) and Grouped Adaptive LASSO (galasso) for continuous and binary outcomes, developed by Du et al (2022) <doi:10.1080/10618600.2022.2035739>. They, by construction, force selection of the same variables across multiply imputed data. 'miselect' also provides cross validated variants of these methods.

Package details

AuthorMichael Kleinsasser [cre], Alexander Rix [aut], Jiacong Du [aut]
MaintainerMichael Kleinsasser <biostat-cran-manager@umich.edu>
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the miselect package in your browser

Any scripts or data that you put into this service are public.

miselect documentation built on May 29, 2024, 1:33 a.m.