Variable selection for latent class analysis for model-based clustering of multivariate categorical data
Variable selection for latent class analysis for model-based clustering of multivariate categorical data. The package implements a general framework for selecting the subset of variables with relevant clustering information and discard those that are redundant and/or not informative. The variable selection method is based on the approach of Fop et al. (2017) and Dean and Raftery (2010). Different algorithms are available to perform the selection: stepwise, swap-stepwise and evolutionary stochastic search. Concomitant covariates used to predict the class membership probabilities can also be included in the latent class analysis model. The selection procedure can be run in parallel on multiple cores machines.
The package is available on CRAN, to install it:
install.packages("LCAvarsel")
To install the development version from GitHub:
# install.packages("devtools")
devtools::install_github("michaelfop/LCAvarsel")
Fop, M., Smart, K. and Murphy, T. B. (2017). Variable selection for latent class analysis with application to low back pain diagnosis. Annals of Applied Statistics. 11(4), 2085-2115.
Dean, N. and Raftery, A. E. (2010). Latent class analysis variable selection. Annals of the Institute of Statistical Mathematics. 62(1):11-35.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.