Description Usage Arguments Details Value Author(s) References Examples
Estimate the number of dimensions for the Multiple Correspondence Analysis by cross-validation
1 | estim_ncpMCA(don, ncp.min=0, ncp.max=5, nbsim=100, pNA=0.05, threshold=1e-4)
|
don |
a data.frame with categorical variables; with missing entries or not |
ncp.min |
integer corresponding to the minimum number of components to test |
ncp.max |
integer corresponding to the maximum number of components to test |
nbsim |
number of simulations |
pNA |
percentage of missing values added in the data set |
threshold |
the threshold for assessing convergence |
For the cross-validation, pNA percentage of missing values are removed at random and predicted with a MCA model using ncp.min to ncp.max dimensions. This process is repeated nbsim times. The number of components which leads to the smallest MSEP is retained. Each cell is predicted using the imputeMCA function, it means using the regularized iterative MCA algorithm.
ncp |
the number of components retained for the MCA |
criterion |
the criterion (the MSEP) calculated for each number of components |
Francois Husson husson@agrocampus-ouest.fr and Julie Josse Julie.Josse@agrocampus-ouest.fr
Josse, J., Chavent, M., Liquet, B. and Husson, F. (2010). Handling missing values with Regularized Iterative Multiple Correspondence Analysis.
1 2 3 4 5 | ## Not run:
data(vnf)
result <- estim_ncpMCA(vnf,ncp.min=0, ncp.max=3, nbsim=100)
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.