Description Usage Arguments Value Author(s) References See Also Examples
Perform a selection of the number K of clusters for a given subset S of clustering variables.
1 2 3 4 | selectK.R(xdata, S, Kmax, ploidy = 1, Kmin = 1,
emOptions = list(epsi = 1e-05, nberSmallEM = 20, nberIterations = 15,
nberMaxIterations = 5000, typeSmallEM = 0, typeEM = 0, putThreshold = FALSE),
cte = 1, project = deparse(substitute(xdata)))
|
xdata |
A dataset in which data of each variable are in ploidy column(s). |
S |
A subset of clustering variables on the form of logical vector of the same length P as the number of variables in |
Kmax |
The maximum number of clusters to be explored. |
ploidy |
The number of occurrences for each variable in the data. For example, ploidy = 2 for genotype |
Kmin |
The minimum number of clusters to be explored. The default value is set to 1. |
emOptions |
A list of EM options (see |
cte |
A double used for the selection criterion named |
project |
The name of the project. The default value is the name of the dataset. |
A list of estimated paramaters for each selection criteria.
Wilson Toussile
Dominique Bontemps and Wilson Toussile (2013) : Clustering and variable selection for categorical multivariate data. Electronic Journal of Statistics, Volume 7, 2344-2371, ISSN.
Wilson Toussile and Elisabeth Gassiat (2009) : Variable selection in model-based clustering using multilocus genotype data. Adv Data Anal Classif, Vol 3, number 2, 109-134.
backward.explorer
for more exploration of the competing models space, dimJump.R
for data driven calibration of the penality function, and model.selection.R
for model selection.
1 2 3 4 5 6 7 8 9 10 11 12 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.