MixSim | R Documentation |
Generates a finite mixture model with Gaussian components for prespecified levels of maximum and/or average overlaps.
MixSim(BarOmega = NULL, MaxOmega = NULL, K, p, sph = FALSE, hom = FALSE,
ecc = 0.90, PiLow = 1.0, int = c(0.0, 1.0), resN = 100,
eps = 1e-06, lim = 1e06)
BarOmega |
value of desired average overlap. |
MaxOmega |
value of desired maximum overlap. |
K |
number of components. |
p |
number of dimensions. |
sph |
covariance matrix structure (FALSE - non-spherical, TRUE - spherical). |
hom |
heterogeneous or homogeneous clusters (FALSE - heterogeneous, TRUE - homogeneous). |
ecc |
maximum eccentricity. |
PiLow |
value of the smallest mixing proportion (if 'PiLow' is not reachable with respect to K, equal proportions are taken; PiLow = 1.0 implies equal proportions by default). |
int |
mean vectors are simulated uniformly on a hypercube with sides specified by int = (lower.bound, upper.bound). |
resN |
maximum number of mixture resimulations. |
eps |
error bound for overlap computation. |
lim |
maximum number of integration terms (Davies, 1980). |
If 'BarOmega' is not specified, the function generates a mixture solely based on 'MaxOmega'; if 'MaxOmega' is not specified, the function generates a mixture solely based on 'BarOmega'.
If 'hom' is TRUE, only one of 'BarOmega' or 'MaxOmega' can be specified.
Pi |
vector of mixing proportions. |
Mu |
matrix consisting of components' mean vectors (K * p). |
S |
set of components' covariance matrices (p * p * K). |
OmegaMap |
matrix of misclassification probabilities (K * K); OmegaMap[i,j] is the probability that X coming from the i-th component is classified to the j-th component. |
BarOmega |
value of average overlap. |
MaxOmega |
value of maximum overlap. |
rcMax |
row and column numbers for the pair of components producing maximum overlap 'MaxOmega'. |
fail |
flag value; 0 represents successful mixture generation, 1 represents failure. |
Volodymyr Melnykov, Wei-Chen Chen, and Ranjan Maitra.
Maitra, R. and Melnykov, V. (2010) “Simulating data to study performance of finite mixture modeling and clustering algorithms”, The Journal of Computational and Graphical Statistics, 2:19, 354-376.
Melnykov, V., Chen, W.-C., and Maitra, R. (2012) “MixSim: An R Package for Simulating Data to Study Performance of Clustering Algorithms”, Journal of Statistical Software, 51:12, 1-25.
Davies, R. (1980) “The distribution of a linear combination of chi-square random variables”, Applied Statistics, 29, 323-333.
overlap
, pdplot
, and simdataset
.
set.seed(1234)
# controls average and maximum overlaps
(ex.1 <- MixSim(BarOmega = 0.05, MaxOmega = 0.15, K = 4, p = 5))
summary(ex.1)
# controls average overlap
(ex.2 <- MixSim(BarOmega = 0.05, K = 4, p = 5, hom = TRUE))
summary(ex.2)
# controls maximum overlap
(ex.3 <- MixSim(MaxOmega = 0.15, K = 4, p = 5, sph = TRUE))
summary(ex.3)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.