CreateSimulatedClusters: 'CreateSimulatedClusters'

View source: R/simulateclusters.R

CreateSimulatedClustersR Documentation



Generates artificial clusters from a truncated normal distribution.


CreateSimulatedClusters(DENSIT, n.clusters, j, n)



A value of 1, 2 or, 3. A 1 indicates that the clusters will be appropximately equal in size. A 2 generates a small cluster and the rest of the same size. A 3 generates a big cluster.


The number of clusters to generate.


The number of variables.


The number of observations.


Glenn W. Milligan (1985). An algorithm for generating artificial data sets which contain distinct nonoverlapping clusters is presented. The algorithm is useful for generating test data sets for Monte Carlo validation research conducted on clustering methods or statistics. The algorithm generates data sets which contain either 1, 2, 3, 4, or 5 clusters. By default, the data are embedded in either a 4, 6, or 8 dimensional space. Three different patterns for assigning the points to the clusters are provided. One pattern assigns the points equally to the clusters while the remaining two schemes produce clusters of unequal sizes. Finally, a number of methods for introducing error in the data have been incorporated in the algorithm. Psychometrica. February.

Displayr/flipCluster documentation built on Nov. 23, 2022, 4:20 a.m.