randomclustersim | R Documentation |
For a given dataset this simulates random clusterings using
stupidkcentroids
, stupidknn
,
stupidkfn
, and stupidkaven
. It then
computes and stores a set of cluster validity indexes for every
clustering.
randomclustersim(datadist,datanp=NULL,npstats=FALSE,useboot=FALSE,
bootmethod="nselectboot",
bootruns=25,
G,nnruns=100,kmruns=100,fnruns=100,avenruns=100,
nnk=4,dnnk=2,
pamcrit=TRUE,
multicore=FALSE,cores=detectCores()-1,monitor=TRUE)
datadist |
distances on which validation-measures are based, |
datanp |
optional observations times variables data matrix, see
|
npstats |
logical. If |
useboot |
logical. If |
bootmethod |
either |
bootruns |
integer. Number of resampling runs. If
|
G |
vector of integers. Numbers of clusters to consider. |
nnruns |
integer. Number of runs of |
kmruns |
integer. Number of runs of |
fnruns |
integer. Number of runs of |
avenruns |
integer. Number of runs of |
nnk |
|
dnnk |
|
pamcrit |
|
multicore |
logical. If |
cores |
integer. Number of cores for parallelisation. |
monitor |
logical. If |
List with components
nn |
list, indexed by number of clusters. Every entry is
a data frame with |
fn |
list, indexed by number of clusters. Every entry is
a data frame with |
aven |
list, indexed by number of clusters. Every entry is
a data frame with |
km |
list, indexed by number of clusters. Every entry is
a data frame with |
nnruns |
number of involved runs of |
fnruns |
number of involved runs of |
avenruns |
number of involved runs of |
kmruns |
number of involved runs of |
boot |
if |
Christian Hennig christian.hennig@unibo.it https://www.unibo.it/sitoweb/christian.hennig/en/
Hennig, C. (2019) Cluster validation by measurement of clustering characteristics relevant to the user. In C. H. Skiadas (ed.) Data Analysis and Applications 1: Clustering and Regression, Modeling-estimating, Forecasting and Data Mining, Volume 2, Wiley, New York 1-24, https://arxiv.org/abs/1703.09282
Akhanli, S. and Hennig, C. (2020) Calibrating and aggregating cluster validity indexes for context-adapted comparison of clusterings. Statistics and Computing, 30, 1523-1544, https://link.springer.com/article/10.1007/s11222-020-09958-2, https://arxiv.org/abs/2002.01822
stupidkcentroids
, stupidknn
, stupidkfn
, stupidkaven
, clustatsum
set.seed(20000)
options(digits=3)
face <- rFace(10,dMoNo=2,dNoEy=0,p=2)
rmx <- randomclustersim(dist(face),datanp=face,npstats=TRUE,G=2:3,
nnruns=2,kmruns=2, fnruns=1,avenruns=1,nnk=2)
## Not run:
rmx$km # Produces slightly different but basically identical results on ATLAS
## End(Not run)
rmx$aven
rmx$fn
rmx$nn
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.