View source: R/compareClustering.R
compareClustering | R Documentation |
Parallel Validation Of Clusteranalyses
compareClustering(dataMatrix, maxClusters, distanceMeasures = c("euclidean", "manhattan"), clusteringMethods = c("ward.D2", "single", "complete", "average", "mcquitty", "diana", "kmeans"), sfParallel = TRUE, sfCpus = 2, ...)
dataMatrix |
a data matrix accepted by |
maxClusters |
the maximum number of clusters to evaluate |
distanceMeasures |
a character vector of the distance measures to use
(currently, only " |
clusteringMethods |
a character vector of cluster methods to use (currently, the following are allowed:
) |
sfParallel |
logical Should |
sfCpus |
number of cpu to use |
... |
passed to |
a tibble::tibble
with one row per distance measure,
method and number of clusters
from 2 to k
and the columns:
= dm
= method
= k
overall average silhoutte width (cluster::summary.silhouette$avg.width
)
minimal average cluster silhoutte width (cluster::summary.silhouette$clus.avg.widths
)
minimal silhoutte width (cluster::summary.silhouette$si.summary$`Min.`
)
percentage of positive silhoutte widths
minimal cluster bootstrap mean of Jaccard's index (fpc::clusterboot$bootmean
)
percentage of cluster bootstrap means of Jaccard's index above 0.6
fpc::cluster.stats$sindex
fpc::cluster.stats$average.within
fpc::cluster.stats$wb.ratio
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.