subsample_clustering_evaluation | R Documentation |
Performs clustering analysis on each fold of an external cross validation.
subsample_clustering_evaluation(
dat_embedded,
parallel = 1,
by = c("datname", "drname", "run", "fold"),
silhouette_dissimilarity = NULL,
dat_list = NULL,
...
)
dat_embedded |
list of |
parallel |
number of threads |
by |
variables to split input data by |
silhouette_dissimilarity |
dissimilarity matrix used for silhouette evaluation |
dat_list |
list of input data matrices used for calculating clustering indices |
... |
extra arguments are passed through to |
Produces clusterings using multiple methods and settings while computing internal validation metrics such as Connectivity, Dunn and Silhouette scores. Also computes chi-squared tests with respect to a batch label if one is provided.
Returns a list
of data.frames
containing clustering_analysis
and clustering_metrics
outputs for every
combination of CV run, CV fold, clustering method, number of clusters as well as all combinations of
data sets and dimensionality reduction techniques found in the input data.frame
.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.