cluster_performance: External Cluster Validity Metrics
In DavidHofmeyr/spuds: Spectral Partitioning Using Density Separation

Description Usage Arguments Value References

Computes four popular external cluster validity metrics (adjusted Rand index, purity, V-measure and Normalised Mutual Information) through comparison of cluster assignments and true class labels.

1	cluster_performance(assigned, labels, beta)

`assigned`	a vector of cluster assignments made by a clustering algorithm.
`labels`	a vector of true class labels to be compared with assigned.
`beta`	(optional) positive numeric, used in the computation of V-measure. larger values apply higher weight to homogeneity over completeness measures. if omitted then beta = 1 (equal weight applied to both measures).

a vector containing the four evaluation metrics listed in the description.

Zhao Y., Karypis G. (2004) Empirical and Theoretical Comparisons of Selected Criterion Functions for Document Clustering. Machine Learning, 55(3), 311–331.

Strehl A., Ghosh J. (2002) Cluster ensembles<e2><80><94>a knowledge reuse framework for combining multiple partitions. Journal of Machine Learning Research, 3, 583–617.

Rosenberg A., Hirschberg J. (2007) V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure. EMNLP-CoNLL, 7, 410–420. Citeseer.

Hubert, L., Arabie, P. (1985) Comparing Partitions. Journal of Classification, 2(1), 193–218.

DavidHofmeyr/spuds documentation built on May 20, 2019, 9:40 a.m.