genieclust: Genie: Fast and Robust Hierarchical Clustering

Genie is a robust hierarchical clustering algorithm (Gagolewski, Bartoszuk, Cena, 2016 <DOI:10.1016/j.ins.2016.05.003>). 'genieclust' is its faster, more capable implementation (Gagolewski, 2021 <DOI:10.1016/j.softx.2021.100722>). It enables clustering with respect to mutual reachability distances, allowing it to act as an alternative to 'HDBSCAN*' that can identify any number of clusters or their entire hierarchy. When combined with the 'deadwood' package, it can act as an outlier detector. Additional package features include the Gini and Bonferroni inequality indices, external cluster validity measures (e.g., the normalised clustering accuracy, the adjusted Rand index, the Fowlkes-Mallows index, and normalised mutual information), and internal cluster validity indices (e.g., the Calinski-Harabasz, Davies-Bouldin, Ball-Hall, Silhouette, and generalised Dunn indices). The 'Python' version of 'genieclust' is available via 'PyPI'.

Package details

AuthorMarek Gagolewski [aut, cre, cph] (ORCID: <https://orcid.org/0000-0003-0637-6028>), Maciej Bartoszuk [ctb], Anna Cena [ctb], Peter M. Larsen [ctb]
MaintainerMarek Gagolewski <marek@gagolewski.com>
LicenseAGPL-3
Version1.3.0
URL https://genieclust.gagolewski.com/ https://clustering-benchmarks.gagolewski.com/ https://github.com/gagolews/genieclust
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("genieclust")

Try the genieclust package in your browser

Any scripts or data that you put into this service are public.

genieclust documentation built on Feb. 24, 2026, 1:06 a.m.