A retake on the Genie algorithm  a robust hierarchical clustering method (Gagolewski, Bartoszuk, Cena, 2016 <DOI:10.1016/j.ins.2016.05.003>). Now faster and more memory efficient; determining the whole hierarchy for datasets of 10M points in low dimensional Euclidean spaces or 100K points in highdimensional ones takes only 12 minutes. Allows clustering with respect to mutual reachability distances so that it can act as a noise point detector or a robustified version of 'HDBSCAN*' (that is able to detect a predefined number of clusters and hence it does not dependent on the somewhat fragile 'eps' parameter). The package also features an implementation of economic inequity indices (the Gini, Bonferroni index) and external cluster validity measures (partition similarity scores; e.g., the adjusted Rand, FowlkesMallows, adjusted mutual information, pair sets index). See also the 'Python' version of 'genieclust' available on 'PyPI', which supports sparse data, more metrics, and even larger datasets.
Package details 


Author  Marek Gagolewski [aut, cre, cph] (<https://orcid.org/0000000306376028>), Maciej Bartoszuk [ctb], Anna Cena [ctb], Peter M. Larsen [ctb] 
Maintainer  Marek Gagolewski <marek@gagolewski.com> 
License  AGPL3 
Version  0.9.4 
URL  https://genieclust.gagolewski.com/ 
Package repository  View on CRAN 
Installation 
Install the latest version of this package by entering the following in R:

Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.