genie: Fast, Robust, and Outlier Resistant Hierarchical Clustering

Includes the reference implementation of Genie - a hierarchical clustering algorithm that links two point groups in such a way that an inequity measure (namely, the Gini index) of the cluster sizes does not significantly increase above a given threshold. This method most often outperforms many other data segmentation approaches in terms of clustering quality as tested on a wide range of benchmark datasets. At the same time, Genie retains the high speed of the single linkage approach, therefore it is also suitable for analysing larger data sets. For more details see (Gagolewski et al. 2016 <DOI:10.1016/j.ins.2016.05.003>). For an even faster and more feature-rich implementation, including, amongst others, noise point detection, see the 'genieclust' package.

Package details

AuthorMarek Gagolewski [aut, cre, cph] (<https://orcid.org/0000-0003-0637-6028>), Maciej Bartoszuk [aut] (<https://orcid.org/0000-0001-6088-8273>), Anna Cena [aut] (<https://orcid.org/0000-0001-8697-5383>)
MaintainerMarek Gagolewski <marek@gagolewski.com>
LicenseGPL (>= 3)
Version1.0.5
URL http://genieclust.gagolewski.com/
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("genie")

Try the genie package in your browser

Any scripts or data that you put into this service are public.

genie documentation built on Aug. 3, 2020, 1:07 a.m.