aricode

An R package for efficient computation of standard clustering comparison measures.

Installation

The stable version is available on CRAN:

install.packages("aricode")

The development version is available via:

devtools::install_github("jchiquet/aricode")
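
Once the package is installed, a quick way to check that it works is to compare two small labelings. This is a minimal sketch: the two label vectors are made up for illustration, and only three of the measures named in this README are called.

```r
library(aricode)

# Two hypothetical clusterings of the same six observations
c1 <- c(1, 1, 1, 2, 2, 2)
c2 <- c(1, 1, 2, 2, 3, 3)

ARI(c1, c2)  # adjusted Rand index
AMI(c1, c2)  # adjusted mutual information
NID(c1, c2)  # normalized information distance
```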

Description

Computation of measures for clustering comparison (ARI, AMI, NID and even the $\chi^2$ distance) is usually based on the contingency table. Traditional implementations (e.g., function adjustedRandIndex of package mclust) are in $\Omega(n + u v)$, where $n$ is the length of the two label vectors and $u$ and $v$ are the numbers of classes in each of them.

In aricode we propose an implementation, based on radix sort, that is in $\Theta(n)$ in time and space. Importantly, the complexity does not depend on $u$ and $v$. Our implementation of the ARI, for instance, is one or two orders of magnitude faster than some standard implementations in R.
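
To illustrate the idea of avoiding the dense $u \times v$ table, here is a base R sketch that computes the ARI from the non-empty cells only, by sorting the label pairs and counting runs. It is not the code used inside aricode: the function name `ari_sparse` is hypothetical, and the recoding step with `factor()` sorts the distinct labels, so this sketch is not strictly linear in $n$.

```r
# Sketch: ARI computed from the non-empty contingency cells only.
# ari_sparse is a hypothetical helper, not part of aricode.
ari_sparse <- function(c1, c2) {
  stopifnot(length(c1) == length(c2))
  n <- length(c1)

  # Recode labels as integers 1..u and 1..v
  a <- as.integer(factor(c1))
  b <- as.integer(factor(c2))

  # Sort the (a, b) pairs; base R's order() offers a radix method
  ord <- order(a, b, method = "radix")
  a <- a[ord]
  b <- b[ord]

  # A new run starts wherever the sorted pair changes
  new_run <- c(TRUE, a[-1] != a[-n] | b[-1] != b[-n])
  starts <- which(new_run)
  nij <- diff(c(starts, n + 1L))  # sizes of the non-empty cells

  ni <- tabulate(a)  # row marginals
  nj <- tabulate(b)  # column marginals

  s_ij <- sum(choose(nij, 2))
  s_i <- sum(choose(ni, 2))
  s_j <- sum(choose(nj, 2))
  expected <- s_i * s_j / choose(n, 2)

  (s_ij - expected) / ((s_i + s_j) / 2 - expected)
}

ari_sparse(c(1, 1, 2, 2), c(1, 1, 2, 2))  # identical partitions
ari_sparse(c(1, 1, 2, 2), c(1, 2, 1, 2))  # crossed partitions
```

The number of non-empty cells is at most $n$, so the memory used after sorting does not grow with $u v$, which is the point made above.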

Available measures and functions

The functions included in aricode are:

Timings

Here are some timings to compare the cost of computing the adjusted Rand Index with aricode or with the commonly used function adjustedRandIndex of the mclust package. The cost of the latter can be prohibitive for large vectors.
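
A comparison of this kind can be reproduced along the following lines. This is only a sketch: the vector length, the number of classes and the random labels are arbitrary choices made for illustration, not the settings behind the original timings, and the block runs the comparison only if both packages are installed.

```r
set.seed(1)

# Assumed sizes: n observations, k classes in each clustering
n <- 1e5
k <- 2000
c1 <- sample.int(k, n, replace = TRUE)
c2 <- sample.int(k, n, replace = TRUE)

if (requireNamespace("aricode", quietly = TRUE) &&
    requireNamespace("mclust", quietly = TRUE)) {
  t_aricode <- system.time(ari_a <- aricode::ARI(c1, c2))
  t_mclust <- system.time(ari_m <- mclust::adjustedRandIndex(c1, c2))

  # Elapsed wall-clock time in seconds for each implementation
  print(c(aricode = t_aricode[["elapsed"]], mclust = t_mclust[["elapsed"]]))

  # The two values can be compared by eye
  print(c(aricode = ari_a, mclust = ari_m))
}
```

Increasing `k` makes the dense contingency table larger, which is where the gap between the two approaches is expected to widen.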



jchiquet/aricode documentation built on Feb. 27, 2024, 4:33 p.m.