boxplotcluster: Clustering Method Based on Boxplot Statistics

Following Arroyo-Maté-Roque (2006), the function calculates the distance between rows or columns of the dataset using the generalized Minkowski metric as described by Ichino-Yaguchi (1994). The distance measure gives more weight to differences between quartiles than to differences between extremes, making it less sensitive to outliers. Further,the function calculates the silhouette width (Rousseeuw 1987) for different numbers of clusters and selects the number of clusters that maximizes the average silhouette width, unless a specific number of clusters is provided by the user. The approach implemented in this package is based on the following publications: Rousseeuw (1987) <doi:10.1016/0377-0427(87)90125-7>; Ichino-Yaguchi (1994) <doi:10.1109/21.286391>; Arroyo-Maté-Roque (2006) <doi:10.1007/3-540-34416-0_7>.

Getting started

Package details

AuthorGianmarco Alberti [aut, cre]
MaintainerGianmarco Alberti <gianmarcoalberti@gmail.com>
LicenseGPL (>= 2)
Version0.3
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("boxplotcluster")

Try the boxplotcluster package in your browser

Any scripts or data that you put into this service are public.

boxplotcluster documentation built on May 29, 2024, 11:50 a.m.