Ckmeans.1d.dp: Optimal, Fast, and Reproducible Univariate Clustering

Fast, optimal, and reproducible weighted univariate clustering by dynamic programming. Four problems are solved, including univariate k-means (Wang & Song 2011) <doi:10.32614/RJ-2011-015> (Song & Zhong 2020) <doi:10.1093/bioinformatics/btaa613>, k-median, k-segments, and multi-channel weighted k-means. Dynamic programming is used to minimize the sum of (weighted) within-cluster distances using respective metrics. Its advantage over heuristic clustering in efficiency and accuracy is pronounced when there are many clusters. Multi-channel weighted k-means groups multiple univariate signals into k clusters. An auxiliary function generates histograms adaptive to patterns in data. This package provides a powerful set of tools for univariate data analysis with guaranteed optimality, efficiency, and reproducibility, useful for peak calling on temporal, spatial, and spectral data.

Package details

AuthorJoe Song [aut, cre] (<https://orcid.org/0000-0002-6883-6547>), Hua Zhong [aut] (<https://orcid.org/0000-0003-1962-2603>), Haizhou Wang [aut]
MaintainerJoe Song <joemsong@cs.nmsu.edu>
LicenseLGPL (>= 3)
Version4.3.5
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("Ckmeans.1d.dp")

Try the Ckmeans.1d.dp package in your browser

Any scripts or data that you put into this service are public.

Ckmeans.1d.dp documentation built on Aug. 20, 2023, 1:08 a.m.