tdigest: Wicked Fast, Accurate Quantiles Using t-Digests

The t-Digest construction algorithm, by Dunning et al., (2019) <doi:10.48550/arXiv.1902.04023>, uses a variant of 1-dimensional k-means clustering to produce a very compact data structure that allows accurate estimation of quantiles. This t-Digest data structure can be used to estimate quantiles, compute other rank statistics or even to estimate related measures like trimmed means. The advantage of the t-Digest over previous digests for this purpose is that the t-Digest handles data with full floating point resolution. The accuracy of quantile estimates produced by t-Digests can be orders of magnitude more accurate than those produced by previous digest algorithms. Methods are provided to create and update t-Digests and retrieve quantiles from the accumulated distributions.

Getting started

Package details

AuthorBob Rudis [aut, cre] (<https://orcid.org/0000-0001-5670-2640>), Ted Dunning [aut] (t-Digest algorithm; <https://github.com/tdunning/t-digest/>), Andrew Werner [aut] (Original C+ code; <https://github.com/ajwerner/tdigest>)
MaintainerBob Rudis <bob@rud.is>
LicenseMIT + file LICENSE
Version0.4.2
URL https://git.sr.ht/~hrbrmstr/tdigest
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("tdigest")

Try the tdigest package in your browser

Any scripts or data that you put into this service are public.

tdigest documentation built on June 22, 2024, 10:44 a.m.