microclustr: Entity Resolution with Random Partition Priors for Microclustering

An implementation of the model in Betancourt, Zanella, Steorts (2020) <arXiv:2004.02008>, which performs microclustering models for categorical data. The package provides a vignette for two proposed methods in the paper as well as two standard Bayesian non-parametric clustering approaches for entity resolution. The experiments are reproducible and illustrated using a simple vignette. LICENSE: GPL-3 + file license.

Package details

AuthorRebecca C Steorts [aut, cre], Brenda Betancourt [aut], Giacomo Zanella [aut]
MaintainerRebecca C Steorts <beka@stat.duke.edu>
LicenseGPL-3
Version0.1.0
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("microclustr")

Try the microclustr package in your browser

Any scripts or data that you put into this service are public.

microclustr documentation built on Jan. 13, 2021, 8:58 p.m.