refinr: Cluster and Merge Similar Values Within a Character Vector

These functions take a character vector as input, identify and cluster similar values, and then merge clusters together so their values become identical. The functions are an implementation of the key collision and ngram fingerprint algorithms from the open source tool Open Refine <https://openrefine.org/>. More info on key collision and ngram fingerprint can be found here <https://openrefine.org/docs/technical-reference/clustering-in-depth>.

Package details

AuthorChris Muir [aut, cre]
MaintainerChris Muir <chrismuirRVA@gmail.com>
LicenseGPL-3
Version0.3.3
URL https://github.com/ChrisMuir/refinr
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("refinr")

Try the refinr package in your browser

Any scripts or data that you put into this service are public.

refinr documentation built on Nov. 13, 2023, 1:09 a.m.