lmkirvan/textTries: Tries for Fast Replacement of Big Dictionaries

This package uses the data.tree package to create tries that use a modified form of the aho-corasick algorithm to allow finding and replacing large numbers of dictionary keys in a text. Really only makes sense when your dictionary is quite large (> 500 terms). Useful for things like entity resolution in texts. Should probably be reimplimented in c++ to make it super fast...¯\_(ツ)_/¯.

Getting started

Package details

Authorperson("Lewis", "Kirvan", email = "kirvan.lewis@gmail.com", role = c("aut", "cre"))
MaintainerThe package maintainer <yourself@somewhere.net>
LicenseWhat license is it under?
Version0.1.0
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("lmkirvan/textTries")
lmkirvan/textTries documentation built on May 25, 2019, 9:32 p.m.