benmarwick/rmgarbage: Automatic garbage extraction from OCR'd text

Removes garbage generated during optical character recognition of text. Derived from the methods described by Taghva et al. in <http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.81.8901> and code at <https://github.com/foodoh/rmgarbage> and <https://github.com/Amoki/rmgarbage>.

Getting started

Package details

Maintainer
LicenseMIT + file LICENSE
Version0.0.0.9000
URL https://github.com/benmarwick/rmgarbage
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("benmarwick/rmgarbage")
benmarwick/rmgarbage documentation built on April 19, 2020, 6:06 p.m.