statsmaths/cleanNLP: A Tidy Data Model for Natural Language Processing

Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may make use of the 'udpipe' back end with no external dependencies, or two Python back ends with 'spaCy' <https://spacy.io> or 'CoreNLP' <https://stanfordnlp.github.io/CoreNLP/>. Exposed annotation tasks include tokenization, part of speech tagging, named entity recognition, and dependency parsing.

Getting started

Package details

AuthorTaylor B. Arnold [aut, cre]
MaintainerTaylor B. Arnold <tarnold2@richmond.edu>
LicenseLGPL-2
Version3.0.7
URL https://statsmaths.github.io/cleanNLP/
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("statsmaths/cleanNLP")
statsmaths/cleanNLP documentation built on Jan. 27, 2024, 1:43 p.m.