statsmaths/cleanNLP: A Tidy Data Model for Natural Language Processing

Provides a set of fast tools for converting a textual corpus into a set of normalized tables. Users may make use of the 'udpipe' back end with no external dependencies, or a Python back ends with 'spaCy' <https://spacy.io>. Exposed annotation tasks include tokenization, part of speech tagging, named entity recognition, and dependency parsing.

Getting started

Package details

AuthorTaylor B. Arnold [aut, cre]
MaintainerTaylor B. Arnold <tarnold2@richmond.edu>
LicenseLGPL-2
Version3.1.0
URL https://statsmaths.github.io/cleanNLP/
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("statsmaths/cleanNLP")
statsmaths/cleanNLP documentation built on May 21, 2024, 6:47 a.m.