chainsawriot/ngramrr: A Simple General Purpose N-Gram Tokenizer

A simple n-gram (contiguous sequences of n items from a given sequence of text) tokenizer to be used with the 'tm' package with no 'rJava'/'RWeka' dependency.

Getting started

Package details

AuthorChung-hong Chan <[email protected]>
MaintainerChung-hong Chan <[email protected]>
LicenseGPL-2
Version0.2.0
URL https://github.com/chainsawriot/ngramrr
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("chainsawriot/ngramrr")
chainsawriot/ngramrr documentation built on May 13, 2019, 3:11 p.m.