ngramrr: A Simple General Purpose N-Gram Tokenizer

A simple n-gram (contiguous sequences of n items from a given sequence of text) tokenizer to be used with the 'tm' package with no 'rJava'/'RWeka' dependency.

Author
Chung-hong Chan <chainsawtiney@gmail.com>
Date of publication
2016-03-10 23:44:11
Maintainer
Chung-hong Chan <chainsawtiney@gmail.com>
License
GPL-2
Version
0.2.0
URLs

View on CRAN

Man pages

dtmwrappers
Wrappers to DocumentTermMatrix and DocumentTermMatrix to use...
ngramrr
General purpose n-gram tokenizer

Files in this package

ngramrr
ngramrr/tests
ngramrr/tests/testthat.R
ngramrr/tests/testthat
ngramrr/tests/testthat/test_integration.R
ngramrr/tests/testthat/test_charngram.R
ngramrr/NAMESPACE
ngramrr/R
ngramrr/R/ngramrr.R
ngramrr/R/dtm2.R
ngramrr/README.md
ngramrr/MD5
ngramrr/DESCRIPTION
ngramrr/man
ngramrr/man/dtmwrappers.Rd
ngramrr/man/ngramrr.Rd