ngram: Fast n-Gram 'Tokenization'
Version 3.0.4

An n-gram is a sequence of n "words" taken, in order, from a body of text. This package is a collection of utilities for creating, displaying, summarizing, and "babbling" n-grams. Tokenization and babbling are handled by efficient C code, which can also be built as a standalone library. The babbler is a simple Markov chain. The package also includes a vignette with complete example workflows and a description of the utilities it offers.
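To make the two ideas above concrete, here is a minimal base-R sketch of word n-gram extraction and a Markov-chain babbler. It is not the package's C implementation: the helper names `make_ngrams` and `babble_sketch` are hypothetical, the text is split on whitespace only, and the babbler is simplified to a first-order chain over single words.

```r
# Illustrative sketch only; helper names are hypothetical, not part of ngram.

# Split text on whitespace and return every run of n consecutive words.
make_ngrams <- function(text, n = 2) {
  words <- strsplit(trimws(text), "[[:space:]]+")[[1]]
  if (length(words) < n) return(character(0))
  starts <- seq_len(length(words) - n + 1)
  vapply(starts,
         function(i) paste(words[i:(i + n - 1)], collapse = " "),
         character(1))
}

# Generate `len` words by repeatedly sampling a word that followed
# the current word somewhere in the input (a first-order Markov chain).
babble_sketch <- function(text, len = 5) {
  words <- strsplit(trimws(text), "[[:space:]]+")[[1]]
  current <- words[sample.int(length(words), 1)]
  out <- current
  while (length(out) < len) {
    followers <- words[which(head(words, -1) == current) + 1]
    # Dead end (the word is never followed by anything): fall back to any word.
    if (length(followers) == 0) followers <- words
    current <- followers[sample.int(length(followers), 1)]
    out <- c(out, current)
  }
  paste(out, collapse = " ")
}

bigrams <- make_ngrams("A B A C A B B", n = 2)
print(table(bigrams))   # frequency of each distinct 2-gram

set.seed(1)
print(babble_sketch("A B A C A B B", len = 5))
```

For the input "A B A C A B B" this yields six 2-grams, of which "A B" occurs twice. The babbled output depends on the random seed.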

Package details

Author: Drew Schmidt [aut, cre], Christian Heckendorf [aut]
Date of publication: 2017-11-21 15:22:56 UTC
Maintainer: Drew Schmidt <[email protected]>
License: BSD 2-clause License + file LICENSE
Version: 3.0.4
URL: https://github.com/wrathematics/ngram
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
install.packages("ngram")
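
Once the package is installed, a short session might look like the sketch below. It assumes the interface of version 3.0.4 (`ngram()`, `get.phrasetable()`, and `babble()`); the input string and seed are arbitrary, and the package vignette remains the authoritative reference for the workflow.

```r
library(ngram)

# Arbitrary example text; any character string works.
str <- "A B A C A B B"

# Tokenize the text into 2-grams.
ng <- ngram(str, n = 2)
print(ng)

# Tabulate each n-gram with its frequency and proportion.
pt <- get.phrasetable(ng)
print(pt)

# Generate 5 words from the Markov chain; fixing the seed makes the run repeatable.
out <- babble(ng, genlen = 5, seed = 1234)
print(out)
```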

ngram documentation built on Nov. 21, 2017, 5:03 p.m.