Description Usage Arguments Value See Also Examples
Wrappers to DocumentTermMatrix
and DocumentTermMatrix
to use n-gram tokenization provided by ngramrr
.
1 2 3 |
x |
character vector, |
char |
logical, using character n-gram. char = FALSE denotes word n-gram. |
ngmin |
integer, minimun order of n-gram |
ngmax |
integer, maximun order of n-gram |
rmEOL |
logical, remove ngrams wih EOL character |
... |
Additional options for |
DocumentTermMatrix
or DocumentTermMatrix
ngramrr
, DocumentTermMatrix
, TermDocumentMatrix
1 2 3 4 5 6 | nirvana <- c("hello hello hello how low", "hello hello hello how low",
"hello hello hello how low", "hello hello hello",
"with the lights out", "it's less dangerous", "here we are now", "entertain us",
"i feel stupid", "and contagious", "here we are now", "entertain us",
"a mulatto", "an albino", "a mosquito", "my libido", "yeah", "hey yay")
dtm2(nirvana, ngmax = 3, removePunctuation = TRUE)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.