Description Usage Arguments Value
View source: R/generate_corpus.R
Generates and tokenizes a text corpus.
1 | generate_corpus(text, type, sub_token, label = NULL, clean = TRUE)
|
text |
A vector of character strings. |
type |
Type of texts to be sampled. Possible values are texts, paragraphs, sentences, words, and characters. |
sub_token |
A string specifying the text unit for filtering texts by length via |
label |
A vector of labels. |
clean |
If |
Text corpus.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.