Description Usage Arguments Details Value See Also
This function applies different cleaning techniques to clean corpus data.
1 |
data |
Data read by |
This function removes non english characters, numbers, white spaces, brackets, punctuation. It also handles cases like abbreviation, contraction. It converts entire text to lower case.
a list having sampled text data
tm_map
iconv
content_transformer
removeNumbers
replace_contraction
replace_abbreviation
bracketX
removePunctuation
tolower
stripWhitespace
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.