| tm_clean | R Documentation |
This function processes the Subject column in a Meeting Query by applying
tokenisation usingtidytext::unnest_tokens(), and removing any stopwords
supplied in a data frame (using the argument stopwords). This is a
sub-function that feeds into tm_freq(), tm_cooc(), and tm_wordcloud().
The default is to return a data frame with tokenised counts of words or
ngrams.
tm_clean(data, token = "words", stopwords = NULL, ...)
data |
A Meeting Query dataset in the form of a data frame. |
token |
A character vector accepting either |
stopwords |
A character vector OR a single-column data frame labelled
|
... |
Additional parameters to pass to |
data frame with two columns:
line
word
Other Text-mining:
meeting_tm_report(),
pairwise_count(),
tm_cooc(),
tm_freq(),
tm_wordcloud()
# words
tm_clean(mt_data)
# ngrams
tm_clean(mt_data, token = "ngrams")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.