tidify_to_unigrams: Prepare data for text analysis.

Description

Create a tidy data.frame of unigrams.

Usage

tidify_to_unigrams_at(data = NULL, text = "text", rgx_pattern,
  rgx_replacement, rgx_unnest, stopwords = TRUE, stopwords_lexicon,
  rgx_ignore_custom)

tidify_to_unigrams(..., text)

Arguments

data

data.frame. Input data that is not yet in 'tidy' (one token per row) format.

text

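bare (unquoted) name for the non-standard evaluation (NSE) version, tidify_to_unigrams(); character for the standard evaluation (SE) version, tidify_to_unigrams_at(). Name of the column in data to parse. Default for the SE version: "text"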

rgx_pattern

character. Regular expression to substitute.

rgx_replacement

character. Replacement for text matched by rgx_pattern.

rgx_unnest

character. Regular expression to use in tidytext::unnest_tokens(). If not specified, it is not used.

stopwords

logical. Whether or not to remove stopwords. Default: TRUE

stopwords_lexicon

character. Name of a stop word lexicon available in the tidytext package. If not specified, all stop words available in the tidytext package are used.

rgx_ignore_custom

character. Custom regular expression to remove.

...

dots. Parameters to pass directly to tidify_to_unigrams_at().
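
As a rough illustration of what stopwords_lexicon can refer to, the sketch below inspects the stop_words data set that ships with tidytext and keeps a single lexicon. It assumes the function draws its stop words from tidytext::stop_words; that is an assumption, not something this page confirms, and the object names are illustrative only.

```r
library(dplyr)
library(tidytext)

# Lexicon names bundled with tidytext's stop_words data set
lexicons <- unique(stop_words$lexicon)

# Keep the stop words of one lexicon only (here "snowball");
# this mirrors what a stopwords_lexicon value might select
stop_words_snowball <- stop_words %>%
  filter(lexicon == "snowball")
```

Restricting to one lexicon usually removes fewer words than using all of them, so more tokens survive in the output.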

Details

Heavily influenced by Text Mining with R. Creates output column 'word'.
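
The workflow from Text Mining with R that this function follows can be sketched with tidytext directly. This is a minimal sketch under stated assumptions, not the package's actual implementation: the input data, the URL pattern, and the order of steps are all hypothetical.

```r
library(dplyr)
library(tidytext)

# Hypothetical input: one row per document, raw text in column `text`
data <- tibble(
  id = 1:2,
  text = c(
    "The quick brown fox http://example.com",
    "jumps over the lazy dog"
  )
)

unigrams <- data %>%
  # Substitute a pattern before tokenizing
  # (stands in for rgx_pattern and rgx_replacement)
  mutate(text = gsub("http[^[:space:]]+", "", text)) %>%
  # Split into one lowercased word per row, stored in column `word`
  unnest_tokens(word, text) %>%
  # Drop stop words (all lexicons in tidytext::stop_words)
  anti_join(stop_words, by = "word")
```

The result keeps the other input columns (here id) alongside the new word column, which is what makes the output convenient for counting and grouping.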

Value

data.frame. Tidy, with one unigram per row in column 'word'.

See Also

Text Mining with R. https://www.tidytextmining.com/ngrams.html. https://www.tidytextmining.com/twitter.html.


tonyelhabr/tetext documentation built on May 14, 2019, 8:03 a.m.