get_tweet_rgx_tidiers: Get helper list for tidying

Description

Returns a list of regular expressions used to 'tidify' an rtweet data.frame (i.e., with the tidify_to_ functions).

Usage

get_tweet_rgx_tidiers(rgx_unnest = "([^A-Za-z_\\d#@']|'(?![A-Za-z_\\d#@]))",
  rgx_pattern = "https://t.co/[A-Za-z\\d]+|http://[A-Za-z\\d]+|&|<|>|RT|https",
  rgx_ignore_custom = "^[0-9f][0-9a-f]+$")

Arguments

rgx_unnest

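character. Regular expression. The default matches any character that is not a letter, digit, underscore, #, @, or apostrophe, as well as any apostrophe not followed by a letter, digit, underscore, #, or @.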

rgx_pattern

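character. Regular expression. The default matches https://t.co/ short links, http:// followed by letters or digits, the characters &, <, and >, and the literal strings RT and https.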

rgx_ignore_custom

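character. Regular expression. The default matches whole strings that start with a digit or f and continue with one or more characters from 0-9 or a-f.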

Details

This function is primarily a convenience function that stores regular expressions that work well with an rtweet data.frame.
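
As a rough illustration of what the default patterns do, the sketch below applies them to a single made-up tweet using base R only. The order of the steps (remove, split, filter) and the role given to each pattern are assumptions based on the argument names and on the approach described in the See Also link. It is not the package's own implementation and does not call get_tweet_rgx_tidiers() itself.

```r
# Default patterns copied from the Usage section.
rgx_unnest <- "([^A-Za-z_\\d#@']|'(?![A-Za-z_\\d#@]))"
rgx_pattern <- "https://t.co/[A-Za-z\\d]+|http://[A-Za-z\\d]+|&|<|>|RT|https"
rgx_ignore_custom <- "^[0-9f][0-9a-f]+$"

# Hypothetical tweet text, for illustration only.
tweet <- "RT @user: Don't miss #rstats https://t.co/abc123 1f600"

# Step 1 (assumed role of rgx_pattern): delete everything it matches.
cleaned <- gsub(rgx_pattern, "", tweet, perl = TRUE)

# Step 2 (assumed role of rgx_unnest): split the text wherever it matches.
# perl = TRUE is needed because the pattern uses a lookahead.
tokens <- strsplit(cleaned, rgx_unnest, perl = TRUE)[[1]]
tokens <- tolower(tokens[nzchar(tokens)])  # drop empty pieces, lowercase

# Step 3 (assumed role of rgx_ignore_custom): discard tokens it matches.
tokens <- tokens[!grepl(rgx_ignore_custom, tokens, perl = TRUE)]

print(tokens)
```

Under these assumptions, mentions, hashtags, and contractions such as "don't" survive as single tokens, while the link, the RT marker, and the hex-like token 1f600 are dropped.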

Value

list

See Also

https://www.tidytextmining.com/twitter.html.


tonyelhabr/tetext documentation built on May 14, 2019, 8:03 a.m.