tweet_replace_patterns: Regex patterns of special tokens in tweet texts


Description

A named character vector mapping replacement placeholders to regex patterns that match special tokens in tweets: mentions (@...), hashtags (#...), email addresses, URLs, and common emojis.

Usage

tweet_replace_patterns

Format

A named character vector with five elements:

"@_": regex pattern capturing Twitter user mentions
"#_": regex pattern capturing Twitter hashtags
"M_": regex pattern capturing email addresses (see https://emailregex.com/)
"U_": regex pattern capturing URLs (see http://daringfireball.net/2010/07/improved_regex_for_matching_urls)
"E_": regex pattern capturing common emojis (those listed at http://www.unicode.org/emoji/charts/full-emoji-list.html#smileys_&_emotion)
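
A vector of this shape (placeholders as names, patterns as values) lends itself to a simple replace loop. The sketch below illustrates the idea in base R only. `demo_patterns` and `replace_tokens()` are hypothetical names introduced here, and the regexes are deliberately simplified stand-ins, not the patterns shipped in `tweet_replace_patterns`; the emoji element is left out to keep the example encoding-independent.

```r
# Hypothetical stand-in with the same naming scheme as tweet_replace_patterns.
# These simplified regexes are assumptions for illustration only.
demo_patterns <- c(
  "@_" = "@\\w+",                                             # user mentions
  "#_" = "#\\w+",                                             # hashtags
  "M_" = "[[:alnum:]._%+-]+@[[:alnum:].-]+\\.[[:alpha:]]{2,}", # email addresses
  "U_" = "https?://[^[:space:]]+"                             # URLs
)

# Replace every match of each pattern with the pattern's name (the placeholder).
# `order` controls which placeholder is applied first.
replace_tokens <- function(text, patterns, order = names(patterns)) {
  for (placeholder in order) {
    text <- gsub(patterns[[placeholder]], placeholder, text, perl = TRUE)
  }
  text
}

tweet <- "Thanks @jane_doe! Details: https://example.com/post #rstats, or mail me@example.org"

# URLs and email addresses go first so that the simplified mention and
# hashtag patterns cannot match fragments of them.
cleaned <- replace_tokens(tweet, demo_patterns, order = c("U_", "M_", "@_", "#_"))
print(cleaned)
# [1] "Thanks @_! Details: U_ #_, or mail M_"
```

With these simplified patterns the order of application matters: applying "@_" before "M_" would turn `me@example.org` into `me@_.org`. Whether the package's own patterns need the same ordering is not stated in this documentation, so treat the ordering here as a property of the sketch.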

haukelicht/twscrape documentation built on Jan. 29, 2020, 3:23 p.m.