replace_emoji: Replace Emojis With Words/Identifier


View source: R/replace_emoji.R

Description

Replaces emojis with word equivalents or a token identifier for use in the sentimentr package. Note that this function coerces the text to ASCII using Encoding(x) <- "latin1"; iconv(x, "latin1", "ASCII", "byte"). replace_emoji replaces emojis with text representations, while replace_emoji_identifier replaces them with a unique identifier that corresponds to lexicon::hash_sentiment_emoji for use in the sentimentr package.
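The ASCII coercion mentioned above can be tried on its own with base R. The following is a minimal sketch, not the package's internal code: the sample string is an assumption made for illustration, and only the two calls quoted in the description are used. It shows how a UTF-8 emoji is turned into the kind of ASCII byte representation that emoji_dt is described as holding.

```r
# Hypothetical input: a short string ending in a single emoji (U+1F600).
x <- "I am happy \U0001f600"

# Re-mark the UTF-8 bytes as latin1, then convert to ASCII. The fourth
# argument of iconv() is `sub`; "byte" writes each non-ASCII byte as <xx>.
Encoding(x) <- "latin1"
ascii_x <- iconv(x, "latin1", "ASCII", "byte")

print(ascii_x)
```

After this step the text contains only ASCII characters, so any non-ASCII content in the original input (not just emojis) is also rewritten as byte codes.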

Usage

replace_emoji(x, emoji_dt = lexicon::hash_emojis, ...)

replace_emoji_identifier(x, emoji_dt = lexicon::hash_emojis_identifier, ...)

Arguments

x

The text variable.

emoji_dt

A data.table of emojis (ASCII byte representations) and corresponding word/identifier meanings.

...

Other arguments passed to .mgsub (see textclean:::.mgsub for details).

Value

Returns a vector of strings with emojis replaced with word equivalents (replace_emoji) or identifiers (replace_emoji_identifier).

Examples

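# Locate the sample emoji text file shipped with textclean
fls <- system.file("docs/emoji_sample.txt", package = "textclean")
# Read only the first line of the sample
x <- readLines(fls)[1]
# Replace emojis with word equivalents
replace_emoji(x)
# Replace emojis with identifiers for use in sentimentr
replace_emoji_identifier(x)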

trinker/textmod documentation built on July 26, 2018, 6:33 a.m.