limpiar_stopwords: Clean stop words for visualisations
In jpcompartir/LimpiaR: LimpiaR

limpiar_stopwords

R Documentation

Clean stop words for visualisations

Description

The two lists - sentiment & topics, are very similar, in that most words are in both lists. However, sentiment analysis is sensitive to negation, so negation cues e.g. "no", "nada" etc. are not removed by the sentiment list. For most purposes, topics are the go-to lists, but care is always advised when removing stop words.

Usage

limpiar_stopwords(data, text_var = mention_content, stop_words)

Arguments

`data`	Name of your Data Frame or Tibble object
`text_var`	Name of your text variable. Can be given as a 'string' or a symbol - should refer to a column inside `data`
`stop_words`	"sentiment" or "topics" - sentiment retains negation cues

Details

stop word list is editable via data("sentiment_stops") or data("topic_stops").

Value

the text variable with stop words from specified list removed

Examples

limpiar_examples %>% dplyr::select(mention_content)

limpiar_examples %>% limpiar_stopwords(stop_words = "topics") %>%
dplyr::select(mention_content) %>% limpiar_spaces()

jpcompartir/LimpiaR documentation built on Dec. 9, 2024, 9:43 p.m.