limpiar_repeat_chars: Clean repeated charaaaacters

View source: R/limpiar_repeat_chars.R

limpiar_repeat_charsR Documentation

Clean repeated charaaaacters

Description

Removes multiple vowels (holaaaa) and normalises common laughing patterns (jajaja, jejeje, ajajaaaaja). Useful for visualisations, and reducing the overall number of tokens present in the text variable.

Usage

limpiar_repeat_chars(df, text_var = mention_content)

Arguments

df

Name of the Data Frame or Tibble object

text_var

Name of the text variable/character vector. Default is mention_content

Value

Data Frame or Tibble object with most repeat vowels & laughing patterns removed from the text variable

Examples

limpiar_examples %>% dplyr::select(mention_content)

limpiar_examples %>% limpiar_repeat_chars() %>% dplyr::select(mention_content)



jpcompartir/LimpiaR documentation built on April 6, 2024, 5:22 a.m.