View source: R/limpiar_non_ascii.R
limpiar_non_ascii | R Documentation |
Function uses a simple RegEx to retain only basic ASCII characters plus attempts to retain characters with latin accents. If you know that you want to remove everything including accented characters then you should use limpiar_alphanumeric
.
limpiar_non_ascii(data, text_var = mention_content)
data |
Name of your Data Frame or Tibble object |
text_var |
Name of your text variable. Can be given as a 'string' or a symbol - should refer to a column inside |
Data frame with the text variable changed in place
test_df <- data.frame(
text = c(
"Simple text 123", # Basic ASCII only
"Hello! How are you? 😊 🌟", # ASCII + punctuation + emojis
"café München niño", # Latin-1 accented characters
"#special@chars&(~)|[$]", # Special characters and symbols
"混合汉字と日本語 → ⌘ £€¥" # CJK characters + symbols + arrows
)
)
limpiar_non_ascii(test_df, text)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.