txt_clean_nonascii: Functions to clean character strings

View source: R/text_cleaning_functions.R

txt_clean_nonasciiR Documentation

Functions to clean character strings

Description

'txt_clean_nonascii' removes non-standard characters from text, but keeps space, -, and . by default, using regex matching. Default matching can be overridden by 'regex' option.

Usage

txt_clean_nonascii(string, regex = "[^[:alnum:][:blank:]+?&/\\-\\.]")

Arguments

string

character string, or vector of character strings which will be cleaned

regex

defaults to remove all non-ascii text characters, but keeping a few

Value

A character string or vector, now stripped of any non-matching values

Examples

test_strings <- c("\"The Erlenmeyer Flask\"‡", "\"My Struggle\"‡", "\"Founder's Mutation\"", "\"Mulder & Scully Meet the Were-Monster\"",
"\"Home Again\"", "\"Babylon\"", "\"My Struggle II\"‡" )
test_strings
txt_clean_nonascii(test_strings)

JMLuther/tabletools documentation built on July 1, 2024, 2:01 p.m.