View source: R/stringPreprocessing.R
stringPreprocessing | R Documentation |
Function replaces some common characters / character sequences (e.g., Ä, Ü, "DIPL.-ING.") with their uppercase equivalents and removes punctuation, empty spaces and the word "Diplom".
stringPreprocessing(verbatim, lang = "de")
verbatim |
a character vector. |
lang |
(default de) Everything else will throw an error. |
charToRaw
helps to find UTF-8 characters.
the same character vector after processing
(x <- c("Verkauf von B\xfcchern, Schreibwaren", "Fach\xe4rzin f\xfcr Kinder- und Jugendmedizin im \xf6ffentlichen Gesundheitswesen", "Industriemechaniker", "Dipl.-Ing. - Agrarwirtschaft (Landwirtschaft)"))
stringPreprocessing(x)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.