removePunctuation | R Documentation |
Remove punctuation marks from a text document.
## S3 method for class 'character' removePunctuation(x, preserve_intra_word_contractions = FALSE, preserve_intra_word_dashes = FALSE, ucp = FALSE, ...) ## S3 method for class 'PlainTextDocument' removePunctuation(x, ...)
x |
a character vector or text document. |
preserve_intra_word_contractions |
a logical specifying whether intra-word contractions should be kept. |
preserve_intra_word_dashes |
a logical specifying whether intra-word dashes should be kept. |
ucp |
a logical specifying whether to use Unicode character
properties for determining punctuation characters. If |
... |
arguments to be passed to or from methods;
in particular, from the |
The character or text document x
without punctuation marks
(besides intra-word contractions (') and intra-word dashes
(-) if preserve_intra_word_contractions
and
preserve_intra_word_dashes
are set, respectively).
getTransformations
to list available transformation
(mapping) functions.
regex
shows the class [:punct:]
of punctuation
characters.
https://unicode.org/reports/tr44/#General_Category_Values.
data("crude") inspect(crude[[14]]) inspect(removePunctuation(crude[[14]])) inspect(removePunctuation(crude[[14]], preserve_intra_word_contractions = TRUE, preserve_intra_word_dashes = TRUE))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.