removePunctuation | R Documentation |
Remove punctuation marks from a text document.
## S3 method for class 'character'
removePunctuation(x,
preserve_intra_word_contractions = FALSE,
preserve_intra_word_dashes = FALSE,
ucp = FALSE, ...)
## S3 method for class 'PlainTextDocument'
removePunctuation(x, ...)
x |
a character vector or text document. |
preserve_intra_word_contractions |
a logical specifying whether intra-word contractions should be kept. |
preserve_intra_word_dashes |
a logical specifying whether intra-word dashes should be kept. |
ucp |
a logical specifying whether to use Unicode character
properties for determining punctuation characters. If |
... |
arguments to be passed to or from methods;
in particular, from the |
The character or text document x
without punctuation marks
(besides intra-word contractions (‘'’) and intra-word dashes
(‘-’) if preserve_intra_word_contractions
and
preserve_intra_word_dashes
are set, respectively).
getTransformations
to list available transformation
(mapping) functions.
regex
shows the class [:punct:]
of punctuation
characters.
https://unicode.org/reports/tr44/#General_Category_Values.
data("crude")
inspect(crude[[14]])
inspect(removePunctuation(crude[[14]]))
inspect(removePunctuation(crude[[14]],
preserve_intra_word_contractions = TRUE,
preserve_intra_word_dashes = TRUE))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.