removePunctuation: Remove Punctuation Marks from a Text Document

Description Usage Arguments Value See Also Examples

View source: R/transform.R

Description

Remove punctuation marks from a text document.

Usage

1
2
3
4
5
6
7
## S3 method for class 'character'
removePunctuation(x,
                  preserve_intra_word_contractions = FALSE,
                  preserve_intra_word_dashes = FALSE,
                  ucp = FALSE, ...)
## S3 method for class 'PlainTextDocument'
removePunctuation(x, ...)

Arguments

x

a character vector or text document.

preserve_intra_word_contractions

a logical specifying whether intra-word contractions should be kept.

preserve_intra_word_dashes

a logical specifying whether intra-word dashes should be kept.

ucp

a logical specifying whether to use Unicode character properties for determining punctuation characters. If FALSE (default), characters in the ASCII [:punct:] class are taken; if TRUE, the characters with Unicode general category P (Punctuation).

...

arguments to be passed to or from methods; in particular, from the PlainTextDocument method to the character method.

Value

The character or text document x without punctuation marks (besides intra-word contractions (') and intra-word dashes (-) if preserve_intra_word_contractions and preserve_intra_word_dashes are set, respectively).

See Also

getTransformations to list available transformation (mapping) functions.

regex shows the class [:punct:] of punctuation characters.

https://unicode.org/reports/tr44/#General_Category_Values.

Examples

1
2
3
4
5
6
data("crude")
inspect(crude[[14]])
inspect(removePunctuation(crude[[14]]))
inspect(removePunctuation(crude[[14]],
                          preserve_intra_word_contractions = TRUE,
                          preserve_intra_word_dashes = TRUE))

Example output

Loading required package: NLP
<<PlainTextDocument>>
Metadata:  15
Content:  chars: 591

Kuwait's oil minister said in a newspaper
interview that there were no plans for an emergency OPEC
meeting after the recent weakness in world oil prices.
    Sheikh Ali al-Khalifa al-Sabah was quoted by the local
daily al-Qabas as saying that "none of the OPEC members has
asked for such a meeting."
    He also denied that Kuwait was pumping above its OPEC quota
of 948,000 barrels of crude daily (bpd).
    Crude oil prices fell sharply last week as international
oil traders and analysts estimated the 13-nation OPEC was
pumping up to one million bpd over its self-imposed limits.
 Reuter
<<PlainTextDocument>>
Metadata:  15
Content:  chars: 576

Kuwaits oil minister said in a newspaper
interview that there were no plans for an emergency OPEC
meeting after the recent weakness in world oil prices
    Sheikh Ali alKhalifa alSabah was quoted by the local
daily alQabas as saying that none of the OPEC members has
asked for such a meeting
    He also denied that Kuwait was pumping above its OPEC quota
of 948000 barrels of crude daily bpd
    Crude oil prices fell sharply last week as international
oil traders and analysts estimated the 13nation OPEC was
pumping up to one million bpd over its selfimposed limits
 Reuter
<<PlainTextDocument>>
Metadata:  15
Content:  chars: 582

Kuwait's oil minister said in a newspaper
interview that there were no plans for an emergency OPEC
meeting after the recent weakness in world oil prices
    Sheikh Ali al-Khalifa al-Sabah was quoted by the local
daily al-Qabas as saying that none of the OPEC members has
asked for such a meeting
    He also denied that Kuwait was pumping above its OPEC quota
of 948000 barrels of crude daily bpd
    Crude oil prices fell sharply last week as international
oil traders and analysts estimated the 13-nation OPEC was
pumping up to one million bpd over its self-imposed limits
 Reuter

tm documentation built on April 7, 2021, 3:01 a.m.