strip: Strip Text

Description Usage Arguments Value See Also Examples

View source: R/strip.R

Description

Strip text of unwanted characters.

strip.character - factor method for strip.

strip.factor - factor method for strip.

strip.default - factor method for strip.

strip.list - factor method for strip.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
strip(x, char.keep = "~~", digit.remove = TRUE, apostrophe.remove = TRUE,
  lower.case = TRUE)

## S3 method for class 'character'
strip(x, char.keep = "~~", digit.remove = TRUE,
  apostrophe.remove = TRUE, lower.case = TRUE)

## S3 method for class 'factor'
strip(x, char.keep = "~~", digit.remove = TRUE,
  apostrophe.remove = TRUE, lower.case = TRUE)

## Default S3 method:
strip(x, char.keep = "~~", digit.remove = TRUE,
  apostrophe.remove = TRUE, lower.case = TRUE)

## S3 method for class 'list'
strip(x, char.keep = "~~", digit.remove = TRUE,
  apostrophe.remove = TRUE, lower.case = TRUE)

Arguments

x

The text variable.

char.keep

A character vector of symbols (i.e., punctuation) that strip should keep. The default is to strip every symbol except apostrophes and a double tilde "~~". The double tilde "~~" is included for a convenient means of keeping word groups together in functions that split text apart based on spaces. To remove double tildes "~~" set char.keep to NULL.

digit.remove

logical. If TRUE strips digits from the text.

apostrophe.remove

logical. If TRUE removes apostrophes from the output.

lower.case

logical. If TRUE forces all alpha characters to lower case.

Value

Returns a vector of text that has been stripped of unwanted characters.

See Also

rm_stopwords

Examples

1
2
3
4
5
6
7
## Not run: 
DATA$state #no strip applied
strip(DATA$state)
strip(DATA$state, apostrophe.remove=FALSE)
strip(DATA$state, char.keep = c("?", "."))

## End(Not run)

Example output

Loading required package: qdapDictionaries
Loading required package: qdapRegex
Loading required package: qdapTools
Loading required package: RColorBrewer
OpenJDK 64-Bit Server VM warning: Can't detect initial thread stack location - find_vma failed

Attaching package: 'qdap'

The following object is masked from 'package:base':

    Filter

 [1] "Computer is fun. Not too fun."        
 [2] "No it's not, it's dumb."              
 [3] "What should we do?"                   
 [4] "You liar, it stinks!"                 
 [5] "I am telling the truth!"              
 [6] "How can we be certain?"               
 [7] "There is no way."                     
 [8] "I distrust you."                      
 [9] "What are you talking about?"          
[10] "Shall we move on?  Good then."        
[11] "I'm hungry.  Let's eat.  You already?"
 [1] "computer is fun not too fun"    "no its not its dumb"           
 [3] "what should we do"              "you liar it stinks"            
 [5] "i am telling the truth"         "how can we be certain"         
 [7] "there is no way"                "i distrust you"                
 [9] "what are you talking about"     "shall we move on good then"    
[11] "im hungry lets eat you already"
 [1] "computer is fun not too fun"      "no it's not it's dumb"           
 [3] "what should we do"                "you liar it stinks"              
 [5] "i am telling the truth"           "how can we be certain"           
 [7] "there is no way"                  "i distrust you"                  
 [9] "what are you talking about"       "shall we move on good then"      
[11] "i'm hungry let's eat you already"
 [1] "computer is fun. not too fun."     "no its not its dumb."             
 [3] "what should we do?"                "you liar it stinks"               
 [5] "i am telling the truth"            "how can we be certain?"           
 [7] "there is no way."                  "i distrust you."                  
 [9] "what are you talking about?"       "shall we move on? good then."     
[11] "im hungry. lets eat. you already?"
Warning message:
system call failed: Cannot allocate memory 

qdap documentation built on Nov. 20, 2017, 5:09 p.m.