stopwords: Collection of stopwords in multiple languages

Description Usage Arguments Details Value Examples

View source: R/stopwords.R

Description

This function returns character vectors of stopwords for different languages, using the ISO-639-1 language codes, and allows for different sources of stopwords to be defined.

The default source is the Snowball() stopwords collection but other() sources are also available.

Usage

1
stopwords(language = "en", source = "snowball", simplify = TRUE)

Arguments

language

specify language of stopwords by ISO 639-1 code

source

specify a stopwords source. To list the currently available options, use stopwords_getsources().

simplify

logical; if TRUE return a simple vector, if FALSE return a list if the original word list was nested

Details

The language codes for each stopword list use the two-letter ISO code from https://en.wikipedia.org/wiki/List_of_ISO_639-1_codes. For backwards compatibility, the full English names of the stopwords from the quanteda package may also be used, although these are deprecated.

Value

a character vector containing the stopwords, or a list of characters simplify = FALSE

Examples

1
2
stopwords("en")
stopwords("de")

Example output

  [1] "i"          "me"         "my"         "myself"     "we"        
  [6] "our"        "ours"       "ourselves"  "you"        "your"      
 [11] "yours"      "yourself"   "yourselves" "he"         "him"       
 [16] "his"        "himself"    "she"        "her"        "hers"      
 [21] "herself"    "it"         "its"        "itself"     "they"      
 [26] "them"       "their"      "theirs"     "themselves" "what"      
 [31] "which"      "who"        "whom"       "this"       "that"      
 [36] "these"      "those"      "am"         "is"         "are"       
 [41] "was"        "were"       "be"         "been"       "being"     
 [46] "have"       "has"        "had"        "having"     "do"        
 [51] "does"       "did"        "doing"      "would"      "should"    
 [56] "could"      "ought"      "i'm"        "you're"     "he's"      
 [61] "she's"      "it's"       "we're"      "they're"    "i've"      
 [66] "you've"     "we've"      "they've"    "i'd"        "you'd"     
 [71] "he'd"       "she'd"      "we'd"       "they'd"     "i'll"      
 [76] "you'll"     "he'll"      "she'll"     "we'll"      "they'll"   
 [81] "isn't"      "aren't"     "wasn't"     "weren't"    "hasn't"    
 [86] "haven't"    "hadn't"     "doesn't"    "don't"      "didn't"    
 [91] "won't"      "wouldn't"   "shan't"     "shouldn't"  "can't"     
 [96] "cannot"     "couldn't"   "mustn't"    "let's"      "that's"    
[101] "who's"      "what's"     "here's"     "there's"    "when's"    
[106] "where's"    "why's"      "how's"      "a"          "an"        
[111] "the"        "and"        "but"        "if"         "or"        
[116] "because"    "as"         "until"      "while"      "of"        
[121] "at"         "by"         "for"        "with"       "about"     
[126] "against"    "between"    "into"       "through"    "during"    
[131] "before"     "after"      "above"      "below"      "to"        
[136] "from"       "up"         "down"       "in"         "out"       
[141] "on"         "off"        "over"       "under"      "again"     
[146] "further"    "then"       "once"       "here"       "there"     
[151] "when"       "where"      "why"        "how"        "all"       
[156] "any"        "both"       "each"       "few"        "more"      
[161] "most"       "other"      "some"       "such"       "no"        
[166] "nor"        "not"        "only"       "own"        "same"      
[171] "so"         "than"       "too"        "very"       "will"      
  [1] "aber"         "alle"         "allem"        "allen"        "aller"       
  [6] "alles"        "als"          "also"         "am"           "an"          
 [11] "ander"        "andere"       "anderem"      "anderen"      "anderer"     
 [16] "anderes"      "anderm"       "andern"       "anderr"       "anders"      
 [21] "auch"         "auf"          "aus"          "bei"          "bin"         
 [26] "bis"          "bist"         "da"           "damit"        "dann"        
 [31] "der"          "den"          "des"          "dem"          "die"         
 [36] "das"          "da<U+00DF>"   "derselbe"     "derselben"    "denselben"   
 [41] "desselben"    "demselben"    "dieselbe"     "dieselben"    "dasselbe"    
 [46] "dazu"         "dein"         "deine"        "deinem"       "deinen"      
 [51] "deiner"       "deines"       "denn"         "derer"        "dessen"      
 [56] "dich"         "dir"          "du"           "dies"         "diese"       
 [61] "diesem"       "diesen"       "dieser"       "dieses"       "doch"        
 [66] "dort"         "durch"        "ein"          "eine"         "einem"       
 [71] "einen"        "einer"        "eines"        "einig"        "einige"      
 [76] "einigem"      "einigen"      "einiger"      "einiges"      "einmal"      
 [81] "er"           "ihn"          "ihm"          "es"           "etwas"       
 [86] "euer"         "eure"         "eurem"        "euren"        "eurer"       
 [91] "eures"        "f<U+00FC>r"   "gegen"        "gewesen"      "hab"         
 [96] "habe"         "haben"        "hat"          "hatte"        "hatten"      
[101] "hier"         "hin"          "hinter"       "ich"          "mich"        
[106] "mir"          "ihr"          "ihre"         "ihrem"        "ihren"       
[111] "ihrer"        "ihres"        "euch"         "im"           "in"          
[116] "indem"        "ins"          "ist"          "jede"         "jedem"       
[121] "jeden"        "jeder"        "jedes"        "jene"         "jenem"       
[126] "jenen"        "jener"        "jenes"        "jetzt"        "kann"        
[131] "kein"         "keine"        "keinem"       "keinen"       "keiner"      
[136] "keines"       "k<U+00F6>nnen" "k<U+00F6>nnte" "machen"       "man"         
[141] "manche"       "manchem"      "manchen"      "mancher"      "manches"     
[146] "mein"         "meine"        "meinem"       "meinen"       "meiner"      
[151] "meines"       "mit"          "muss"         "musste"       "nach"        
[156] "nicht"        "nichts"       "noch"         "nun"          "nur"         
[161] "ob"           "oder"         "ohne"         "sehr"         "sein"        
[166] "seine"        "seinem"       "seinen"       "seiner"       "seines"      
[171] "selbst"       "sich"         "sie"          "ihnen"        "sind"        
[176] "so"           "solche"       "solchem"      "solchen"      "solcher"     
[181] "solches"      "soll"         "sollte"       "sondern"      "sonst"       
[186] "<U+00FC>ber"  "um"           "und"          "uns"          "unse"        
[191] "unsem"        "unsen"        "unser"        "unses"        "unter"       
[196] "viel"         "vom"          "von"          "vor"          "w<U+00E4>hrend"
[201] "war"          "waren"        "warst"        "was"          "weg"         
[206] "weil"         "weiter"       "welche"       "welchem"      "welchen"     
[211] "welcher"      "welches"      "wenn"         "werde"        "werden"      
[216] "wie"          "wieder"       "will"         "wir"          "wird"        
[221] "wirst"        "wo"           "wollen"       "wollte"       "w<U+00FC>rde"
[226] "w<U+00FC>rden" "zu"           "zum"          "zur"          "zwar"        
[231] "zwischen"    

stopwords documentation built on Oct. 28, 2021, 5:10 p.m.