stopwords: Stopwords

Description

Return various kinds of stopwords with support for different languages.

Usage

stopwords(kind = "en")

Arguments

kind

A character string identifying the desired stopword list.

Details

Available stopword lists are:

catalan

Catalan stopwords (obtained from http://latel.upf.edu/morgana/altres/pub/ca_stop.htm).

romanian

Romanian stopwords (extracted from http://snowball.tartarus.org/otherapps/romanian/romanian1.tgz).

SMART

English stopwords from the SMART information retrieval system (as documented in Appendix 11 of https://jmlr.csail.mit.edu/papers/volume5/lewis04a/). This list coincides with the stopword list used by the MC toolkit (https://www.cs.utexas.edu/users/dml/software/mc/).

In addition, a set of stopword lists from the Snowball stemmer project is available for different languages (obtained from http://svn.tartarus.org/snowball/trunk/website/algorithms/*/stop.txt). Supported languages are danish, dutch, english, finnish, french, german, hungarian, italian, norwegian, portuguese, russian, spanish, and swedish. Language names are case sensitive. Alternatively, their IETF language tags may be used (for example, "en" for english).
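As a brief illustration of the naming rules above, the following sketch requests the same Snowball list once by language name and once by IETF language tag. It assumes the tm package is installed and that "en" resolves to the english Snowball list, as the default kind = "en" and the example output suggest.

```r
library(tm)

# Request the same Snowball list by language name and by IETF language tag.
by_name <- stopwords("english")
by_tag  <- stopwords("en")

# Both calls are expected to return the same character vector.
same_list <- identical(by_name, by_tag)
print(same_list)

# SMART is a separate English list; compare its size with the Snowball one.
smart <- stopwords("SMART")
print(length(smart) > length(by_name))
```

Since both lists are plain character vectors, they can be compared or combined with the usual vector tools such as identical(), union(), and setdiff().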

Value

A character vector containing the requested stopwords. An error is raised if no stopwords are available for the requested kind.
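Because an unavailable kind raises an error rather than returning an empty result, callers that accept user-supplied kinds may want to catch it. The sketch below is one possible approach, not part of tm: safe_stopwords is a hypothetical helper, and "klingon" is assumed to be a kind for which tm ships no list.

```r
library(tm)

# Hypothetical helper: return the stopword list for `kind`, or an
# empty character vector if stopwords() raises an error for it.
safe_stopwords <- function(kind) {
  tryCatch(
    stopwords(kind),
    error = function(e) {
      message("No stopwords for kind '", kind, "': ", conditionMessage(e))
      character(0)
    }
  )
}

german  <- safe_stopwords("german")   # a supported Snowball language
missing <- safe_stopwords("klingon")  # assumed unsupported; falls back to character(0)
```

Returning character(0) keeps the result type the same in both cases, so downstream code can test length() instead of handling a condition.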

Examples

stopwords("en")
stopwords("SMART")
stopwords("german")

Example output

Loading required package: NLP
  [1] "i"          "me"         "my"         "myself"     "we"        
  [6] "our"        "ours"       "ourselves"  "you"        "your"      
 [11] "yours"      "yourself"   "yourselves" "he"         "him"       
 [16] "his"        "himself"    "she"        "her"        "hers"      
 [21] "herself"    "it"         "its"        "itself"     "they"      
 [26] "them"       "their"      "theirs"     "themselves" "what"      
 [31] "which"      "who"        "whom"       "this"       "that"      
 [36] "these"      "those"      "am"         "is"         "are"       
 [41] "was"        "were"       "be"         "been"       "being"     
 [46] "have"       "has"        "had"        "having"     "do"        
 [51] "does"       "did"        "doing"      "would"      "should"    
 [56] "could"      "ought"      "i'm"        "you're"     "he's"      
 [61] "she's"      "it's"       "we're"      "they're"    "i've"      
 [66] "you've"     "we've"      "they've"    "i'd"        "you'd"     
 [71] "he'd"       "she'd"      "we'd"       "they'd"     "i'll"      
 [76] "you'll"     "he'll"      "she'll"     "we'll"      "they'll"   
 [81] "isn't"      "aren't"     "wasn't"     "weren't"    "hasn't"    
 [86] "haven't"    "hadn't"     "doesn't"    "don't"      "didn't"    
 [91] "won't"      "wouldn't"   "shan't"     "shouldn't"  "can't"     
 [96] "cannot"     "couldn't"   "mustn't"    "let's"      "that's"    
[101] "who's"      "what's"     "here's"     "there's"    "when's"    
[106] "where's"    "why's"      "how's"      "a"          "an"        
[111] "the"        "and"        "but"        "if"         "or"        
[116] "because"    "as"         "until"      "while"      "of"        
[121] "at"         "by"         "for"        "with"       "about"     
[126] "against"    "between"    "into"       "through"    "during"    
[131] "before"     "after"      "above"      "below"      "to"        
[136] "from"       "up"         "down"       "in"         "out"       
[141] "on"         "off"        "over"       "under"      "again"     
[146] "further"    "then"       "once"       "here"       "there"     
[151] "when"       "where"      "why"        "how"        "all"       
[156] "any"        "both"       "each"       "few"        "more"      
[161] "most"       "other"      "some"       "such"       "no"        
[166] "nor"        "not"        "only"       "own"        "same"      
[171] "so"         "than"       "too"        "very"      
  [1] "a"             "a's"           "able"          "about"        
  [5] "above"         "according"     "accordingly"   "across"       
  [9] "actually"      "after"         "afterwards"    "again"        
 [13] "against"       "ain't"         "all"           "allow"        
 [17] "allows"        "almost"        "alone"         "along"        
 [21] "already"       "also"          "although"      "always"       
 [25] "am"            "among"         "amongst"       "an"           
 [29] "and"           "another"       "any"           "anybody"      
 [33] "anyhow"        "anyone"        "anything"      "anyway"       
 [37] "anyways"       "anywhere"      "apart"         "appear"       
 [41] "appreciate"    "appropriate"   "are"           "aren't"       
 [45] "around"        "as"            "aside"         "ask"          
 [49] "asking"        "associated"    "at"            "available"    
 [53] "away"          "awfully"       "b"             "be"           
 [57] "became"        "because"       "become"        "becomes"      
 [61] "becoming"      "been"          "before"        "beforehand"   
 [65] "behind"        "being"         "believe"       "below"        
 [69] "beside"        "besides"       "best"          "better"       
 [73] "between"       "beyond"        "both"          "brief"        
 [77] "but"           "by"            "c"             "c'mon"        
 [81] "c's"           "came"          "can"           "can't"        
 [85] "cannot"        "cant"          "cause"         "causes"       
 [89] "certain"       "certainly"     "changes"       "clearly"      
 [93] "co"            "com"           "come"          "comes"        
 [97] "concerning"    "consequently"  "consider"      "considering"  
[101] "contain"       "containing"    "contains"      "corresponding"
[105] "could"         "couldn't"      "course"        "currently"    
[109] "d"             "definitely"    "described"     "despite"      
[113] "did"           "didn't"        "different"     "do"           
[117] "does"          "doesn't"       "doing"         "don't"        
[121] "done"          "down"          "downwards"     "during"       
[125] "e"             "each"          "edu"           "eg"           
[129] "eight"         "either"        "else"          "elsewhere"    
[133] "enough"        "entirely"      "especially"    "et"           
[137] "etc"           "even"          "ever"          "every"        
[141] "everybody"     "everyone"      "everything"    "everywhere"   
[145] "ex"            "exactly"       "example"       "except"       
[149] "f"             "far"           "few"           "fifth"        
[153] "first"         "five"          "followed"      "following"    
[157] "follows"       "for"           "former"        "formerly"     
[161] "forth"         "four"          "from"          "further"      
[165] "furthermore"   "g"             "get"           "gets"         
[169] "getting"       "given"         "gives"         "go"           
[173] "goes"          "going"         "gone"          "got"          
[177] "gotten"        "greetings"     "h"             "had"          
[181] "hadn't"        "happens"       "hardly"        "has"          
[185] "hasn't"        "have"          "haven't"       "having"       
[189] "he"            "he's"          "hello"         "help"         
[193] "hence"         "her"           "here"          "here's"       
[197] "hereafter"     "hereby"        "herein"        "hereupon"     
[201] "hers"          "herself"       "hi"            "him"          
[205] "himself"       "his"           "hither"        "hopefully"    
[209] "how"           "howbeit"       "however"       "i"            
[213] "i'd"           "i'll"          "i'm"           "i've"         
[217] "ie"            "if"            "ignored"       "immediate"    
[221] "in"            "inasmuch"      "inc"           "indeed"       
[225] "indicate"      "indicated"     "indicates"     "inner"        
[229] "insofar"       "instead"       "into"          "inward"       
[233] "is"            "isn't"         "it"            "it'd"         
[237] "it'll"         "it's"          "its"           "itself"       
[241] "j"             "just"          "k"             "keep"         
[245] "keeps"         "kept"          "know"          "knows"        
[249] "known"         "l"             "last"          "lately"       
[253] "later"         "latter"        "latterly"      "least"        
[257] "less"          "lest"          "let"           "let's"        
[261] "like"          "liked"         "likely"        "little"       
[265] "look"          "looking"       "looks"         "ltd"          
[269] "m"             "mainly"        "many"          "may"          
[273] "maybe"         "me"            "mean"          "meanwhile"    
[277] "merely"        "might"         "more"          "moreover"     
[281] "most"          "mostly"        "much"          "must"         
[285] "my"            "myself"        "n"             "name"         
[289] "namely"        "nd"            "near"          "nearly"       
[293] "necessary"     "need"          "needs"         "neither"      
[297] "never"         "nevertheless"  "new"           "next"         
[301] "nine"          "no"            "nobody"        "non"          
[305] "none"          "noone"         "nor"           "normally"     
[309] "not"           "nothing"       "novel"         "now"          
[313] "nowhere"       "o"             "obviously"     "of"           
[317] "off"           "often"         "oh"            "ok"           
[321] "okay"          "old"           "on"            "once"         
[325] "one"           "ones"          "only"          "onto"         
[329] "or"            "other"         "others"        "otherwise"    
[333] "ought"         "our"           "ours"          "ourselves"    
[337] "out"           "outside"       "over"          "overall"      
[341] "own"           "p"             "particular"    "particularly" 
[345] "per"           "perhaps"       "placed"        "please"       
[349] "plus"          "possible"      "presumably"    "probably"     
[353] "provides"      "q"             "que"           "quite"        
[357] "qv"            "r"             "rather"        "rd"           
[361] "re"            "really"        "reasonably"    "regarding"    
[365] "regardless"    "regards"       "relatively"    "respectively" 
[369] "right"         "s"             "said"          "same"         
[373] "saw"           "say"           "saying"        "says"         
[377] "second"        "secondly"      "see"           "seeing"       
[381] "seem"          "seemed"        "seeming"       "seems"        
[385] "seen"          "self"          "selves"        "sensible"     
[389] "sent"          "serious"       "seriously"     "seven"        
[393] "several"       "shall"         "she"           "should"       
[397] "shouldn't"     "since"         "six"           "so"           
[401] "some"          "somebody"      "somehow"       "someone"      
[405] "something"     "sometime"      "sometimes"     "somewhat"     
[409] "somewhere"     "soon"          "sorry"         "specified"    
[413] "specify"       "specifying"    "still"         "sub"          
[417] "such"          "sup"           "sure"          "t"            
[421] "t's"           "take"          "taken"         "tell"         
[425] "tends"         "th"            "than"          "thank"        
[429] "thanks"        "thanx"         "that"          "that's"       
[433] "thats"         "the"           "their"         "theirs"       
[437] "them"          "themselves"    "then"          "thence"       
[441] "there"         "there's"       "thereafter"    "thereby"      
[445] "therefore"     "therein"       "theres"        "thereupon"    
[449] "these"         "they"          "they'd"        "they'll"      
[453] "they're"       "they've"       "think"         "third"        
[457] "this"          "thorough"      "thoroughly"    "those"        
[461] "though"        "three"         "through"       "throughout"   
[465] "thru"          "thus"          "to"            "together"     
[469] "too"           "took"          "toward"        "towards"      
[473] "tried"         "tries"         "truly"         "try"          
[477] "trying"        "twice"         "two"           "u"            
[481] "un"            "under"         "unfortunately" "unless"       
[485] "unlikely"      "until"         "unto"          "up"           
[489] "upon"          "us"            "use"           "used"         
[493] "useful"        "uses"          "using"         "usually"      
[497] "uucp"          "v"             "value"         "various"      
[501] "very"          "via"           "viz"           "vs"           
[505] "w"             "want"          "wants"         "was"          
[509] "wasn't"        "way"           "we"            "we'd"         
[513] "we'll"         "we're"         "we've"         "welcome"      
[517] "well"          "went"          "were"          "weren't"      
[521] "what"          "what's"        "whatever"      "when"         
[525] "whence"        "whenever"      "where"         "where's"      
[529] "whereafter"    "whereas"       "whereby"       "wherein"      
[533] "whereupon"     "wherever"      "whether"       "which"        
[537] "while"         "whither"       "who"           "who's"        
[541] "whoever"       "whole"         "whom"          "whose"        
[545] "why"           "will"          "willing"       "wish"         
[549] "with"          "within"        "without"       "won't"        
[553] "wonder"        "would"         "would"         "wouldn't"     
[557] "x"             "y"             "yes"           "yet"          
[561] "you"           "you'd"         "you'll"        "you're"       
[565] "you've"        "your"          "yours"         "yourself"     
[569] "yourselves"    "z"             "zero"         
  [1] "aber"         "alle"         "allem"        "allen"        "aller"       
  [6] "alles"        "als"          "also"         "am"           "an"          
 [11] "ander"        "andere"       "anderem"      "anderen"      "anderer"     
 [16] "anderes"      "anderm"       "andern"       "anderr"       "anders"      
 [21] "auch"         "auf"          "aus"          "bei"          "bin"         
 [26] "bis"          "bist"         "da"           "damit"        "dann"        
 [31] "der"          "den"          "des"          "dem"          "die"         
 [36] "das"          "daß"          "derselbe"     "derselben"    "denselben"   
 [41] "desselben"    "demselben"    "dieselbe"     "dieselben"    "dasselbe"    
 [46] "dazu"         "dein"         "deine"        "deinem"       "deinen"      
 [51] "deiner"       "deines"       "denn"         "derer"        "dessen"      
 [56] "dich"         "dir"          "du"           "dies"         "diese"       
 [61] "diesem"       "diesen"       "dieser"       "dieses"       "doch"        
 [66] "dort"         "durch"        "ein"          "eine"         "einem"       
 [71] "einen"        "einer"        "eines"        "einig"        "einige"      
 [76] "einigem"      "einigen"      "einiger"      "einiges"      "einmal"      
 [81] "er"           "ihn"          "ihm"          "es"           "etwas"       
 [86] "euer"         "eure"         "eurem"        "euren"        "eurer"       
 [91] "eures"        "für"          "gegen"        "gewesen"      "hab"         
 [96] "habe"         "haben"        "hat"          "hatte"        "hatten"      
[101] "hier"         "hin"          "hinter"       "ich"          "mich"        
[106] "mir"          "ihr"          "ihre"         "ihrem"        "ihren"       
[111] "ihrer"        "ihres"        "euch"         "im"           "in"          
[116] "indem"        "ins"          "ist"          "jede"         "jedem"       
[121] "jeden"        "jeder"        "jedes"        "jene"         "jenem"       
[126] "jenen"        "jener"        "jenes"        "jetzt"        "kann"        
[131] "kein"         "keine"        "keinem"       "keinen"       "keiner"      
[136] "keines"       "können"       "könnte"       "machen"       "man"         
[141] "manche"       "manchem"      "manchen"      "mancher"      "manches"     
[146] "mein"         "meine"        "meinem"       "meinen"       "meiner"      
[151] "meines"       "mit"          "muss"         "musste"       "nach"        
[156] "nicht"        "nichts"       "noch"         "nun"          "nur"         
[161] "ob"           "oder"         "ohne"         "sehr"         "sein"        
[166] "seine"        "seinem"       "seinen"       "seiner"       "seines"      
[171] "selbst"       "sich"         "sie"          "ihnen"        "sind"        
[176] "so"           "solche"       "solchem"      "solchen"      "solcher"     
[181] "solches"      "soll"         "sollte"       "sondern"      "sonst"       
[186] "über"         "um"           "und"          "uns"          "unse"        
[191] "unsem"        "unsen"        "unser"        "unses"        "unter"       
[196] "viel"         "vom"          "von"          "vor"          "während"     
[201] "war"          "waren"        "warst"        "was"          "weg"         
[206] "weil"         "weiter"       "welche"       "welchem"      "welchen"     
[211] "welcher"      "welches"      "wenn"         "werde"        "werden"      
[216] "wie"          "wieder"       "will"         "wir"          "wird"        
[221] "wirst"        "wo"           "wollen"       "wollte"       "würde"       
[226] "würden"       "zu"           "zum"          "zur"          "zwar"        
[231] "zwischen"    
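
The vectors printed above are typically used to filter tokens. The following base R sketch shows one way to do that. drop_stopwords is a hypothetical helper, not a tm function, and the short stops vector is a hand-picked subset of the "en" list used here only for illustration.

```r
# Hypothetical helper: lower-case a string, split it on whitespace,
# and drop every token that appears in `stops`.
drop_stopwords <- function(text, stops) {
  tokens <- strsplit(tolower(text), "[[:space:]]+")[[1]]
  tokens[!tokens %in% stops]
}

# A small subset of the "en" list, so the example runs without tm.
stops <- c("the", "is", "on", "a")
kept <- drop_stopwords("The cat is on a mat", stops)
print(kept)  # "cat" "mat"

# With tm installed, the full list can be passed instead:
# drop_stopwords("The cat is on a mat", tm::stopwords("en"))
```

The input is lower-cased before matching because the lists shown above contain only lower-case entries.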

tm documentation built on April 7, 2021, 3:01 a.m.