| stopwords | R Documentation |
Return various kinds of stopwords with support for different languages.
stopwords(kind = "en")
kind |
A character string identifying the desired stopword list. |
Available stopword lists are:
catalanCatalan stopwords (obtained from http://latel.upf.edu/morgana/altres/pub/ca_stop.htm),
romanianRomanian stopwords (extracted from http://snowball.tartarus.org/otherapps/romanian/romanian1.tgz),
SMARTEnglish stopwords from the SMART information retrieval system (as documented in Appendix 11 of https://jmlr.csail.mit.edu/papers/volume5/lewis04a/) (which coincides with the stopword list used by the MC toolkit (https://www.cs.utexas.edu/~dml/software/mc/)),
and a set of stopword lists from the Snowball stemmer project in different
languages (obtained from
‘http://svn.tartarus.org/snowball/trunk/website/algorithms/*/stop.txt’).
Supported languages are danish, dutch, english,
finnish, french, german, hungarian, italian,
norwegian, portuguese, russian, spanish, and
swedish. Language names are case sensitive. Alternatively, their
IETF language tags may be used.
A character vector containing the requested stopwords. An error
is raised if no stopwords are available for the requested
kind.
stopwords("en")
stopwords("SMART")
stopwords("german")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.