Multi-lingual stopwords package in R

stopwords is an R package that provides easy access to stopwords in more than 50 languages in the Stopwords ISO library. This package should be used conjunction with packages such as quanteda to perform text analysis in many different languages.

Supported languages

Currently supported languages are the following:

lang <- stopwords::languages()
cat(paste0('- ', names(lang), ' [', unname(lang), ']\n'))

How to install

Please just execute the following command to install:

devtools::install_github("koheiw/stopwords")

How to use

The interface of the stopwords package is designed to be consistent with quanteda, but the words are considerably different:

head(quanteda::stopwords('english'), 10)
head(stopwords::stopwords('en'), 10)


koheiw/stopwords documentation built on May 30, 2019, 3:43 p.m.