stopwordsISO: Combined stop words for all languages

stopwordsISOR Documentation

Combined stop words for all languages

Description

A list of stop words in each of the supported languages

Usage

stopwordsISO

Format

An object of class list of length 65.

Details

Note: All Unicode characters are escaped. To un-escape them, consider using:

   library(AllStopwords)
   if(!requireNamespace('stringi')){
     install.packages('stringi')
   }
   data('stopwordsISO')
   stopwords_unescaped <- lapply(stopwordsISO,
                                 stringi::stri_unescape_unicode)

Author(s)

Each stop-word list's Authors

Source

All unique stopwords in the following databases:

  • the StopwordISO repository;

  • python's Natural Language Toolkit (nltk);

  • the Snowball stop-word list;

  • the R package quanteda;

  • the marimo repository;

  • the Perseus project; and

  • AurĂ©lien Berra's list of stop words for Ancient Greek and Latin (\Sexpr[results=rd]{tools:::Rd_expr_doi("10.5281/zenodo.3860343")}).


morestopwords documentation built on July 10, 2023, 2 a.m.