SnowballC: Snowball stemmers based on the C libstemmer UTF-8 library

An R interface to the C libstemmer library that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of vocabulary. Currently supported languages are Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.

Author
Milan Bouchet-Valat [aut, cre]
Date of publication
2014-08-09 00:17:57
Maintainer
Milan Bouchet-Valat <nalimilan@club.fr>
License
BSD_2_clause + file LICENSE
Version
0.5.1

Man pages

getStemLanguages
Query the list of supported languages
wordStem
Get the stem of words
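
The two functions listed above can be tried together in a short session. This is a minimal sketch, assuming the SnowballC package is installed; the sample words are illustrative only, and the names returned by getStemLanguages may differ between package versions.

```r
library(SnowballC)

# Query which stemmers the installed version provides
langs <- getStemLanguages()
print(langs)

# Stem a few sample words (chosen for illustration) with the Porter stemmer;
# inflected forms such as "winning" should collapse to the root "win"
words <- c("win", "winning", "winner")
stems <- wordStem(words, language = "porter")
print(stems)

# Guard against unsupported languages before stemming
lang <- "english"
if (lang %in% langs) {
  print(wordStem(c("comparison", "comparing"), language = lang))
}
```

Checking a language name against getStemLanguages before calling wordStem is a small design choice that avoids an error when a script is run against a version of the package with a different set of stemmers.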

Files in this package

SnowballC
SnowballC/inst
SnowballC/inst/words
SnowballC/inst/words/finnish.RData
SnowballC/inst/words/italian.RData
SnowballC/inst/words/hungarian.RData
SnowballC/inst/words/norwegian.RData
SnowballC/inst/words/french.RData
SnowballC/inst/words/dutch.RData
SnowballC/inst/words/spanish.RData
SnowballC/inst/words/swedish.RData
SnowballC/inst/words/porter.RData
SnowballC/inst/words/german.RData
SnowballC/inst/words/portuguese.RData
SnowballC/inst/words/english.RData
SnowballC/inst/words/danish.RData
SnowballC/inst/words/turkish.RData
SnowballC/inst/words/romanian.RData
SnowballC/inst/words/russian.RData
SnowballC/inst/words.R
SnowballC/src
SnowballC/src/header.h
SnowballC/src/modules_utf8.h
SnowballC/src/stem_UTF_8_turkish.c
SnowballC/src/stem_UTF_8_swedish.h
SnowballC/src/stem_UTF_8_german.h
SnowballC/src/stem_UTF_8_porter.h
SnowballC/src/stem_UTF_8_danish.h
SnowballC/src/stem_UTF_8_italian.c
SnowballC/src/stem_UTF_8_finnish.h
SnowballC/src/utilities.c
SnowballC/src/stem_UTF_8_norwegian.c
SnowballC/src/stem_UTF_8_dutch.c
SnowballC/src/stem_UTF_8_russian.h
SnowballC/src/stem_UTF_8_dutch.h
SnowballC/src/stem_UTF_8_italian.h
SnowballC/src/libstemmer_utf8.c
SnowballC/src/stem_UTF_8_portuguese.c
SnowballC/src/stem.c
SnowballC/src/stem_UTF_8_hungarian.c
SnowballC/src/libstemmer.h
SnowballC/src/stem_UTF_8_porter.c
SnowballC/src/stem_UTF_8_portuguese.h
SnowballC/src/stem_UTF_8_english.c
SnowballC/src/stem_UTF_8_romanian.c
SnowballC/src/stem_UTF_8_swedish.c
SnowballC/src/stem_UTF_8_romanian.h
SnowballC/src/stem_UTF_8_german.c
SnowballC/src/stem_UTF_8_french.h
SnowballC/src/stem_UTF_8_turkish.h
SnowballC/src/stem_UTF_8_norwegian.h
SnowballC/src/stem_UTF_8_spanish.h
SnowballC/src/stem_UTF_8_danish.c
SnowballC/src/stem_UTF_8_french.c
SnowballC/src/stem_UTF_8_spanish.c
SnowballC/src/stem_UTF_8_english.h
SnowballC/src/stem_UTF_8_finnish.c
SnowballC/src/stem_UTF_8_russian.c
SnowballC/src/stem_UTF_8_hungarian.h
SnowballC/src/api.c
SnowballC/src/api.h
SnowballC/NAMESPACE
SnowballC/NEWS
SnowballC/R
SnowballC/R/stem.R
SnowballC/MD5
SnowballC/DESCRIPTION
SnowballC/man
SnowballC/man/getStemLanguages.Rd
SnowballC/man/wordStem.Rd
SnowballC/LICENSE