SnowballC: Snowball stemmers based on the C libstemmer UTF-8 library

An R interface to the C libstemmer library that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of vocabulary. Currently supported languages are Danish, Dutch, English, Finnish, French, German, Hungarian, Italian, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish and Turkish.

AuthorMilan Bouchet-Valat [aut, cre]
Date of publication2014-08-09 00:17:57
MaintainerMilan Bouchet-Valat <nalimilan@club.fr>
LicenseBSD_2_clause + file LICENSE
Version0.5.1
https://r-forge.r-project.org/projects/r-temis/

View on CRAN

Files in this package

SnowballC
SnowballC/inst
SnowballC/inst/words
SnowballC/inst/words/finnish.RData
SnowballC/inst/words/italian.RData
SnowballC/inst/words/hungarian.RData
SnowballC/inst/words/norwegian.RData
SnowballC/inst/words/french.RData
SnowballC/inst/words/dutch.RData
SnowballC/inst/words/spanish.RData
SnowballC/inst/words/swedish.RData
SnowballC/inst/words/porter.RData
SnowballC/inst/words/german.RData
SnowballC/inst/words/portuguese.RData
SnowballC/inst/words/english.RData
SnowballC/inst/words/danish.RData
SnowballC/inst/words/turkish.RData
SnowballC/inst/words/romanian.RData
SnowballC/inst/words/russian.RData
SnowballC/inst/words.R
SnowballC/src
SnowballC/src/header.h
SnowballC/src/modules_utf8.h
SnowballC/src/stem_UTF_8_turkish.c
SnowballC/src/stem_UTF_8_swedish.h
SnowballC/src/stem_UTF_8_german.h
SnowballC/src/stem_UTF_8_porter.h
SnowballC/src/stem_UTF_8_danish.h
SnowballC/src/stem_UTF_8_italian.c
SnowballC/src/stem_UTF_8_finnish.h
SnowballC/src/utilities.c
SnowballC/src/stem_UTF_8_norwegian.c
SnowballC/src/stem_UTF_8_dutch.c
SnowballC/src/stem_UTF_8_russian.h
SnowballC/src/stem_UTF_8_dutch.h
SnowballC/src/stem_UTF_8_italian.h
SnowballC/src/libstemmer_utf8.c
SnowballC/src/stem_UTF_8_portuguese.c
SnowballC/src/stem.c
SnowballC/src/stem_UTF_8_hungarian.c
SnowballC/src/libstemmer.h
SnowballC/src/stem_UTF_8_porter.c
SnowballC/src/stem_UTF_8_portuguese.h
SnowballC/src/stem_UTF_8_english.c
SnowballC/src/stem_UTF_8_romanian.c
SnowballC/src/stem_UTF_8_swedish.c
SnowballC/src/stem_UTF_8_romanian.h
SnowballC/src/stem_UTF_8_german.c
SnowballC/src/stem_UTF_8_french.h
SnowballC/src/stem_UTF_8_turkish.h
SnowballC/src/stem_UTF_8_norwegian.h
SnowballC/src/stem_UTF_8_spanish.h
SnowballC/src/stem_UTF_8_danish.c
SnowballC/src/stem_UTF_8_french.c
SnowballC/src/stem_UTF_8_spanish.c
SnowballC/src/stem_UTF_8_english.h
SnowballC/src/stem_UTF_8_finnish.c
SnowballC/src/stem_UTF_8_russian.c
SnowballC/src/stem_UTF_8_hungarian.h
SnowballC/src/api.c
SnowballC/src/api.h
SnowballC/NAMESPACE
SnowballC/NEWS
SnowballC/R
SnowballC/R/stem.R
SnowballC/MD5
SnowballC/DESCRIPTION
SnowballC/man
SnowballC/man/getStemLanguages.Rd SnowballC/man/wordStem.Rd
SnowballC/LICENSE

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

All documentation is copyright its authors; we didn't write any of that.