Rstem: Interface to Snowball implementation of Porter's word stemming algorithm.
Version 0.4-1

An R interface to the C code that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of texts. There is code to for different languages (i.e. Danish, Dutch, English, Finnish, French, German, Norwegian, Portuguese, Russian, Spanish, Swedish). However, these may not be applicable if the words require UTF encoding. This is extensible by allowing different routines to be specified to create the C routines used in the stemming, permitting debugging, profiling, pool management, caching, etc.

AuthorDuncan Temple Lang [aut], Milan Bouchet-Valat [cre]
Date of publication2013-04-21 11:55:50
MaintainerMilan Bouchet-Valat <nalimilan@club.fr>
LicenseBSD
Version0.4-1
Package repositoryView on R-Forge
InstallationInstall the latest version of this package by entering the following in R:
install.packages("Rstem", repos="http://R-Forge.R-project.org")

Popular man pages

getStemLanguages: Query the languages supported in this package
wordStem: Get the common root/stem of words
See all...

All man pages Function index File listing

Man pages

getStemLanguages: Query the languages supported in this package
wordStem: Get the common root/stem of words

Functions

getStemLanguages Man page
getTestedLanguages Source code
wordStem Man page

Files

DESCRIPTION
NAMESPACE
R
R/langs.R
R/stem.S
SPlus
Todo.html
Web
Web/index.html
inst
inst/scripts
inst/scripts/README.html
inst/scripts/download
inst/words
inst/words/english
inst/words/english/output.txt
inst/words/english/stop.txt
inst/words/english/voc.txt
inst/words/french
inst/words/french/output.txt
inst/words/french/stop.txt
inst/words/french/voc.txt
man
man/getStemLanguages.Rd
man/wordStem.Rd
src
src/Languages.h
src/Makevars
src/api.c
src/api.h
src/danish_stem.c
src/danish_stem.h
src/dutch_stem.c
src/dutch_stem.h
src/english_stem.c
src/english_stem.h
src/finnish_stem.c
src/finnish_stem.h
src/french_stem.c
src/french_stem.h
src/german_stem.c
src/german_stem.h
src/header.h
src/mytest.c
src/norwegian_stem.c
src/norwegian_stem.h
src/portuguese_stem.c
src/portuguese_stem.h
src/russian_stem.c
src/russian_stem.h
src/spanish_stem.c
src/spanish_stem.h
src/stem.h
src/swedish_stem.c
src/swedish_stem.h
src/utilities.c
vignettes
vignettes/stemming.tex
Rstem documentation built on May 21, 2017, 3:59 a.m.

Questions? Problems? Suggestions? Tweet to @rdrrHQ or email at ian@mutexlabs.com.

Please suggest features or report bugs in the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.