SnowballC: Snowball Stemmers Based on the C 'libstemmer' UTF-8 Library

An R interface to the C 'libstemmer' library that implements Porter's word stemming algorithm for collapsing words to a common root to aid comparison of vocabulary. Currently supported languages are Arabic, Basque, Catalan, Danish, Dutch, English, Finnish, French, German, Greek, Hindi, Hungarian, Indonesian, Irish, Italian, Lithuanian, Nepali, Norwegian, Portuguese, Romanian, Russian, Spanish, Swedish, Tamil and Turkish.

Package details

Author: Milan Bouchet-Valat [aut, cre]
Maintainer: Milan Bouchet-Valat <nalimilan@club.fr>
License: BSD_3_clause + file LICENSE
Version: 0.7.1
URL: https://github.com/nalimilan/R.TeMiS
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
install.packages("SnowballC")
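
Once the package is installed, a minimal usage sketch might look like the following. It assumes the package's exported functions wordStem() and getStemLanguages(); the input words are purely illustrative and do not come from this page.

```r
# Sketch only: assumes SnowballC is installed and can be attached.
library(SnowballC)

# List the stemmer names accepted by the installed version.
getStemLanguages()

# Stem a few English words (illustrative input, not from the package page).
words <- c("win", "winning", "winner")
stems <- wordStem(words, language = "english")
print(stems)
```

wordStem() returns a character vector of the same length as its input, so the stems can be matched back to the original words by position.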

SnowballC documentation built on April 26, 2023, 1:17 a.m.