stem_rslp: Stemming using RSLP

Description Usage Arguments References Examples

Description

This function uses the RSLP algorithm to stem a vector of words. By default, the RSLP algorithm leaves words cutted. As this makes reading stemmed texts very difficult, this function provides an option to complete the stemmed words. By default it completes with the most used word in the text that has the same stem.

Usage

1
stem_rslp(words, complete = TRUE)

Arguments

words

character vector of words to be stemmed

complete

wheter words must be completed or not (T)

References

V. Orengo, C. Huyck, "A Stemming Algorithmm for the Portuguese Language", SPIRE, 2001, String Processing and Information Retrieval, International Symposium on, String Processing and Information Retrieval, International Symposium on 2001, pp. 0186, doi:10.1109/SPIRE.2001.10024

Examples

1
2
words <- c("balões", "aviões", "avião", "gostou", "gosto", "gostaram")
ptstem:::stem_rslp(words)

ptstem documentation built on May 1, 2019, 7:32 p.m.