rslp_doc: RSLP Document


View source: R/rslp.R

Description

Applies the Stemming Algorithm for the Portuguese Language to a vector of documents. Words are extracted using the regex "\b[:alpha:]\b".
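The extract-then-replace idea described above can be sketched in base R. This is a minimal illustration, not the package's implementation: the pattern "[[:alpha:]]+" and the use of toupper as a stand-in for the stemmer are assumptions made for this sketch only.

```r
# Illustrative only: a base R stand-in for the word-extraction step.
docs <- c("coma frutas pois elas fazem bem para.")

# Find every run of alphabetic characters in each document.
# The pattern "[[:alpha:]]+" is an assumption chosen for this sketch.
matches <- gregexpr("[[:alpha:]]+", docs)
words <- regmatches(docs, matches)

# Placeholder transformation (toupper) standing in for a real stemmer.
# Only the matched words are replaced; punctuation and spacing are kept.
transformed <- docs
regmatches(transformed, matches) <- lapply(words, toupper)

print(words[[1]])
print(transformed)
```

Because only the matched spans are rewritten, the trailing period in the document is left untouched.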

Usage

rslp_doc(
  docs,
  steprules = readRDS(system.file("steprules.rds", package = "rslp"))
)

Arguments

docs

Character vector of documents.

steprules

Stemming rules, as returned by the function extract_rules. Set this only if you are certain about the rules you are supplying. The default reads the parsed version of the rules installed with the package.
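As a hedged sketch of the steprules argument, the snippet below loads the installed rules explicitly, using the same expression shown in Usage, and passes them to rslp_doc. It assumes the rslp package is installed. The guard with requireNamespace and the variable names out_default and out_explicit are additions for illustration only.

```r
# Run only if the rslp package is available (guard added for this sketch).
if (requireNamespace("rslp", quietly = TRUE)) {
  # Same expression as the default value of `steprules` in Usage.
  steprules <- readRDS(system.file("steprules.rds", package = "rslp"))

  docs <- c("coma frutas pois elas fazem bem para.")

  # Call once with the default rules and once with the rules passed explicitly.
  out_default  <- rslp::rslp_doc(docs)
  out_explicit <- rslp::rslp_doc(docs, steprules = steprules)

  # Both calls are given the same rules, so the results should match.
  print(identical(out_default, out_explicit))
}
```

Passing steprules explicitly is mainly useful when the rules come from somewhere other than the installed file, for example from a separate call to extract_rules.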

References

V. Orengo, C. Huyck, "A Stemming Algorithm for the Portuguese Language", String Processing and Information Retrieval, International Symposium on (SPIRE), 2001, pp. 0186, doi:10.1109/SPIRE.2001.10024

Examples

docs <- c("coma frutas pois elas fazem bem para.")
rslp_doc(docs)

rslp documentation built on July 1, 2020, 11:11 p.m.