Resha: A package for stemming Turkish more gently than...


Description

This package is an R version of Harun Reşit Zafer's resha stemmer for Turkish, from which it takes its language resources.

Details

The main function is wordStem, which takes a token or a vector of tokens and returns, for each one, either its stem from a look-up table or the original token if no stem is found. Before stemming, each token is truncated at its first apostrophe, if it has one, so that only the part before the apostrophe is looked up.
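
The look-up behaviour described above can be sketched in a few lines of base R. This is only an illustration, not the package's implementation: sketch_stem and stem_table are hypothetical names, the table entries are made up for the example, and returning the untruncated token when no stem is found is an assumption about what "original token" means.

```r
# Hypothetical look-up table (token -> stem); NOT the package's language resources.
stem_table <- c(kitaplar = "kitap", evlerde = "ev", Ankara = "Ankara")

# Hypothetical stand-in for the behaviour described for wordStem.
sketch_stem <- function(tokens, table = stem_table) {
  # Keep only the part of each token before its first apostrophe.
  bare <- sub("'.*$", "", tokens)
  # Look up the truncated tokens by name; unknown tokens yield NA.
  stems <- unname(table[bare])
  # Assumption: fall back to the token exactly as it was supplied.
  missing <- is.na(stems)
  stems[missing] <- tokens[missing]
  stems
}

result <- sketch_stem(c("kitaplar", "Ankara'dan", "bilinmeyen"))
print(result)
# "kitap" "Ankara" "bilinmeyen"
```

A named character vector keeps the look-up vectorised, so a whole vector of tokens is stemmed with a single indexing operation.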

You can add your own token-stem mappings either one at a time using add_stem or all together from an input file using add_stems.
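
As a rough picture of what adding mappings to a look-up table involves, the sketch below extends a named character vector first with a single pair and then from a file. The helper names sketch_add_one and sketch_add_file are hypothetical and are not the signatures of add_stem or add_stems. The file layout (one tab-separated token and stem per line, no header) is also an assumption, since the format the package expects is not described here.

```r
# Hypothetical helpers; NOT the package's add_stem / add_stems.
sketch_add_one <- function(table, token, stem) {
  # Add or overwrite a single token -> stem mapping.
  table[token] <- stem
  table
}

sketch_add_file <- function(table, path) {
  # Assumed format: "token<TAB>stem" on each line, no header row.
  pairs <- read.delim(path, header = FALSE, quote = "",
                      col.names = c("token", "stem"),
                      colClasses = "character", encoding = "UTF-8")
  # Add or overwrite every mapping found in the file.
  table[pairs$token] <- pairs$stem
  table
}

stems <- c(kitaplar = "kitap")
stems <- sketch_add_one(stems, "evlerde", "ev")

# Write a small example file to a temporary location and load it.
path <- tempfile(fileext = ".tsv")
writeLines(c("geldim\tgel", "gozler\tgoz"), path)
stems <- sketch_add_file(stems, path)
print(stems)
```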


conjugateprior/Resha documentation built on May 20, 2019, 5:20 p.m.