stem.text: Stem text


Description

A function to stem text by removing 'ing', 'ed', 'est', 'es' and 'ion' from the end of words. It is less aggressive than the Porter stemming algorithm but is still useful for reducing data sparsity. One limitation is over-stemming of short words: 'thing' becomes 'th' and 'action' becomes 'act'.

Usage

stem.text(text.clean)

Arguments

text.clean

character vector

Value

vector with the stemmed text
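
Examples

The suffix rule in the Description can be illustrated with a minimal base R sketch. This is not the package's source: stem_sketch is a hypothetical name, and the assumptions that each word loses at most one suffix and that a single regular expression is used are mine.

```r
# Hypothetical sketch, not the tc package's implementation.
# Assumes at most one trailing suffix is stripped from each word.
stem_sketch <- function(text.clean) {
  # "\\b" ties the suffix to the end of a word, so this also works
  # when an element of the vector holds several words
  gsub("(ing|ed|est|es|ion)\\b", "", text.clean, perl = TRUE)
}

stem_sketch(c("walked", "boxes", "thing", "action"))
# returns c("walk", "box", "th", "act")
```

The last two results reproduce the limitation noted in the Description: short words such as 'thing' and 'action' are cut down to 'th' and 'act'.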


bvidgen/tc documentation built on May 9, 2019, 2:21 a.m.