lemma_data: Lemma Data

Description Usage Format Source

Description

A dataset containing inflected and base forms of text, used for the lemmatize function.

Usage

1

Format

A data frame with 47,366 rows and 2 variables:

inflected_form

the original, inflected word (e.g. 'dogs' or 'walked')

lemma

the base form of the word (e.g. 'dog' or 'walk')

Source

https://github.com/tm4ss/tm4ss.github.io/blob/master/resources/baseform_en.tsv


nlanderson9/languagePredictR documentation built on June 10, 2021, 11 a.m.