KN: Kneser Ney

Description Usage Arguments Format Details Author(s) See Also

Description

KN Kneser-Ney Statistical Learning Model

Usage

1

Arguments

x

a CVSet containing training and test Corpus objects

train

Train Corpus object. Ignored if x is a CVSet, required otherwise.

test

Test Corpus object. Ignored if x is a CVSet, required otherwise.

modelSize

Numeric between 1, for unigram to 5 for quintgram.

openVocabulary

Logical. If TRUE, preprocessing will replace all words in the test corpus that are not in the training corpus with the pseudo-word UNK. If FALSE, all words in test corpus are assumed to be in the training corpus. The default is TRUE.

Format

An object of class R6ClassGenerator of length 24.

Details

Encapsulates a Statistical Language Model implementing the Kneser-Ney smoothing algorithm.

Author(s)

John James, jjames@dataScienceSalon.org

See Also

Other Statistical Language Model Classes: Katz, MKN, SLM0


DecisionScients/NLPStudio documentation built on May 15, 2019, 12:51 p.m.