get.estimate: Get single stabilization estimate

Description Usage Arguments Value See Also Examples

Description

get.estimate is used to return a single entropy rate estimate for a given text.

Usage

1
get.estimate(text, max.length = length(text), every.word = 10)

Arguments

text

A character vector. Preferably this should be read with read.file to take care of preprocessing. Note that internally this uses a regular expression for the matching so in non-ASCII texts it might be unstable. In those cases it is recommended to normalize the text beforehand.

max.length

A numeric scalar. How many elements from text should be read? Defaults to the length of the text.

every.word

A numeric scalar. "every.word=1" specifies that each word token should be used for estimation. To speed up processing only every 2nd, 3rd, xth word token could be used. Hence, every.word can be assigned any integer between 1 and the step size.

Value

A numeric scalar. This is the entropy rate on the given chunk of text.

See Also

PreprocessText, read.file, IsSubstring

Examples

1
2
data(deuparl)
est <- get.estimate(deuparl, max.length = 10000, every.word = 10)

dimalik/EntropyEstimator documentation built on May 15, 2019, 8:44 a.m.