hanlp.newWordDiscover: new word discovering .

Description Usage Arguments Value Author(s) Examples

Description

A function of new word discovering .

Usage

1
2
hanlp.newWordDiscover(doc, size = 100L, max_len = 5L, min_freq = 5e-05,
  min_entropy = 0.4, min_aggregation = 1.2, filter = TRUE)

Arguments

doc

A big Chinese document in UTF-8 .

size

output word number ,default 100.

max_len

max word length default 5.

min_freq

min word freq ,default 0.00005.

min_entropy

min word entropy,default 0.4.

min_aggregation

min mutual information of words,default 1.2.

filter

Whether or not to filter out the words already existing in the HanLP dict,default TRUE.

Value

a character .

Author(s)

qxde01

Examples

1
2
3
4
## Not run: 
hanlp.newWordDiscover("hello world!")

## End(Not run)

qxde01/RHanLP documentation built on May 13, 2019, 8:39 a.m.