mpmiExcraction: Extract words based on MPMI+BE algorithm


View source: R/mpmiExtraction.R

Description

Extract words based on the MPMI+BE algorithm.

Usage

mpmiExcraction(
  df,
  ChineseCharFilter = T,
  lambda = 0.25,
  optics,
  steps = 100,
  threshold,
  bayesianCutoff
)

Arguments

df

A one-column data frame.

ChineseCharFilter

Drop non-Chinese characters. Default is TRUE.

lambda

Weight that balances PMI against BE in the score; it directly affects the results. Default is 0.25.

optics

Automatically optimize the result. Set to TRUE if you don't know how to choose lambda.

steps

Number of iterations in the optimization process. Default is 100.

threshold

Threshold for the final score.

bayesianCutoff

Threshold for the Bayesian probability value in the word extension process.
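
To make the role of lambda more concrete, here is a minimal sketch of how a PMI score and a BE score could be blended. It is not the package's implementation. It assumes that BE stands for branch entropy (the entropy of the characters that follow a candidate string) and that the two scores are combined as a convex combination, lambda * PMI + (1 - lambda) * BE; this page states neither. The toy corpus and the helpers `pmi()` and `right_entropy()` are hypothetical, and ASCII letters stand in for Chinese characters so the sketch runs in any locale.

```r
# Toy corpus (hypothetical data); each string is one record.
corpus <- c("abcd", "abce", "abxf", "zabc")
chars  <- strsplit(corpus, "")

# Counts of single characters and of adjacent character pairs.
uni <- table(unlist(chars))
bigrams <- unlist(lapply(chars, function(ch) {
  if (length(ch) < 2) return(character(0))
  paste0(ch[-length(ch)], ch[-1])
}))
bi <- table(bigrams)

# Pointwise mutual information of the pair xy: log2(p(xy) / (p(x) * p(y))).
pmi <- function(x, y) {
  p_xy <- bi[[paste0(x, y)]] / sum(bi)
  p_x  <- uni[[x]] / sum(uni)
  p_y  <- uni[[y]] / sum(uni)
  log2(p_xy / (p_x * p_y))
}

# Right branch entropy of w: entropy of the characters that directly follow w.
right_entropy <- function(w) {
  n <- nchar(w)
  followers <- unlist(lapply(corpus, function(s) {
    out <- character(0)
    for (i in seq_len(max(nchar(s) - n, 0))) {
      if (substr(s, i, i + n - 1) == w) {
        out <- c(out, substr(s, i + n, i + n))
      }
    }
    out
  }))
  if (length(followers) == 0) return(0)
  p <- as.numeric(table(followers)) / length(followers)
  -sum(p * log2(p))
}

# Assumed blend: lambda weights PMI, (1 - lambda) weights BE.
lambda <- 0.25
score  <- lambda * pmi("a", "b") + (1 - lambda) * right_entropy("ab")
print(score)
```

Under this assumed form, a small lambda lets the entropy term dominate and a large lambda favors PMI, which is one way to read the note that lambda directly affects the results.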

Value

List of extracted strings.

Examples

mpmiExcraction(drugnames_test,
               ChineseCharFilter = TRUE,
               lambda = 0.25,
               optics = FALSE,
               threshold = 30,
               bayesianCutoff = 0.6)
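
If your own data is not yet in the shape that `df` expects, the following sketch shows one way to build a one-column data frame and pass it to the function with the same settings as the example above. The column name `name` and the sample strings are hypothetical. The package name `mpb` is inferred from the repository name and is an assumption, so the call is guarded. Whether an input this small produces any words at `threshold = 30` is not something the sketch claims.

```r
# Hypothetical input: one record per row in a single character column.
drug_df <- data.frame(
  name = c("阿司匹林肠溶片", "阿莫西林胶囊", "布洛芬缓释胶囊", "头孢克肟分散片"),
  stringsAsFactors = FALSE
)

# Confirm the shape described under Arguments: exactly one column.
stopifnot(is.data.frame(drug_df), ncol(drug_df) == 1)

# Run the extraction only if the package (assumed to be named "mpb") is installed.
if (requireNamespace("mpb", quietly = TRUE)) {
  words <- mpb::mpmiExcraction(drug_df,
                               ChineseCharFilter = TRUE,
                               lambda = 0.25,
                               optics = FALSE,
                               threshold = 30,
                               bayesianCutoff = 0.6)
}
```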

Flaretie/mpb documentation built on Jan. 24, 2020, 3:18 a.m.