kuromoji: Japanese Part-of-Speech and Morphological Analyze using...

Description Usage Arguments Details Value Examples

Description

The kuromoji method of jaNLP class for Japanese PoS tagging and morphological analyze

Usage

1
2
## S4 method for signature 'jaNLP'
kuromoji(str = "", set.mode = NULL, jar.path = NULL)

Arguments

str

input single character vector for Kuromoji analysis

set.mode

set Kuromoji's segmentation modes (default: "Normal"). See http://www.atilika.org[Kuromoji] "Designed for search"

jar.path

kuromoji.jar file path (default: "./libs/kuromoji-0.7.7.jar")

Details

"Kuromoji is an open source Japanese morphological analyzer written in Java."
http://www.atilika.org

Value

surface

<e8><a1><a8><e5><b1><a4><e5><bd><a2>

feature

<e5><93><81><e8><a9><9e>,<e5><93><81><e8><a9><9e><e7><b4><b0><e5><88><86><e9><a1><9e>1,<e5><93><81><e8><a9><9e><e7><b4><b0><e5><88><86><e9><a1><9e>2,<e5><93><81><e8><a9><9e><e7><b4><b0><e5><88><86><e9><a1><9e>3,<e6><b4><bb><e7><94><a8><e5><bd><a2>,<e6><b4><bb><e7><94><a8><e5><9e><8b>,<e5><8e><9f><e5><bd><a2>,<e8><aa><ad><e3><81><bf>,<e7><99><ba><e9><9f><b3>

is_know

"Returns true if this token is known word"

is_unk

"Returns true if this token is unknown word"

is_user

"Returns true if this token is defined in user dictionary"

mode

segmentation modes

Examples

1
2
3
4
5
6
7
8
library(RjaNLP)
example <- JaNLP$new(
 text = "<e6><97><a5><e6><9c><ac><e7><b5><8c><e6><b8><88><e6><96><b0><e8><81><9e><e3><81><a7><e3><83><a2><e3><83><90><e3><82><b2><e3><83><bc><e3><81><ae><e8><a8><98><e4><ba><8b><e3><82><92><e8><aa><ad><e3><82><93><e3><81><a0>",
 kuromoji_jar = "./share/kuromoji-0.7.7.jar"
)
normal.res <- example$kuromoji()
search.res <- example$kuromoji(set.mode = "SEARCH")
extend.res <- example$kuromoji(set.mode = "EXTENDED")

yamano357/rJaNLP documentation built on May 4, 2019, 2:28 p.m.