Description Usage Arguments Details Value Examples
The kuromoji
method of jaNLP
class for Japanese PoS tagging and morphological analyze
1 2 |
|
input single character vector for Kuromoji analysis |
|
set Kuromoji's segmentation modes (default: "Normal"). See http://www.atilika.org[Kuromoji] "Designed for search" |
|
kuromoji.jar file path (default: "./libs/kuromoji-0.7.7.jar") |
"Kuromoji is an open source Japanese morphological analyzer written in Java."
http://www.atilika.org
surface
<e8><a1><a8><e5><b1><a4><e5><bd><a2>
feature
<e5><93><81><e8><a9><9e>,<e5><93><81><e8><a9><9e><e7><b4><b0><e5><88><86><e9><a1><9e>1,<e5><93><81><e8><a9><9e><e7><b4><b0><e5><88><86><e9><a1><9e>2,<e5><93><81><e8><a9><9e><e7><b4><b0><e5><88><86><e9><a1><9e>3,<e6><b4><bb><e7><94><a8><e5><bd><a2>,<e6><b4><bb><e7><94><a8><e5><9e><8b>,<e5><8e><9f><e5><bd><a2>,<e8><aa><ad><e3><81><bf>,<e7><99><ba><e9><9f><b3>
is_know
"Returns true if this token is known word"
is_unk
"Returns true if this token is unknown word"
is_user
"Returns true if this token is defined in user dictionary"
mode
segmentation modes
1 2 3 4 5 6 7 8 | library(RjaNLP)
example <- JaNLP$new(
text = "<e6><97><a5><e6><9c><ac><e7><b5><8c><e6><b8><88><e6><96><b0><e8><81><9e><e3><81><a7><e3><83><a2><e3><83><90><e3><82><b2><e3><83><bc><e3><81><ae><e8><a8><98><e4><ba><8b><e3><82><92><e8><aa><ad><e3><82><93><e3><81><a0>",
kuromoji_jar = "./share/kuromoji-0.7.7.jar"
)
normal.res <- example$kuromoji()
search.res <- example$kuromoji(set.mode = "SEARCH")
extend.res <- example$kuromoji(set.mode = "EXTENDED")
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.