jiebaR: Chinese Text Segmentation

Chinese text segmentation, keyword extraction and speech tagging For R.

Author
Qin Wenfeng, and the authors of CppJieba for the included version of CppJieba
Date of publication
2016-09-28 17:10:38
Maintainer
Qin Wenfeng <mail@qinwenfeng.com>
License
MIT + file LICENSE
Version
0.9.1
URLs

View on CRAN

Man pages

apply_list
Apply list input to a worker
DICTPATH
The path of dictionary
distance
Hamming distance of words
edit_dict
Edit default user dictionary
file_coding
Files encoding detection
filter_segment
Filter segmentation result
freq
The frequency of words
get_idf
generate IDF dict
get_tuple
get tuple from the segmentation result
jiebaR
A package for Chinese text segmentation
keywords
Keyword extraction
less-than-equals-.keywords
Keywords symbol
less-than-equals-.qseg
Quick mode symbol
less-than-equals-.segment
Text segmentation symbol
less-than-equals-.simhash
Simhash symbol
less-than-equals-.tagger
Tagger symbol
new_user_word
Add user word
print.jieba
Print worker settings
query_threshold
Set query threshold
segment
Chinese text segmentation function
set_qsegmodel
Set quick mode model
show_dictpath
Show default path of dictionaries
simhash
Simhash computation
simhash_dist
Compute Hamming distance of Simhash value
tagging
Speech Tagging
tobin
simhash value to binary
vector_tag
Tag the a character vector
words_locate
Get text location
worker
Initialize jiebaR worker

Files in this package

jiebaR
jiebaR/inst
jiebaR/inst/AUTHORS
jiebaR/inst/other
jiebaR/inst/other/get_tuple.R
jiebaR/inst/model
jiebaR/inst/model/backup.rda
jiebaR/inst/model/model.rda
jiebaR/inst/COPYRIGHTS
jiebaR/inst/doc
jiebaR/inst/doc/Quick_Start_Guide.html
jiebaR/inst/doc/Quick_Start_Guide.Rmd
jiebaR/inst/doc/Quick_Start_Guide.R
jiebaR/inst/include
jiebaR/inst/include/lib
jiebaR/inst/include/lib/SegmentBase.hpp
jiebaR/inst/include/lib/hashes
jiebaR/inst/include/lib/hashes/jenkins.h
jiebaR/inst/include/lib/Simhasher.hpp
jiebaR/inst/include/lib/QuerySegment.hpp
jiebaR/inst/include/lib/PosTagger.hpp
jiebaR/inst/include/lib/HMMModel.hpp
jiebaR/inst/include/lib/TransCode.hpp
jiebaR/inst/include/lib/MixSegment.hpp
jiebaR/inst/include/lib/MPSegment.hpp
jiebaR/inst/include/lib/HMMSegment.hpp
jiebaR/inst/include/lib/FullSegment.hpp
jiebaR/inst/include/lib/limonp
jiebaR/inst/include/lib/limonp/Logging.hpp
jiebaR/inst/include/lib/limonp/StringUtil.hpp
jiebaR/inst/include/lib/limonp/LocalVector.hpp
jiebaR/inst/include/lib/limonp/NonCopyable.hpp
jiebaR/inst/include/lib/limonp/StdExtension.hpp
jiebaR/inst/include/lib/PreFilter.hpp
jiebaR/inst/include/lib/LevelSegment.hpp
jiebaR/inst/include/lib/Jieba.hpp
jiebaR/inst/include/lib/KeywordExtractor.hpp
jiebaR/inst/include/lib/Trie.hpp
jiebaR/inst/include/lib/DictTrie.hpp
jiebaR/inst/include/segtype-v4.hpp
jiebaR/inst/include/jiebaRAPI.h
jiebaR/inst/include/jiebaR.h
jiebaR/tests
jiebaR/tests/testthat.R
jiebaR/tests/testthat
jiebaR/tests/testthat/CPP_API
jiebaR/tests/testthat/CPP_API/tests
jiebaR/tests/testthat/CPP_API/tests/testthat.R
jiebaR/tests/testthat/CPP_API/tests/testthat
jiebaR/tests/testthat/CPP_API/tests/testthat/test-cpp.R
jiebaR/tests/testthat/CPP_API/src
jiebaR/tests/testthat/CPP_API/src/Makevars
jiebaR/tests/testthat/CPP_API/src/test_api.cpp
jiebaR/tests/testthat/CPP_API/NAMESPACE
jiebaR/tests/testthat/CPP_API/R
jiebaR/tests/testthat/CPP_API/R/Rcpps.R
jiebaR/tests/testthat/CPP_API/R/all.R
jiebaR/tests/testthat/CPP_API/DESCRIPTION
jiebaR/tests/testthat/CPP_API/man
jiebaR/tests/testthat/CPP_API/man/filecoding.Rd
jiebaR/tests/testthat/CPP_API/LICENSE
jiebaR/tests/testthat/test-cut.R
jiebaR/tests/testthat/bylines.utf8
jiebaR/tests/testthat/test-api.R
jiebaR/tests/testthat/C_API
jiebaR/tests/testthat/C_API/tests
jiebaR/tests/testthat/C_API/tests/testthat.R
jiebaR/tests/testthat/C_API/tests/testthat
jiebaR/tests/testthat/C_API/tests/testthat/test-c.R
jiebaR/tests/testthat/C_API/src
jiebaR/tests/testthat/C_API/src/Makevars
jiebaR/tests/testthat/C_API/src/test_api.c
jiebaR/tests/testthat/C_API/NAMESPACE
jiebaR/tests/testthat/C_API/R
jiebaR/tests/testthat/C_API/R/Rcpps.R
jiebaR/tests/testthat/C_API/R/all.R
jiebaR/tests/testthat/C_API/DESCRIPTION
jiebaR/tests/testthat/C_API/man
jiebaR/tests/testthat/C_API/man/filecoding.Rd
jiebaR/tests/testthat/C_API/LICENSE
jiebaR/src
jiebaR/src/Makevars
jiebaR/src/word_freq.cpp
jiebaR/src/get_tuple.cpp
jiebaR/src/util.cpp
jiebaR/src/Makevars.win
jiebaR/src/init.c
jiebaR/src/RcppExports.cpp
jiebaR/src/segtype-v4.cpp
jiebaR/src/detect.cpp
jiebaR/src/get_idf.cpp
jiebaR/NAMESPACE
jiebaR/NEWS
jiebaR/R
jiebaR/R/jiebaR-package.r
jiebaR/R/dict_tools.R
jiebaR/R/gen_idf.R
jiebaR/R/segment.R
jiebaR/R/filter.R
jiebaR/R/overload.R
jiebaR/R/worker_func.R
jiebaR/R/keywords.R
jiebaR/R/quick.R
jiebaR/R/RcppExports.R
jiebaR/R/simhash.R
jiebaR/R/print.R
jiebaR/R/worker.R
jiebaR/R/ham_dist.R
jiebaR/R/util.R
jiebaR/R/words_freq.R
jiebaR/R/tagger.R
jiebaR/R/tobin.R
jiebaR/R/zzz.R
jiebaR/R/get_tuple.R
jiebaR/vignettes
jiebaR/vignettes/Quick_Start_Guide.Rmd
jiebaR/README.md
jiebaR/MD5
jiebaR/build
jiebaR/build/vignette.rds
jiebaR/DESCRIPTION
jiebaR/man
jiebaR/man/segment.Rd
jiebaR/man/filter_segment.Rd
jiebaR/man/less-than-equals-.qseg.Rd
jiebaR/man/vector_tag.Rd
jiebaR/man/freq.Rd
jiebaR/man/get_tuple.Rd
jiebaR/man/less-than-equals-.simhash.Rd
jiebaR/man/get_idf.Rd
jiebaR/man/print.jieba.Rd
jiebaR/man/less-than-equals-.keywords.Rd
jiebaR/man/file_coding.Rd
jiebaR/man/simhash_dist.Rd
jiebaR/man/tobin.Rd
jiebaR/man/less-than-equals-.segment.Rd
jiebaR/man/distance.Rd
jiebaR/man/new_user_word.Rd
jiebaR/man/words_locate.Rd
jiebaR/man/show_dictpath.Rd
jiebaR/man/query_threshold.Rd
jiebaR/man/worker.Rd
jiebaR/man/jiebaR.Rd
jiebaR/man/simhash.Rd
jiebaR/man/DICTPATH.Rd
jiebaR/man/less-than-equals-.tagger.Rd
jiebaR/man/set_qsegmodel.Rd
jiebaR/man/tagging.Rd
jiebaR/man/edit_dict.Rd
jiebaR/man/apply_list.Rd
jiebaR/man/keywords.Rd
jiebaR/LICENSE