Man pages for OPTI-SURVEIL/chinsimi
Measuring the Similarity between Chinese Strings

ambig_count      Functions to indicate the presence of characters signalling...
ChStr2fc         Convert Chinese strings to four corner code.
ChStr2py         Convert Chinese strings to pinyin.
ChStr2rad        Extract radical decomposition of Chinese Character strings.
ChStr2wb         Convert Chinese strings to wubi code (based on radicals).
FClib            Four Corner database
hancheck         Rough detection of Han Chinese names
hannames         Baidu's top 400 most common Han family names, 2013
homonym          Support function for Pinyin with more than one pronunciation....
idslib           Character decomposition dataset
name_freq_table  Functions to calculate empirical relative frequency of...
pylib            Pinyin database
pylibnew         Pinyin database
rad100lib        Radical decomposition dataset after full decomposition
rad1lib          Radical decomposition dataset after one round of...
revpy            Function to parse format used for storing character matches...
sim_func         Compute Levenshtein edit similarity between strings,...
sim_func_mat     Compute a matrix of similarity measures between strings,...
str100lib        Character structure dataset after full decomposition
str1lib          Character structure dataset after a single round of...
WBlib            Wubi code database
OPTI-SURVEIL/chinsimi documentation built on Oct. 27, 2019, 7:05 p.m.