catUTF8 | Print the UTF-8 codes of a string. |
createHashmapEnv | Create an environment for hash mapping. |
GBK | GBK character set |
getCharset | Get the current encoding of the locale. |
getWordFreq | Get the word frequency data.frame. |
isBIG5 | Indicate whether the encoding of the input string is BIG5. |
isGB18030 | Indicate whether the encoding of the input string is GB18030. |
isGB2312 | Indicate whether the encoding of the input string is GB2312. |
isGBK | Indicate whether the encoding of the input string is GBK. |
isUTF8 | Indicate whether the encoding of the input string is UTF-8. |
NTUSD | National Taiwan University Semantic Dictionary |
revUTF8 | Convert UTF-8 codes back to Chinese characters. |
setchs | Set locale to Simplified Chinese. |
setcht | Set locale to Traditional Chinese. |
SIMTRA | Dictionary of simplified and traditional Chinese |
stopwordsCN | Return Chinese stop words. |
strcap | Mixed case capitalizing. |
strextract | Extract matched substrings by regular expression. |
strpad | Pad a string to a specified length with a padding character. |
strstrip | Trim whitespace from a string. |
tmcnTest | Run unit tests. |
toPinyin | Convert a Chinese text to pinyin format. |
toTrad | Convert a Chinese text from simplified to traditional... |
toUTF8 | Convert encoding of Chinese string to UTF-8. |