tokenize_chinese_chars | R Documentation
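R implementation of BasicTokenizer._tokenize_chinese_chars from BERT's tokenization.py. It puts a space before and after each CJK character. This may result in doubled-up spaces (for example, between two adjacent CJK characters), but that matches the behavior of the Python code.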
tokenize_chinese_chars(text)
text | A character scalar.
Text with spaces around CJK characters.
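The behavior described above can be sketched in base R. This is only an illustration under stated assumptions, not the package source: the names is_cjk_codepoint and pad_cjk_sketch are hypothetical, and the code point ranges are the CJK ideograph blocks checked by the Python BasicTokenizer.

```r
# Hypothetical helper (not part of the package): TRUE for code points in the
# CJK ideograph blocks that BERT's Python tokenizer treats as "Chinese chars".
is_cjk_codepoint <- function(cp) {
  (cp >= 0x4E00 & cp <= 0x9FFF) |
    (cp >= 0x3400 & cp <= 0x4DBF) |
    (cp >= 0x20000 & cp <= 0x2A6DF) |
    (cp >= 0x2A700 & cp <= 0x2B73F) |
    (cp >= 0x2B740 & cp <= 0x2B81F) |
    (cp >= 0x2B820 & cp <= 0x2CEAF) |
    (cp >= 0xF900 & cp <= 0xFAFF) |
    (cp >= 0x2F800 & cp <= 0x2FA1F)
}

# Hypothetical sketch of the padding step: wrap every CJK character in spaces
# and leave all other characters untouched.
pad_cjk_sketch <- function(text) {
  cps <- utf8ToInt(enc2utf8(text))                 # string -> code points
  chars <- vapply(cps, intToUtf8, character(1))    # one string per character
  cjk <- is_cjk_codepoint(cps)
  chars[cjk] <- paste0(" ", chars[cjk], " ")       # space on both sides
  paste(chars, collapse = "")
}

# Two adjacent CJK characters end up separated by two spaces.
pad_cjk_sketch("ab\u4e2d\u6587cd")
```

In this sketch each CJK character is padded independently, which is why adjacent CJK characters produce the doubled-up spaces mentioned in the description.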