BasicTokenizer    R Documentation
(I'm not sure that this object-based approach is the best fit for an R implementation; for now the goal is simply to reproduce the Python functionality.)
BasicTokenizer(do_lower_case = TRUE)
do_lower_case: Logical; the value to give to the "do_lower_case" argument in the BasicTokenizer object (whether text is lowercased before tokenization).
Has methods:
  'tokenize.BasicTokenizer()'
  'run_strip_accents.BasicTokenizer()' (internal use)
  'run_split_on_punc.BasicTokenizer()' (internal use)
  'tokenize_chinese_chars.BasicTokenizer()' (internal use)
  'is_chinese_char.BasicTokenizer()' (internal use)
  'clean_text.BasicTokenizer()' (internal use)
An object of class BasicTokenizer.
## Not run: 
b_tokenizer <- BasicTokenizer(TRUE)

## End(Not run)
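The object-plus-methods pattern described on this page can be illustrated with a minimal base R sketch. This is not the package's implementation: the names `sketch_tokenizer` and `sketch_tokenize` are hypothetical, and the method only lowercases and splits on ASCII punctuation and whitespace. Accent stripping, text cleaning, and Chinese character handling are left out.

```r
# Hypothetical constructor (not part of the package): stores the option in a
# list and tags it with an S3 class so methods can dispatch on it.
sketch_tokenizer <- function(do_lower_case = TRUE) {
  stopifnot(is.logical(do_lower_case), length(do_lower_case) == 1L)
  structure(list(do_lower_case = do_lower_case), class = "sketch_tokenizer")
}

# Hypothetical generic standing in for a tokenize step.
sketch_tokenize <- function(tokenizer, text) UseMethod("sketch_tokenize")

sketch_tokenize.sketch_tokenizer <- function(tokenizer, text) {
  if (tokenizer$do_lower_case) {
    text <- tolower(text)
  }
  # Surround each punctuation character with spaces so it becomes its own token.
  text <- gsub("([[:punct:]])", " \\1 ", text)
  # Split on runs of whitespace and drop any empty strings.
  tokens <- strsplit(text, "[[:space:]]+")[[1]]
  tokens[nzchar(tokens)]
}

tok <- sketch_tokenizer(TRUE)
sketch_tokenize(tok, "Hello, World!")
# "hello" ","     "world" "!"
```

Keeping the option on the object means it is set once at construction and every later call to the method sees the same setting, which mirrors how the Python class carries its configuration.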