.mp_tokenize_word_lookup | R Documentation |
Look up a word in the table; go to fall-back otherwise.
.mp_tokenize_word_lookup(word, vocab, lookup, unk_token, max_chars)
word |
Character scalar; word to tokenize. |
vocab |
A morphemepiece vocabulary. |
lookup |
A morphemepiece lookup table. |
unk_token |
Token to represent unknown words. |
max_chars |
Maximum length of word recognized. |
Input word, broken into tokens.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.