model_wordpiece | R Documentation |
An implementation of the WordPiece algorithm
An implementation of the WordPiece algorithm
tok::tok_model
-> tok_model_wordpiece
new()
Constructor for the wordpiece tokenizer
model_wordpiece$new( vocab = NULL, unk_token = NULL, max_input_chars_per_word = NULL )
vocab
A dictionary of string keys and their corresponding ids.
Default: NULL
.
unk_token
The unknown token to be used by the model.
Default: NULL
.
max_input_chars_per_word
The maximum number of characters to allow in a single word.
Default: NULL
.
clone()
The objects of this class are cloneable with this method.
model_wordpiece$clone(deep = FALSE)
deep
Whether to make a deep clone.
Other model:
model_bpe
,
model_unigram
,
tok_model
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.