View source: R/preprocessing.R
Update the tokenizer's internal vocabulary based on a list of texts or a list of sequences.
fit_text_tokenizer(object, x)
object | Tokenizer returned by text_tokenizer().
x | Vector/list of strings, or a generator of strings (for memory efficiency); alternatively, a list of "sequences" (a sequence is a list of integer word indices).
Required before using texts_to_sequences(), texts_to_matrix(), or sequences_to_matrix().
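The ordering described in this note can be sketched as follows. This is a minimal illustration, not an official example: it assumes the keras R package is installed with a working TensorFlow backend that still provides the legacy text tokenizer, and the corpus and the num_words value are hypothetical.

```r
library(keras)

# Hypothetical toy corpus, chosen only for illustration.
texts <- c("the cat sat on the mat", "the dog sat")

# Create a tokenizer; num_words = 100 is an arbitrary illustrative cap.
tokenizer <- text_tokenizer(num_words = 100)

# Fit first, so the tokenizer's vocabulary reflects the corpus.
fit_text_tokenizer(tokenizer, texts)

# Only then convert the texts to lists of integer word indices.
sequences <- texts_to_sequences(tokenizer, texts)
```

The tokenizer is updated in place, so the result of fit_text_tokenizer() does not need to be reassigned before calling texts_to_sequences().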
Other text tokenization: save_text_tokenizer(), sequences_to_matrix(), text_tokenizer(), texts_to_matrix(), texts_to_sequences_generator(), texts_to_sequences()