Description Usage Arguments Details See Also Examples
View source: R/preprocessing.R
Enables persistence of text tokenizers alongside saved models.
1 2 3 | save_text_tokenizer(object, filename)
load_text_tokenizer(filename)
|
object |
Text tokenizer fit with |
filename |
File to save/load |
You should always use the same text tokenizer for training and
prediction. In many cases however prediction will occur in another
session with a version of the model loaded via load_model_hdf5()
.
In this case you need to save the text tokenizer object after training and then reload it prior to prediction.
Other text tokenization:
fit_text_tokenizer()
,
sequences_to_matrix()
,
text_tokenizer()
,
texts_to_matrix()
,
texts_to_sequences_generator()
,
texts_to_sequences()
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 | ## Not run:
# vectorize texts then save for use in prediction
tokenizer <- text_tokenizer(num_words = 10000)
fit_text_tokenizer(tokenizer, texts)
save_text_tokenizer(tokenizer, "tokenizer")
# (train model, etc.)
# ...later in another session
tokenizer <- load_text_tokenizer("tokenizer")
# (use tokenizer to preprocess data for prediction)
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.