Man pages for wordpiece
R Implementation of Wordpiece Tokenization

dot-get_casednessDetermine Casedness of Vocabulary
dot-infer_case_from_vocabDetermine Vocabulary Casedness
dot-new_wordpiece_vocabularyConstructor for Class wordpiece_vocabulary
dot-process_vocabProcess a Vocabulary for Tokenization
dot-process_wp_vocabProcess a Wordpiece Vocabulary for Tokenization
dot-validate_wordpiece_vocabularyValidator for Objects of Class wordpiece_vocabulary
dot-wp_tokenize_single_stringTokenize an Input Word-by-word
dot-wp_tokenize_wordTokenize a Word
load_or_retrieve_vocabLoad a vocabulary file, or retrieve from cache
load_vocabLoad a vocabulary file
prepare_vocabFormat a Token List as a Vocabulary
reexportsObjects exported from other packages
set_wordpiece_cache_dirSet a Cache Directory for wordpiece
wordpiece_cache_dirRetrieve Directory for wordpiece Cache
wordpiece_tokenizeTokenize Sequence with Word Pieces
wordpiece documentation built on March 18, 2022, 5:55 p.m.