Man pages for sentencepiece
Text Tokenization using Byte Pair Encoding and Unigram Modelling

BPEembedTokenise and embed text alongside a Sentencepiece and...
BPEembedderBuild a BPEembed model containing a Sentencepiece and...
predict.BPEembedEncode and Decode alongside a BPEembed model
read_word2vecRead a word2vec embedding file
sentencepieceConstruct a Sentencepiece model
sentencepiece_decodeDecode encoded sequences back to text
sentencepiece_download_modelDownload a Sentencepiece model
sentencepiece_encodeTokenise text alongside a Sentencepiece model
sentencepiece_load_modelLoad a Sentencepiece model
txt_remove_Remove prefixed underscore
wordpiece_encodeWordpiece encoding
sentencepiece documentation built on Nov. 13, 2022, 5:05 p.m.