fix R CMD check warning due to change in version 0.2.2.
in third_party/protobuf-lite/strutil.cc:506:33: warning: argument to ‘sizeof’ in ‘int snprintf(char, size_t, const char, ...)’ call is the same expression as the destination; did you mean to provide an explicit length? [-Wsizeof-pointer-memaccess]
this part of third_party/protobuf-lite/strutil.cc was not used in sentencepiece
CHANGES IN sentencepiece VERSION 0.2.2
use snprintf instead of sprintf to handle the R CMD check deprecating note on M1mac
CHANGES IN sentencepiece VERSION 0.2.1
Fix for clang-UBSAN error
CHANGES IN sentencepiece VERSION 0.2
Fix wordpiece bug for 1-character words. (@jonthegeek, #4)
Upgraded to sentencepiece release v0.1.96
CHANGES IN sentencepiece VERSION 0.1.3
Fix wordpiece bug for 1-character words. (@jonthegeek, #4)
Fix Solaris installation issue related to incorrect usage of pointer as a function
Also download the binary model in sentencepiece_download_model as it can be loaded with word2vec::read.wordvectors
read_word2vec now uses word2vec::read.wordvectors from word2vec >= 0.2.0
added BPEembed and predict.BPEembed
allow subword regularisation by adding nbest and alpha option in sentencepiece_encode and changed sentencepiece_decode accordingly
Added txt_remove_
Upgrade sentencepiece to release v0.1.91 commit a32d7dc6ce6f383a65ad6e1cbe1983f94ab11932 which has subword regularisation for BPE
CHANGES IN sentencepiece VERSION 0.1.2
Fix Solaris installation issue which used log of uint64 which is not defined on Solaris