global_idf3
.bind_tf_idf2
.norm=TRUE
. Cosine nomalization is now performed on tf_idf
values as in the RMeCab package.tf="itf"
and idf="df"
options.pack
for performance.tokenize_mecab
and tokenize_sudachipy
.bind_lr
function which can calculate the 'LR' value of bigrams.pack
now always returns a tibble, not a data.frame.bind_tf_idf2
can calculate and bind the term frequency, inverse document frequency, and tf-idf of the tidy text dataset.collapse_tokens
, mute_tokens
, and lexical_density
can be used for handling a tidy text dataset of tokens.strj_tokenize
now preserves the original order of text names.prettify
now can get delim
argument.strj_fill_iter_mark
function.strj_fill_iter_mark
now replaces a sequence of iteration marks recursively.strj_tokenize
function.strj_tokenize
now can retrieve engine
argument to switch tokenizers for splitting text into tokens.ngram_tokenizer
function.pack
function.pack
function. pack
now accepts pull
as its second argument and n
as its third argument.pull
now can accept a symbol.NEWS.md
file to track changes to the package.Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.