JACCARD_DICE | R Documentation
Jaccard or Dice similarity for text documents
JACCARD_DICE( token_list1 = NULL, token_list2 = NULL, method = "jaccard", threads = 1 )
token_list1 | a list of tokenized text documents (must have the same length as token_list2)
token_list2 | a list of tokenized text documents (must have the same length as token_list1)
method | a character string specifying the similarity metric, either 'jaccard' or 'dice'
threads | a numeric value specifying the number of cores to use in parallel
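The function calculates either the Jaccard or the Dice similarity between corresponding pairs of tokenized texts from the two lists: the first document of token_list1 is compared with the first document of token_list2, the second with the second, and so on.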
a numeric vector
library(textTinyR)

lst1 = list(c('use', 'this', 'function', 'to'),
            c('either', 'compute', 'the', 'jaccard'))

lst2 = list(c('or', 'the', 'dice', 'distance'),
            c('for', 'two', 'same', 'sized', 'lists'))

out = JACCARD_DICE(token_list1 = lst1, token_list2 = lst2, method = 'jaccard', threads = 1)
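
To make the two metrics concrete, the following base R sketch computes them by hand for pairs of token vectors. It assumes the usual set-based definitions (Jaccard = |A ∩ B| / |A ∪ B|, Dice = 2|A ∩ B| / (|A| + |B|)) applied to unique tokens. The helper set_similarity and the sample lists docs1 and docs2 are hypothetical names introduced here for illustration. This is not the package's implementation, and its handling of duplicate or empty token vectors may differ.

```r
# Hypothetical helper (not part of textTinyR): set-based similarity for one pair
# of token vectors, assuming the textbook Jaccard and Dice definitions.
set_similarity <- function(tokens1, tokens2, method = "jaccard") {
  a <- unique(tokens1)
  b <- unique(tokens2)
  shared <- length(intersect(a, b))   # number of tokens present in both documents
  if (method == "jaccard") {
    shared / length(union(a, b))
  } else if (method == "dice") {
    2 * shared / (length(a) + length(b))
  } else {
    stop("method must be 'jaccard' or 'dice'")
  }
}

# Illustrative data: two lists of equal length, compared position by position
docs1 <- list(c("a", "b", "c"), c("x", "y"))
docs2 <- list(c("b", "c", "d"), c("x", "y"))

jaccard_by_hand <- mapply(set_similarity, docs1, docs2,
                          MoreArgs = list(method = "jaccard"))
dice_by_hand <- mapply(set_similarity, docs1, docs2,
                       MoreArgs = list(method = "dice"))
```

Under these definitions the first pair shares two of four distinct tokens, so its Jaccard value is 0.5 and its Dice value is 4/6. The second pair is identical, so both metrics equal 1. Two empty token vectors would give 0/0 (NaN) in this sketch.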