wgt_jaccard_distance: Computing Weighted Jaccard Distance

Description Usage Arguments Details Value

View source: R/wgt_jaccard_distance.R

Description

#' wgt_jaccard_distance computes the Weighted Jaccard Distance between two strings. It is vectorized, and accepts only two equal-length string vectors.

Usage

1
wgt_jaccard_distance(string_1, string_2, corpus, nthreads = 1)

Arguments

string_1

character vector

string_2

character vector

corpus

corpus data.table, constructed with fedmatch::build_corpus

nthreads

number of threads to use in the underlying C++ code

Details

See the vignette fuzzy_matching for details on how the Weighted Jaccard similarity is computed.

Value

numeric vector with the Weighted Jaccard distances for each element of string_1 and string_2.


fedmatch documentation built on Nov. 23, 2021, 1:07 a.m.