Description Usage Arguments Value Examples
View source: R/lsh_candidates.R
Given a data frame of LSH buckets returned from lsh
, this
function returns the potential candidates.
1 | lsh_candidates(buckets)
|
buckets |
A data frame returned from |
A data frame of candidate pairs.
1 2 3 4 5 6 7 | dir <- system.file("extdata/legal", package = "textreuse")
minhash <- minhash_generator(200, seed = 234)
corpus <- TextReuseCorpus(dir = dir,
tokenizer = tokenize_ngrams, n = 5,
minhash_func = minhash)
buckets <- lsh(corpus, bands = 50)
lsh_candidates(buckets)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.