build_edges: Build edge data frame between documents


View source: R/edges.r

Description

Builds an edge list linking pairs of documents whose shingle sets are similar. The candidates list is used to limit which documents are compared, which makes the comparison much faster.
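As a rough illustration of the similarity measure involved, the Jaccard similarity of two shingle sets is the size of their intersection divided by the size of their union. The helper `jaccard_sketch` below is a hypothetical name used only for this example and is not part of lshr; returning 0 for two empty sets is likewise an assumption.

```r
# Jaccard similarity of two shingle sets: |A intersect B| / |A union B|.
# `jaccard_sketch` is a hypothetical helper, not an lshr function.
jaccard_sketch <- function(a, b) {
  a <- unique(a)
  b <- unique(b)
  n_union <- length(union(a, b))
  if (n_union == 0) return(0)  # assumed convention for two empty sets
  length(intersect(a, b)) / n_union
}

doc_a <- c("the quick", "quick brown", "brown fox")
doc_b <- c("the quick", "quick brown", "brown dog")

# 2 shared shingles out of 4 distinct shingles
sim <- jaccard_sketch(doc_a, doc_b)
```

Here `sim` is 0.5, so this pair would fall below the default threshold of 0.8.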

Usage

build_edges(candidates, shingles, threshold = 0.8)

Arguments

candidates

List of buckets, each holding document ids, as produced by lsh.

shingles

List of documents and their shingles, as produced by shingle.

threshold

Jaccard similarity threshold used to decide whether two documents are similar. Defaults to 0.8.
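The following is a minimal sketch of how these three arguments could fit together, not the package's actual source. It assumes that `candidates` is a list of character vectors of document ids (one vector per bucket), that `shingles` is a list named by document id, that a pair is kept when its similarity is at least `threshold`, and that the result has columns `from`, `to`, and `similarity`. The function name `build_edges_sketch` and all of these shapes are hypothetical.

```r
# Hypothetical stand-in for the behaviour described above; not lshr's source.
build_edges_sketch <- function(candidates, shingles, threshold = 0.8) {
  jaccard <- function(a, b) {
    n_union <- length(union(a, b))
    if (n_union == 0) return(0)  # assumed convention for two empty sets
    length(intersect(a, b)) / n_union
  }

  # Collect every unordered pair of ids that share at least one bucket.
  pairs <- list()
  for (bucket in candidates) {
    ids <- sort(unique(bucket))
    if (length(ids) < 2) next
    pairs[[length(pairs) + 1]] <- t(combn(ids, 2))
  }

  empty <- data.frame(from = character(0), to = character(0),
                      similarity = numeric(0), stringsAsFactors = FALSE)
  if (length(pairs) == 0) return(empty)

  # Drop pairs that appeared in more than one bucket.
  pairs <- unique(do.call(rbind, pairs))

  # Compare only the candidate pairs, never all documents against each other.
  sims <- vapply(seq_len(nrow(pairs)), function(i) {
    jaccard(shingles[[pairs[i, 1]]], shingles[[pairs[i, 2]]])
  }, numeric(1))

  keep <- sims >= threshold  # assumed: ties at the threshold are kept
  data.frame(from = pairs[keep, 1], to = pairs[keep, 2],
             similarity = sims[keep], stringsAsFactors = FALSE)
}

# Toy inputs: d1 and d2 are identical, d3 shares 2 of 8 distinct shingles.
shingles <- list(
  d1 = c("a b", "b c", "c d", "d e", "e f"),
  d2 = c("a b", "b c", "c d", "d e", "e f"),
  d3 = c("a b", "b c", "x y", "y z", "z w")
)
candidates <- list(c("d1", "d2"), c("d1", "d2", "d3"), c("d3"))

edges <- build_edges_sketch(candidates, shingles, threshold = 0.8)
loose <- build_edges_sketch(candidates, shingles, threshold = 0.2)
```

With the default-like threshold of 0.8 only the d1/d2 pair survives, while lowering the threshold to 0.2 also admits the pairs involving d3. Sorting ids within each bucket is a design choice of this sketch so that a pair found in several buckets is stored in one orientation and can be deduplicated.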


zamorarr/lshr documentation built on April 24, 2021, 11:35 p.m.