jaccard_filter: Vector Filtering by Jaccard Similarity

Description Usage Arguments Value

View source: R/utils-cluster-similarity.R

Description

One of two vectors with too high jaccard similarity is removed. Priority is given to groups that appear higher in the list of vectors, such that the lower-placed vector is removed.

Usage

1
jaccard_filter(x, out = NULL, jac.cut = 0.75)

Arguments

x

list of named vectors. Vectors are assumed to be ordered from most to least significant, such that of two vectors with a similarity score higher than the cut-off, the higher-placed vector will be kept and the lower-placed vector will be thrown.

out

vectors that passed jaccard test.

jac.cut

jaccard value cut off. arg to jaccard.

Value

list of i) x: names of vectors that passed jaccard test and ii) out: vectors that passed jaccard test.


jlaffy/statistrics documentation built on May 23, 2019, 4:04 a.m.