pairwise_overlap: Pairwise Overlap Coefficient

View source: R/pairwise_overlap.R

pairwise_overlap    R Documentation

Pairwise Overlap Coefficient

Description

Calculates the pairwise overlap (Szymkiewicz–Simpson) coefficient, a similarity measure related to the Jaccard index. It is defined as the size of the intersection of two sets divided by the size of the smaller of the two sets.
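The definition can be illustrated on two plain sets. The following is a minimal sketch in base R; `overlap_sets()` and `jaccard_sets()` are hypothetical helpers written for this illustration and are not part of the package.

```r
# Hypothetical helpers for illustration only (not exported by the package).

# Overlap coefficient: |X n Y| / min(|X|, |Y|)
overlap_sets <- function(x, y) {
  x <- unique(x)
  y <- unique(y)
  length(intersect(x, y)) / min(length(x), length(y))
}

# Jaccard index, for comparison: |X n Y| / |X u Y|
jaccard_sets <- function(x, y) {
  length(intersect(x, y)) / length(union(x, y))
}

x <- c("a", "b", "c")
y <- c("b", "c", "d", "e")

overlap_sets(x, y)   # 2 shared items / smaller set of size 3
jaccard_sets(x, y)   # 2 shared items / union of size 5

# When one set is contained in the other, the overlap coefficient is 1,
# while the Jaccard index stays below 1.
overlap_sets(c("a", "b"), c("a", "b", "c"))
jaccard_sets(c("a", "b"), c("a", "b", "c"))
```

Note that this sketch returns NaN when either set is empty, because the denominator is then zero.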

Usage

pairwise_overlap(data, cols, case_weights)

Arguments

data

A data frame.

cols

Columns to analyze.

case_weights

An optional column of case weights.

Details

  • Metric: Similarity

  • Symmetrical: Yes

  • Upper Limit: 1

  • Lower Limit: 0

              B
            1   0
    A   1   a   b
        0   c   d

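Overlap = a / min(a + b, a + c)

where a is the number of cases with A = 1 and B = 1, b is the number with A = 1 and B = 0, and c is the number with A = 0 and B = 1.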
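The formula can be checked by hand on two binary (0/1) vectors. The sketch below uses only base R; `overlap_binary()` is a hypothetical helper for this illustration, takes no case weights, and is not claimed to match how the package computes its result internally.

```r
# Hypothetical helper for illustration only (not exported by the package).
# A and B are 0/1 vectors of equal length.
overlap_binary <- function(A, B) {
  n_a <- sum(A == 1 & B == 1)  # cell a: both are 1
  n_b <- sum(A == 1 & B == 0)  # cell b: only A is 1
  n_c <- sum(A == 0 & B == 1)  # cell c: only B is 1
  n_a / min(n_a + n_b, n_a + n_c)
}

A <- c(1, 1, 1, 0, 0, 1)
B <- c(1, 0, 1, 1, 0, 0)

# a = 2, b = 2, c = 1, so the result is 2 / min(4, 3) = 2/3
overlap_binary(A, B)

# Swapping the arguments swaps b and c, which leaves min() unchanged,
# so the measure is symmetrical.
overlap_binary(B, A)
```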

Value

A matrix.
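To give a sense of what a pairwise matrix of this coefficient looks like, here is a minimal base R sketch. `pairwise_overlap_sketch()` and the `toy` data frame are hypothetical, the sketch ignores case weights, and the exact structure and class of the object returned by `pairwise_overlap()` are not specified here, so treat this as an illustration under those assumptions.

```r
# Hypothetical sketch for illustration only (not the package's implementation).
# df is a data frame whose columns are all 0/1.
pairwise_overlap_sketch <- function(df) {
  m <- as.matrix(df)
  both <- crossprod(m)         # entry [i, j]: rows where columns i and j are both 1
  n <- diag(both)              # number of 1s in each column
  both / outer(n, n, pmin)     # divide by the smaller of the two column totals
}

toy <- data.frame(
  x = c(1, 1, 0, 1),
  y = c(1, 0, 0, 1),
  z = c(0, 1, 1, 0)
)

res_sketch <- pairwise_overlap_sketch(toy)
res_sketch
```

In this toy data, every case with y = 1 also has x = 1, so the x/y entry is 1, while y and z never co-occur, so their entry is 0.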

Examples

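# Assumes the onezero package is installed; the example relies on it
# for pairwise_overlap() and the FoodSample data set.
library(onezero)

# Overlap coefficient for every pair of columns from Bisque through Turkey
res <- pairwise_overlap(
    data = FoodSample,
    cols = Bisque:Turkey
)

# Show the result
print(res)

# Tidy version of the result
tidy(res)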


ttrodrigz/onezero documentation built on May 9, 2023, 2:59 p.m.