overlaps: Overlap cleaner

View source: R/ngramComponents.R

overlapsR Documentation

Overlap cleaner

Description

background function to load

Usage

overlaps(high, low, cutoff = 1, verbose = FALSE)

Arguments

high

matrix Token counts that will all be kept.

low

matrix Token counts that will evaluated (and pruned) for overlapping.

cutoff

numeric Threshold (as cosine distance) for including overlapping tokens. Default is 1 (i.e. all tokens included).

Value

Combined token count matrix.


doc2concrete documentation built on May 29, 2024, 8:19 a.m.