multi_doc_compare: Multiple document comparison for textual overlap

Description Usage Arguments Value

Description

Multiple document comparison for textual overlap

Usage

1
multi_doc_compare(texts, n_grams, sd_criterion)

Arguments

texts

character vector of texts, each text is a string in the vector

n_grams

integer to specify ngram units

sd_criterion

numeric set a standard deviation criterion for returning documents that are unsually similar, 2-3 is pretty good

Value

list


CrumpLab/playjareyesores documentation built on June 25, 2019, 8:29 a.m.