tCorpus_docsim: Document similarity

tCorpus_docsimR Documentation

Document similarity

Description

(back to overview)

Details

Compare documents, and perform similarity based deduplication

compare_documents() Compare documents
$deduplicate() Remove duplicate documents

corpustools documentation built on Aug. 8, 2025, 6:08 p.m.