API for ropensci/textreuse
Detect Text Reuse and Document Similarity

Global functions
TextReuseCorpus Man page Source code
TextReuseTextDocument Man page Source code
TextReuseTextDocument-accessors Man page
`[.TextReuseCorpus` Source code
`[[.TextReuseCorpus` Source code
`content<-.TextReuseTextDocument` Source code
`hashes<-.TextReuseTextDocument` Source code
`hashes<-` Source code
`meta<-.TextReuseCorpus` Source code
`meta<-.TextReuseTextDocument` Source code
`minhashes<-.TextReuseTextDocument` Source code
`minhashes<-` Source code
`names<-.TextReuseCorpus` Source code
`tokens<-.TextReuseTextDocument` Source code
`tokens<-` Source code
align_local Man page Source code
align_local.TextReuseTextDocument Source code
align_local.default Source code
as.character.TextReuseTextDocument Source code
as.matrix.textreuse_candidates Man page Source code
as_string Source code
band_seq Source code
check_banding Source code
content Man page
content.TextReuseTextDocument Source code
`content<-` Man page
digest_progress Source code
filenames Man page Source code
get_apply_function Source code
has_content Man page Source code
has_hashes Man page Source code
has_id Source code
has_minhashes Man page Source code
has_minhashes_corpus Source code
has_tokens Man page Source code
hash_string Man page Source code
hashes Man page Source code
hashes.TextReuseCorpus Source code
hashes.TextReuseTextDocument Source code
`hashes<-` Man page
is.TextReuseCorpus Man page Source code
is.TextReuseTextDocument Man page Source code
is_candidates_df Source code
is_integer_like Source code
is_lsh_buckets Source code
jaccard_bag_similarity Man page Source code
jaccard_bag_similarity.TextReuseTextDocument Source code
jaccard_bag_similarity.default Source code
jaccard_dissimilarity Man page Source code
jaccard_dissimilarity.default Source code
jaccard_similarity Man page Source code
jaccard_similarity.TextReuseTextDocument Source code
jaccard_similarity.default Source code
length.TextReuseCorpus Source code
lsh Man page Source code
lsh.TextReuseCorpus Source code
lsh.TextReuseTextDocument Source code
lsh_candidates Man page Source code
lsh_compare Man page Source code
lsh_probability Man page Source code
lsh_query Man page Source code
lsh_subset Man page Source code
lsh_threshold Man page Source code
mark_chars Source code
meta Man page
meta.TextReuseCorpus Source code
meta.TextReuseTextDocument Source code
`meta<-` Man page
minhash_generator Man page Source code
minhashes Man page Source code
minhashes.TextReuseCorpus Source code
minhashes.TextReuseTextDocument Source code
`minhashes<-` Man page
names.TextReuseCorpus Source code
pairwise_candidates Man page Source code
pairwise_compare Man page Source code
pretty_print_metadata Source code
print.TextReuseCorpus Source code
print.TextReuseTextDocument Source code
print.textreuse_alignment Source code
random_ints Source code
ratio_of_matches Man page Source code
ratio_of_matches.TextReuseTextDocument Source code
ratio_of_matches.default Source code
reexports Man page
rehash Man page Source code
rehash.TextReuseCorpus Source code
rehash.TextReuseTextDocument Source code
shingle_ngrams Source code
similarity-functions Man page
skip_ngrams Source code
skipped Man page Source code
sort_df_by_columns Source code
sort_df_by_rows Source code
sort_meta Source code
sw_matrix Source code
textreuse Man page
textreuse-package Man page
tokenize Man page Source code
tokenize.TextReuseCorpus Source code
tokenize.TextReuseTextDocument Source code
tokenize_ngrams Man page Source code
tokenize_sentences Man page Source code
tokenize_skip_ngrams Man page Source code
tokenize_words Man page Source code
tokenizers Man page
tokens Man page Source code
tokens.TextReuseCorpus Source code
tokens.TextReuseTextDocument Source code
`tokens<-` Man page
using_parallel Source code
wordcount Man page Source code
wordcount.TextDocument Source code
wordcount.TextReuseCorpus Source code
wordcount.default Source code
ropensci/textreuse documentation built on Aug. 8, 2024, 9:17 a.m.