tf_idf: find the most relevant document for a query by computing...


View source: R/tf_idf.R

Description

Finds the most relevant document for a query by computing tf-idf for a set of terms within a corpus.

Usage

tf_idf(corpus, terms, own_regex = FALSE)

Arguments

corpus

The text or texts to be analyzed, supplied as a list of character vectors.

terms

A character vector of terms (or a regular expression if own_regex is TRUE) used to determine document relevance.

own_regex

When TRUE, lets you supply your own regular expression in the terms argument. When FALSE, rbow constructs a regular expression from the character vector you supplied. Defaults to FALSE.
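
To illustrate what building a regular expression from a character vector can look like, here is a minimal base R sketch. The helper terms_to_regex is hypothetical and is not part of rbow; the pattern rbow actually builds may differ. The sketch assumes the terms are plain words without regex metacharacters.

```r
# Hypothetical helper, not part of rbow: join plain-word terms into a single
# whole-word alternation pattern such as "\\b(cat|dog)\\b".
# Assumption: terms contain no regex metacharacters, so no escaping is done.
terms_to_regex <- function(terms) {
  paste0("\\b(", paste(terms, collapse = "|"), ")\\b")
}

pattern <- terms_to_regex(c("cat", "dog"))

# Whole-word matching: "cat" matches, but the "cat" inside "concatenate" does not.
grepl(pattern, c("a cat sat here", "concatenate strings"))
```

With own_regex = TRUE you would pass a pattern of your own in terms instead of relying on a constructed one.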

Value

A data frame with the document number and mean tf-idf of each document.
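
The following base R sketch shows one way a per-document mean tf-idf could be computed and returned in this shape. It is an illustration only, not the rbow implementation: the function name mean_tf_idf, the column names, the word-splitting rule, and the tf-idf variant (term share of the document times log(N / document frequency)) are all assumptions.

```r
# Illustrative sketch only; not the rbow implementation.
# corpus: list of character vectors, one list element per document.
# terms:  plain words to score (matched as whole lower-case words).
mean_tf_idf <- function(corpus, terms) {
  # Collapse each document to one lower-case string and split it into words.
  tokens <- lapply(corpus, function(doc) {
    words <- unlist(strsplit(tolower(paste(doc, collapse = " ")), "[^a-z0-9]+"))
    words[nzchar(words)]
  })
  n_docs <- length(tokens)
  terms <- tolower(terms)

  # Term frequency: share of a document's words that equal the term.
  tf <- sapply(terms, function(term) {
    sapply(tokens, function(words) {
      if (length(words) == 0) 0 else sum(words == term) / length(words)
    })
  })
  tf <- matrix(tf, nrow = n_docs, ncol = length(terms))

  # Inverse document frequency: log(N / documents containing the term).
  # Terms that occur nowhere get an idf of 0 to avoid dividing by zero.
  doc_freq <- colSums(tf > 0)
  idf <- ifelse(doc_freq > 0, log(n_docs / doc_freq), 0)

  # Weight each term column by its idf, then average over terms per document.
  scores <- sweep(tf, 2, idf, `*`)
  data.frame(document = seq_len(n_docs), mean_tf_idf = rowMeans(scores))
}

corpus <- list(
  c("the cat sat on the mat"),
  c("the dog chased the cat", "the cat ran away"),
  c("stocks fell sharply on monday")
)
result <- mean_tf_idf(corpus, c("cat", "dog"))

# The row with the highest mean tf-idf is the most relevant document.
result[which.max(result$mean_tf_idf), ]
```

In this toy corpus the second document ranks highest because it contains both terms, while the third document, which contains neither, scores 0.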


till-tietz/rbow documentation built on Oct. 21, 2021, 9:16 p.m.