ctfidf: Combined tf-idf matrix


View source: R/ctfidf.R

Description

Returns a sparse tf-idf matrix given a condition-gene matrix. This matrix is a D x T matrix, where D is the set of conditions (disease indications) and T is the set of genes affecting those conditions. To correct for the effect or prevalence of genes in a given condition, and for conditions affected by many genes, the tf-idf matrix is computed twice to obtain the combined tf-idf (see Details).

Usage

ctfidf(data_matrix)

Arguments

data_matrix

Sparse document-term matrix (condition-by-gene matrix) with named rows and columns.
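
As a minimal sketch of the expected input, assuming the Matrix package and a made-up long-format table of condition-gene pairs (the object `pairs` and the labels `cond_a`, `gene_1`, etc. are hypothetical), a named sparse condition-by-gene matrix could be assembled as follows. Whether the function expects binary or weighted entries is not stated here, so the value 1 is only a placeholder.

```r
library(Matrix)

# Hypothetical condition-gene pairs; each row says "this gene affects this condition".
pairs <- data.frame(
  condition = c("cond_a", "cond_a", "cond_b", "cond_c"),
  gene      = c("gene_1", "gene_2", "gene_2", "gene_3"),
  stringsAsFactors = FALSE
)

# Factors give a stable integer index and the matching labels for each dimension.
conditions <- factor(pairs$condition)
genes      <- factor(pairs$gene)

# Rows are conditions (D), columns are genes (T); both dimensions are named.
data_matrix <- sparseMatrix(
  i = as.integer(conditions),
  j = as.integer(genes),
  x = rep(1, nrow(pairs)),          # placeholder weight (assumption)
  dims = c(nlevels(conditions), nlevels(genes)),
  dimnames = list(levels(conditions), levels(genes))
)
```

An object built this way has the shape described above and could then be passed as `data_matrix`.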

Details

ctfidf = tfidf(dtm) × t(tfidf(dtm))
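
The formula above can be illustrated with a small sketch. It rests on two assumptions that this page does not confirm: the product is read as a matrix product (giving a condition-by-condition result), and tf-idf is taken as row-normalised counts multiplied by log(D / document frequency). The helpers `tfidf_sketch` and `ctfidf_sketch` are hypothetical stand-ins, not the package's own functions.

```r
library(Matrix)

# Hypothetical tf-idf helper (an assumption, not the package's tfidf()):
# tf  = entry divided by its row total,
# idf = log(number of rows / number of rows in which the column is non-zero).
tfidf_sketch <- function(dtm) {
  row_totals <- rowSums(dtm)
  row_totals[row_totals == 0] <- 1            # guard against empty rows
  doc_freq <- colSums(dtm != 0)
  doc_freq[doc_freq == 0] <- 1                # guard against empty columns
  idf <- log(nrow(dtm) / doc_freq)
  out <- Diagonal(x = 1 / row_totals) %*% dtm %*% Diagonal(x = idf)
  dimnames(out) <- dimnames(dtm)              # keep row and column names
  out
}

# Combined tf-idf under the matrix-product reading of the formula.
ctfidf_sketch <- function(data_matrix) {
  w <- tfidf_sketch(data_matrix)
  w %*% t(w)
}

# Toy 3 x 3 condition-by-gene matrix with made-up labels.
toy <- sparseMatrix(
  i = c(1, 1, 2, 3, 3),
  j = c(1, 2, 2, 1, 3),
  x = rep(1, 5),
  dims = c(3, 3),
  dimnames = list(c("cond_a", "cond_b", "cond_c"),
                  c("gene_1", "gene_2", "gene_3"))
)

res <- ctfidf_sketch(toy)
```

Under these assumptions `res` is a symmetric sparse matrix indexed by condition on both sides, so each entry compares two conditions through their weighted gene profiles.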

Value

A sparse matrix containing the computed combined tf-idf.


diogocamacho/druid documentation built on Sept. 17, 2020, 12:39 p.m.