ctfidf: Combined tf-idf matrix


Description

Returns a sparse combined tf-idf matrix given a condition-gene matrix. The condition-gene matrix is a D x T matrix, where D is the set of conditions (disease indications) and T is the set of genes affecting those conditions. To correct for the effect or prevalence of genes in a given condition, and for conditions affected by many genes, the tf-idf matrix is computed twice and the two are multiplied to give the combined tf-idf (see Details).

Usage

ctfidf(data_matrix)

Arguments

data_matrix

Sparse document-term matrix (condition-by-gene matrix) with named rows and columns.

Details

ctfidf = tfidf(dtm) × t(tfidf(dtm)), where dtm is the document-term matrix passed as data_matrix and t() denotes the matrix transpose.
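
The product in the formula can be illustrated with a minimal sketch in R. The helper `tfidf_sketch()` is hypothetical and uses one common tf-idf weighting (row-normalised term frequency times log inverse document frequency). The weighting that cauldron applies internally is not stated here and may differ. The toy matrix and its row and column names are also made up for illustration, and the sketch assumes every condition has at least one gene.

```r
library(Matrix)

# Hypothetical helper: one common tf-idf weighting. This is an assumption,
# not necessarily the weighting used inside ctfidf().
tfidf_sketch <- function(dtm) {
  tf  <- Diagonal(x = 1 / rowSums(dtm)) %*% dtm   # row-normalised term frequency
  idf <- log(nrow(dtm) / colSums(dtm > 0))        # inverse document frequency per gene
  w   <- tf %*% Diagonal(x = idf)                 # scale each gene column by its idf
  dimnames(w) <- dimnames(dtm)                    # restore row and column names
  w
}

# Toy condition-by-gene matrix with named rows and columns (made-up data)
dtm <- Matrix(
  c(1, 1, 0,
    0, 1, 1,
    1, 0, 0),
  nrow = 3, byrow = TRUE, sparse = TRUE,
  dimnames = list(c("cond_A", "cond_B", "cond_C"),
                  c("gene_1", "gene_2", "gene_3"))
)

w <- tfidf_sketch(dtm)

# tfidf(dtm) x t(tfidf(dtm)): a matrix product, so the result is
# indexed by condition on both rows and columns
combined <- w %*% t(w)
dim(combined)
```

Because a D x T matrix is multiplied by its T x D transpose, the result in this sketch is a symmetric D x D matrix whose entries compare pairs of conditions through their weighted genes.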

Value

A sparse matrix with the computed combined tf-idf


diogocamacho/cauldron documentation built on June 6, 2019, 8:45 a.m.