calc_class_term_frequency: Calculate class term frequency matrix

Description Usage Arguments Value Examples

View source: R/vsm.R

Description

Computes the class term frequency matrix for a given corpus.

Usage

1
calc_class_term_frequency(class.labels, vocab, doc.class.labels, documents)

Arguments

class.labels

A vector of true class names

vocab

a vector of words in the vocabulary

doc.class.labels

a vector of document classes

documents

a list documents read via read_docs

Value

The class term frequency matrix

Examples

1
2
3
4
5
6
7
8
documents <- read_docs('bop.ldac');
vocab <- readLines('bop.ldac.vocab');

doc.metadata <- read.csv2('bop.csv', header = T, sep = ';');
class.labels <- levels(doc.metadata[, 'category'])
ctf <- calc_class_term_frequency(class.labels, vocab, 
                                 doc.metadata[, 'category'], 
                                 documents);

clintpgeorge/ldamcmc documentation built on Feb. 22, 2020, 12:39 p.m.