prune_dfm: Feature matrix pruning

Description

Prunes a document-feature matrix (dfm) by dropping features whose document frequency falls outside relative thresholds.

Usage

prune_dfm(dfm, minimum_threshold = 0.005, maximum_threshold = 1)

Arguments

dfm

A sparse document-feature matrix (a sparse matrix class from the Matrix package).

minimum_threshold

Minimum document frequency threshold for relative pruning. Defaults to 0.005.

maximum_threshold

Maximum document frequency threshold for relative pruning. Defaults to 1.
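
To make the idea of relative thresholds concrete, here is a minimal sketch of how pruning by relative document frequency might work. It is not the package's implementation: the helper name prune_by_doc_freq, its parameters min_rel and max_rel, the choice of inclusive bounds, and the layout (documents in rows, features in columns) are all assumptions made for illustration.

```r
library(Matrix)

# Hypothetical helper, NOT part of tmca.classify: keeps the columns
# (features) whose relative document frequency lies in [min_rel, max_rel].
# Assumes documents are rows and features are columns.
prune_by_doc_freq <- function(x, min_rel = 0.005, max_rel = 1) {
  # share of documents in which each feature occurs at least once
  rel_doc_freq <- Matrix::colSums(x > 0) / nrow(x)
  keep <- rel_doc_freq >= min_rel & rel_doc_freq <= max_rel
  x[, keep, drop = FALSE]
}

# 4 documents x 3 features; feature "b" occurs in only 1 of 4 documents
x <- Matrix(c(1, 0, 2,
              3, 0, 1,
              1, 0, 0,
              2, 5, 4),
            nrow = 4, ncol = 3, byrow = TRUE, sparse = TRUE,
            dimnames = list(NULL, c("a", "b", "c")))

pruned <- prune_by_doc_freq(x, min_rel = 0.5)
colnames(pruned)
```

Under these assumptions, feature "b" has a relative document frequency of 0.25 and is dropped at min_rel = 0.5, while "a" (1.0) and "c" (0.75) are kept.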

Value

A pruned sparse Matrix

Examples

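# load Matrix for the Matrix() constructor, and the package providing prune_dfm()
library(Matrix)
library(tmca.classify)
# example feature matrix (20 x 10) with random non-negative counts
m <- Matrix(round(replicate(10, abs(rnorm(20))) * 10))
colnames(m) <- as.character(1:10)
dim(m)
# prune with the default thresholds and compare dimensions
n <- prune_dfm(m)
dim(n)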

tm4ss/tmca.classify documentation built on June 24, 2019, 12:37 p.m.