makeDTM: Turn a corpus into a document term matrix


View source: R/topic_modelling.R

Description

Turn a corpus into a document term matrix

Usage

makeDTM(corpus, sparsity, rnames, cols, rows)

Arguments

corpus

- A document corpus

sparsity

- A number between 0 and 1. Lower values remove more of the very common and very uncommon terms

rnames

- A vector of document identifiers (usually papers$UT), one per document in the corpus
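
To make the arguments concrete, here is a minimal base-R sketch of what a document term matrix looks like and how a sparsity threshold can prune it. It does not call makeDTM and is not its implementation. The toy documents, the helper name keep_terms, and the pruning rule (keep a term only if it occurs in more than N * (1 - sparsity) of the N documents, the rule used by removeSparseTerms in the tm package) are assumptions made for illustration. This rule only covers rare terms; any handling of very common terms is not shown.

```r
# Toy documents; the names stand in for identifiers such as papers$UT.
docs <- c(
  d1 = "climate policy and climate models",
  d2 = "energy policy",
  d3 = "climate and energy systems"
)

# Split each document into lower-case tokens and collect the vocabulary.
tokens <- strsplit(tolower(docs), "\\s+")
vocab <- sort(unique(unlist(tokens)))

# Count every vocabulary term in every document.
# vapply returns terms in rows, so transpose to get documents in rows.
dtm <- t(vapply(
  tokens,
  function(tok) as.integer(table(factor(tok, levels = vocab))),
  integer(length(vocab))
))
rownames(dtm) <- names(docs)  # plays the role of the rnames argument
colnames(dtm) <- vocab

# Hypothetical helper (assumed rule, see lead-in): keep a term only if it
# appears in more than nrow(dtm) * (1 - sparse) documents.
keep_terms <- function(dtm, sparse) {
  doc_freq <- colSums(dtm > 0)
  dtm[, doc_freq > nrow(dtm) * (1 - sparse), drop = FALSE]
}

pruned <- keep_terms(dtm, sparse = 0.5)
print(dtm)
print(pruned)
```

Under this assumed rule, a lower sparsity value raises the document-frequency bar, so fewer terms survive; a value close to 1 keeps almost every term.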

Value

A document term matrix built from the corpus


mcallaghan/scimetrix documentation built on May 22, 2019, 12:58 p.m.