generateTDM: Generate term document frequency table from corpus

Description Usage Arguments Details Value See Also

View source: R/ANLP.R

Description

This function builds term documement sparse matrix

Usage

1
generateTDM(data, N, isTrace = F)

Arguments

data

It can be text corpus/data cleaned by cleanTextData

N

size of n-gram model

isTrace

for debugging purpose, use this if you want to track time to build model.

Details

This function generates terms with N number of words specified in argument. This can be used in many tasks like information retrival, document similarity etc.

Value

term document matrix for terms having N words

See Also

TermDocumentMatrix buildNgramModel


ANLP documentation built on May 30, 2017, 4:42 a.m.