generateTDM: Generate term document frequency table from corpus

Description Usage Arguments Details Value See Also

Description

This function builds term documement sparse matrix

Usage

1
generateTDM(data, N, isTrace = F)

Arguments

data

It can be text corpus/data cleaned by cleanTextData

N

size of n-gram model

isTrace

for debugging purpose, use this if you want to track time to build model.

Details

This function generates terms with N number of words specified in argument. This can be used in many tasks like information retrival, document similarity etc.

Value

term document matrix for terms having N words

See Also

TermDocumentMatrix buildNgramModel


achalshah20/ANLP documentation built on May 10, 2019, 5:10 a.m.