doccount_corpus: Doccount of Corpus


Description

Returns the total number of words in each document of the corpus. Useful as input to functions such as ldavisCpp, which need document lengths to estimate topic proportions in the corpus.

Usage

doccount_corpus(ipath, ncores, flag = 0)

Arguments

ipath

A string specifying the path to the input files.

ncores

A number specifying the number of cores to use.

flag

**optional** A number specifying how documents are delimited: 0 (the default) if each line of a file is a separate document, 1 if each text file contains exactly one document.
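The two delimiting modes can be illustrated with a small base-R sketch. This is not the package's implementation: the helper names (count_words, count_by_line, count_by_file) are hypothetical, and it assumes a word is any whitespace-separated token, which may differ from how doccount_corpus tokenizes or orders documents.

```r
# Illustrative sketch only; assumes words are whitespace-separated tokens.
count_words <- function(text) {
  tokens <- strsplit(trimws(text), "\\s+")
  # Count non-empty tokens so blank documents yield 0.
  vapply(tokens, function(x) sum(nzchar(x)), integer(1))
}

# Analogue of flag = 0: every line of every file is one document.
count_by_line <- function(files) {
  lines <- as.character(unlist(lapply(files, readLines, warn = FALSE)))
  count_words(lines)
}

# Analogue of flag = 1: every file is one document.
count_by_file <- function(files) {
  vapply(files,
         function(f) sum(count_words(readLines(f, warn = FALSE))),
         integer(1), USE.NAMES = FALSE)
}

# Build a tiny throwaway corpus to compare the two modes.
dir <- tempfile("corpus")
dir.create(dir)
writeLines(c("the quick brown fox", "jumps over"), file.path(dir, "a.txt"))
writeLines("the lazy dog", file.path(dir, "b.txt"))
files <- sort(list.files(dir, full.names = TRUE))

count_by_line(files)  # 4 2 3
count_by_file(files)  # 6 3
```

With newline-delimited documents the result has one element per line; with one document per file it has one element per file.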

Value

An integer vector with one element per document, giving that document's word count.

Examples

## Not run: 
doccount_corpus("/path/to/corpus/", 20)

## End(Not run)
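As a rough illustration of why per-document word counts help when estimating topic proportions, the sketch below weights a document-topic matrix by document length. The values of doc_length and theta are made up, and this is only one common way of computing corpus-level topic proportions; it is not taken from ldavisCpp.

```r
# Hypothetical word counts, as doccount_corpus might return for 3 documents.
doc_length <- c(4L, 2L, 3L)

# Hypothetical document-topic matrix: one row per document, rows sum to 1.
theta <- matrix(c(0.5, 0.5,
                  1.0, 0.0,
                  0.0, 1.0), nrow = 3, byrow = TRUE)

# Expected number of words assigned to each topic across the corpus.
# doc_length is recycled down each column, scaling row i by doc_length[i].
topic_tokens <- colSums(theta * doc_length)

# Share of the corpus attributed to each topic.
topic_proportion <- topic_tokens / sum(topic_tokens)
topic_proportion
```

Longer documents contribute more to the corpus-level proportions than shorter ones, which is why the lengths are needed alongside the per-document topic mixtures.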

avkoehl/textprocessingDSI documentation built on June 5, 2019, 7:41 p.m.