create_lda_dataset: create_lda_dataset


View source: R/pcldar.R

Description

Creates an LDA dataset from an existing string vector. Each entry in the vector must be a string in the following format: <unique id>\t<doc class>\t<document content>. The document class is not used by the LDA sampler. The document content may itself contain \t characters.
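As a minimal sketch of the input format, the base R snippet below assembles such a vector from a data frame. The data frame `docs` and its column names are hypothetical and serve only as illustration. The split at the end assumes that only the first two tabs act as field separators, which is what the description implies but is not verified against the package source.

```r
# Hypothetical example data; the column names are illustrative only.
docs <- data.frame(
  id      = c("doc1", "doc2"),
  class   = c("sports", "politics"),
  content = c("the match ended\tin a draw", "parliament voted on the bill"),
  stringsAsFactors = FALSE
)

# Build one "<unique id>\t<doc class>\t<document content>" string per document.
train <- paste(docs$id, docs$class, docs$content, sep = "\t")

# Recover the three fields by splitting on the first two tabs only,
# so that tabs inside the document content are preserved.
parts  <- regmatches(train, regexec("^([^\t]*)\t([^\t]*)\t(.*)$", train))
fields <- do.call(rbind, lapply(parts, function(p) p[2:4]))
colnames(fields) <- c("id", "class", "content")
```

Pasting with `sep = "\t"` keeps each document on a single vector entry, and the first document shows that a tab inside the content does not need escaping.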

Usage

create_lda_dataset(train, test = NULL, stoplist_fn = "stoplist.txt")

Arguments

train

string vector with training document data, one document per entry in the format given in the Description

test

optional string vector with test document data in the same format (defaults to NULL)

stoplist_fn

filename of the stoplist file (defaults to "stoplist.txt")
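A hedged usage sketch follows, based only on the signature shown under Usage. The document strings are made up for illustration. The sketch also assumes that the function is exported from the package and that a stoplist file named "stoplist.txt" exists in the working directory, so the call is guarded and is skipped when either is missing.

```r
# Illustrative training and test vectors in "<id>\t<class>\t<content>" format.
train <- c("doc1\tA\tfirst training document",
           "doc2\tB\tsecond training document")
test  <- c("doc3\tA\theld-out document")

# Assumption: pcldar is installed and "stoplist.txt" is present.
# The call is skipped otherwise, so the snippet runs without the package.
if (requireNamespace("pcldar", quietly = TRUE) && file.exists("stoplist.txt")) {
  ds <- pcldar::create_lda_dataset(train, test = test,
                                   stoplist_fn = "stoplist.txt")
}
```

Leaving `test` at its default of NULL builds the dataset from the training documents only.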


lejon/pcldar documentation built on Feb. 23, 2020, 3:11 p.m.