Create an LDA dataset from existing string vector. Each entry in the vector must be a string with the following format: <unique id>\t<doc class>\t<document content> The document class is not used in by the LDA sampler. The document content CAN have \t in it.
1 | create_lda_dataset(train, test = NULL, stoplist_fn = "stoplist.txt")
|
train |
string vector with document data |
test |
string vector with test document data |
stoplist_fn |
filiename of stoplist file |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.