Converts pre-processed document matrices stored in popular formats to stm format.
An input file or filepath to be processed
The type of input file. We offer several sources, see details.
This function provides a simple utility for converting other document
formats to the stm format. Briefly:
dtm takes a standard matrix as input
and converts it to our format.
slam converts from the
simple_triplet_matrix representation used by the slam package.
This is also the representation of corpora in the popular tm package,
so it should work in those cases.
dtm expects a matrix object where each row represents a document and
each column represents a word in the dictionary.
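To make the dtm layout concrete, here is a minimal base R sketch of what such a conversion amounts to. It assumes the stm documents format is a list with one two-row integer matrix per document (first row: 1-indexed vocabulary positions, second row: counts). The helper name `dtm_to_documents` is hypothetical and is not part of stm; the package's own conversion may differ in details such as how empty documents are handled.

```r
# Hypothetical helper (not part of stm): convert a dense count matrix,
# with documents in rows and vocabulary terms in columns, into a list of
# two-row matrices, one per document.
dtm_to_documents <- function(dtm) {
  stopifnot(is.matrix(dtm))
  documents <- lapply(seq_len(nrow(dtm)), function(i) {
    idx <- which(dtm[i, ] > 0)      # columns of the terms present in document i
    rbind(as.integer(idx),          # row 1: 1-indexed vocabulary positions
          as.integer(dtm[i, idx]))  # row 2: counts for those terms
  })
  names(documents) <- rownames(dtm)
  # The vocabulary can only be recovered if the matrix has column names.
  list(documents = documents, vocab = colnames(dtm))
}

# Toy input: two documents over a three-word dictionary.
dtm <- matrix(c(2, 0, 1,
                0, 3, 0),
              nrow = 2, byrow = TRUE,
              dimnames = list(c("doc1", "doc2"),
                              c("apple", "banana", "cherry")))
out <- dtm_to_documents(dtm)
```

Storing only the nonzero entries per document is what makes this representation compact for sparse corpora, where most terms never appear in a given document.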
slam expects a
simple_triplet_matrix from that package.
Matrix attempts to coerce the matrix to a
simple_triplet_matrix and then converts it using the
functionality built for the
slam package. This will work for most
applicable classes in the
Matrix package, such as dgCMatrix.
If you are trying to read a
.ldac file, see readLdac.
A documents object in our format
A vocab object if information is available to construct one
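A short usage sketch follows, assuming this page documents the `readCorpus` function from the stm package and that its return value is a list with `documents` and `vocab` elements, as the descriptions above suggest. The toy matrix is invented for illustration, and the call is guarded so the script still runs when stm is not installed.

```r
# Toy document-term matrix: rows are documents, columns are vocabulary terms.
dtm <- matrix(c(2, 0, 1,
                0, 3, 0),
              nrow = 2, byrow = TRUE,
              dimnames = list(c("doc1", "doc2"),
                              c("apple", "banana", "cherry")))

converted <- NULL
# Guarded: only attempt the conversion if the stm package is available.
if (requireNamespace("stm", quietly = TRUE)) {
  converted <- stm::readCorpus(dtm, type = "dtm")
  str(converted$documents)  # the documents object in stm format
  print(converted$vocab)    # the vocabulary, if one could be constructed
}
```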