Associated Press data from the First Text Retrieval Conference (TREC-1) 1992.
The data set is an object of class
provided by package tm. It is a document-term matrix which
contains the term frequency of 10473 terms in 2246 documents.
Accompanying material to the source code for fitting LDA models provided by David M. Blei and co-authors. Downloaded from: http://www.cs.columbia.edu/~blei/
D. Harman (1992) Overview of the first text retrieval conference (TREC-1). In Proceedings of the First Text Retrieval Conference (TREC-1), 1–20.