R/data.R

#' Associated Press data
#'
#' Associated Press data from the First Text Retrieval Conference (TREC-1) 1992, 
#' which has being processed by stop-words removal, 
#' low-frequency words removal and short documents removal.
#' @usage data("AP")
#' @keywords datasets
#' @format The data set is an object of class "\code{simple_triplet_matrix}" provided by package \strong{slam}. 
#' It is a word-document matrix which contains the term frequency of 7000 words in 2134 documents.
#' @source Harman, D. (1992, November). Overview of the First Text REtrieval Conference (TREC-1). In TREC (Vol. 1992, pp. 1-20).
"AP"

Try the TopicScore package in your browser

Any scripts or data that you put into this service are public.

TopicScore documentation built on June 6, 2019, 5:06 p.m.