A collection of restaurant reviews from Yelp. The corpus consists of 24,310 reviews and a vocabulary of 9,517 unique words.
data("yelp")
vocab
a vector of unique words in the corpus vocabulary.
docs
a list of documents in the corpus. Each item represents a
document and is a 2 x U matrix of word frequencies, where U is the
number of unique words in that document. Each column of the matrix represents
a unique word in the document and contains
vocabulary-id. the index of the word in the vocabulary (starting at 0)
frequency. the relative frequency of the word in the document
docs.metadata
a matrix of document (article) metadata, where each
row represents a document with
doc.id. a unique article id
review.id.
reviewer.id.
rating. customer rating
restaurant. restaurant name
row.word.count. the number of words in the article
category. the category of the review
cids
a vector of document collection ids
class.labels
a vector of categories (classes) in the corpus
collection.labels
a vector of collections in the corpus
ds.name
the corpus name (string)
num.docs
the number of documents in the corpus
V
the vocabulary size
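Because the vocabulary ids stored in docs start at 0 while R vectors are indexed from 1, decoding a document takes a small shift. The following is a minimal sketch of that lookup, plus a few sanity checks implied by the field descriptions above. It runs on a hypothetical toy corpus (the corpus list and its values are made up for illustration and are not the real yelp data), and it assumes the fields are laid out exactly as described.

```r
# Hypothetical toy corpus that mimics the documented field layout
# (illustrative values only, not the real yelp data).
corpus <- list(
  vocab = c("pizza", "service", "great", "slow"),
  docs = list(
    rbind(c(0, 2, 3), c(0.5, 0.25, 0.25)),
    rbind(c(1, 3), c(0.5, 0.5))
  ),
  num.docs = 2,
  V = 4
)

# Decode the first document: row 1 holds 0-based vocabulary ids,
# row 2 holds the corresponding frequencies.
doc <- corpus$docs[[1]]

# R vectors are 1-based, so shift the ids by one before indexing vocab.
decoded <- data.frame(
  word = corpus$vocab[doc[1, ] + 1],
  frequency = doc[2, ],
  stringsAsFactors = FALSE
)
print(decoded)

# Sanity checks that follow from the field descriptions.
ids <- unlist(lapply(corpus$docs, function(d) d[1, ]))
checks <- c(
  vocab.matches.V = length(corpus$vocab) == corpus$V,
  docs.match.num.docs = length(corpus$docs) == corpus$num.docs,
  ids.in.range = all(ids >= 0 & ids <= corpus$V - 1)
)
print(checks)
```

If the loaded dataset follows the same layout, the same indexing should apply to any element of docs together with vocab; forgetting the + 1 shift would silently drop or misalign words.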
Created on July 26, 2015
Clint P. George
Articles are downloaded from the link