yelp: Yelp Dataset

Description

A collection of restaurant reviews from Yelp. The corpus contains 24,310 reviews and a vocabulary of 9,517 unique words.

Usage

data("yelp")

Format

vocab a vector of unique words in the corpus vocabulary.

docs a list of documents in the corpus. Each item represents one document as a 2 x U matrix of word frequencies, where U is the number of unique words in that document. Each column in the matrix represents a unique word in the document and contains

docs.metadata a matrix of document (article) metadata, where each row represents a document with

cids a vector of document collection ids

class.labels a vector of categories (classes) in the corpus

collection.labels a vector of collections in the corpus

ds.name the corpus name (string)

num.docs the number of documents in the corpus

V the vocabulary size
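
The layout of docs described above can be illustrated with a small hand-built corpus instead of the real data. This is only a sketch: the row order (word ids in row 1, counts in row 2), the 1-based indexing into the vocabulary, and every toy_* name are assumptions made for illustration, so they should be checked against the loaded object before being relied on.

```r
# Toy stand-ins for the documented fields (hypothetical values, not real Yelp data).
toy_vocab <- c("food", "great", "service", "slow")

# Each document is a 2 x U integer matrix. ASSUMPTION: row 1 holds word ids
# (1-based indices into the vocabulary) and row 2 holds the word counts.
# matrix() fills column by column, so each pair below is one column.
toy_docs <- list(
  matrix(c(1L, 2L,   # word id 1 ("food") occurs 2 times
           2L, 1L),  # word id 2 ("great") occurs 1 time
         nrow = 2),
  matrix(c(3L, 1L,   # "service" occurs 1 time
           4L, 3L,   # "slow" occurs 3 times
           1L, 1L),  # "food" occurs 1 time
         nrow = 2)
)

# Expand the sparse per-document matrices into a dense
# documents-by-vocabulary count matrix.
toy_dtm <- matrix(0L, nrow = length(toy_docs), ncol = length(toy_vocab),
                  dimnames = list(NULL, toy_vocab))
for (d in seq_along(toy_docs)) {
  ids <- toy_docs[[d]][1, ]
  counts <- toy_docs[[d]][2, ]
  toy_dtm[d, ids] <- counts
}

# Corpus-level summaries analogous to num.docs and V.
toy_num_docs <- length(toy_docs)
toy_V <- length(toy_vocab)

print(toy_dtm)
```

A dense matrix like toy_dtm is convenient for inspecting a handful of documents, but for a corpus of this size the sparse per-document form is much more compact.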

Note

Created on July 26, 2015

Author(s)

Clint P. George

Source

Articles are downloaded from the link

See Also

Other datasets: news, nips


clintpgeorge/clda documentation built on May 13, 2019, 8 p.m.