createAmazonSentimentStars: Create Amazon Sentiment star ratings dataset

Description Usage Arguments Details Value See Also

Description

The goal in this dataset is to predict the number of stars given in a product review taken from Amazon.com in four different categories: books, dvds, electronics and kitchen.

Usage

1
2
createAmazonSentimentStars(file = getfilepath("amazonsentimentstars.rds"),
  write = TRUE, read = TRUE)

Arguments

file

character; path/filename to write data file to

write

logical; should the dataset be written to disk for later use? (default: TRUE)

read

logical; should we try to read the dataset from the specified location first? (default: TRUE)

Details

For 21669 reviews, 1009925 unigram and bigram features are given in a sparse matrix X, along with a vector y containing the star rating and a vector indicating the domain. Note the data is NOT a data.table, but sparse matrix generated by sparseMatrix.

Task: Classification: Use X to predict y, possibly in a domain adaptation setting.

Value

List containing:

See Also

createAmazonSentiment, http://www.cs.jhu.edu/~mdredze/datasets/sentiment/


jkrijthe/createdatasets documentation built on May 19, 2019, 12:44 p.m.