data_corpus_LMRD | R Documentation
A corpus object containing a sentiment classification dataset of 50,000 highly polar movie reviews, 25,000 for training and 25,000 for testing, from Maas et al. (2011).
data_corpus_LMRD
The corpus docvars consist of:
serial (within set and polarity) document number
user-assigned movie rating on a 1-10 point integer scale
indicates whether the document belongs to the test or the training set
either neg or pos, indicating whether the movie review was negative or positive. See Maas et al. (2011) for the cut-off values that governed this assignment.
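The docvars described above are what you would typically use to separate training from test documents and to check the class balance. The following is a minimal sketch rather than code from the package: it builds a four-document toy corpus in place of data_corpus_LMRD so that it runs on its own. The docvar names (docnumber, rating, set, polarity) and the set values "train" and "test" are assumptions, since this page does not state them; check docvars(data_corpus_LMRD) for the actual names before adapting it.

```r
library(quanteda)

# Toy stand-in for data_corpus_LMRD.
# ASSUMPTION: the docvar names and the "train"/"test" values below are
# illustrative guesses, not taken from this documentation page.
toy <- corpus(
  c("A wonderful, moving film.",
    "Dull and far too long.",
    "One of the best this year.",
    "A complete waste of time."),
  docvars = data.frame(
    docnumber = c(1L, 1L, 1L, 1L),   # serial within set and polarity
    rating    = c(9L, 2L, 10L, 1L),  # 1-10 user rating
    set       = c("train", "train", "test", "test"),
    polarity  = c("pos", "neg", "pos", "neg")
  )
)

# Split the corpus on the set docvar.
train <- corpus_subset(toy, set == "train")
test  <- corpus_subset(toy, set == "test")

# Cross-tabulate set against polarity to check the class balance.
polarity_by_set <- table(docvars(toy, "set"), docvars(toy, "polarity"))
print(polarity_by_set)
```

If the real corpus uses the same docvar names, replacing toy with data_corpus_LMRD should give the corresponding split and table for the full set of reviews.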
http://ai.stanford.edu/~amaas/data/sentiment/
Andrew L. Maas, Raymond E. Daly, Peter T. Pham, Dan Huang, Andrew Y. Ng, and Christopher Potts. (2011). "Learning Word Vectors for Sentiment Analysis". The 49th Annual Meeting of the Association for Computational Linguistics (ACL 2011).