Description Usage Format Source
This dataset includes a set of email subject lines used for classification of whether the message is spam (unsolicited commercial content) or not. Many subject lines include subject matter innapropriate for classroom use. Given the volume of headlines containing such language (especially for type == "spam"), user discretion is advised.
1 |
A data frame with 6,908 rows and 3 variables:
character Email subject line
character Email classification into three levels: spam, hard_ham, and easy_ham
integer Row number
http://www.rdatasciencecases.org/Spam
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.