Description Usage Format Source References See Also Examples
This is a subsample of the email data set.
1 |
A data frame with 50 observations on the following 21 variables.
spamIndicator for whether the email was spam.
to_multipleIndicator for whether the email was addressed to more than one recipient.
fromWhether the message was listed as from anyone (this is usually set by default for regular outgoing email).
ccIndicator for whether anyone was CCed.
sent_emailIndicator for whether the sender had been sent an email in the last 30 days.
timeTime at which email was sent.
imageThe number of images attached.
attachThe number of attached files.
dollarThe number of times a dollar sign or the word “dollar” appeared in the email.
winnerIndicates whether “winner” appeared in the email.
inheritThe number of times “inherit” (or an extension, such as “inheritance”) appeared in the email.
viagraThe number of times “viagra” appeared in the email.
passwordThe number of times “password” appeared in the email.
num_charThe number of characters in the email, in thousands.
line_breaksThe number of line breaks in the email (does not count text wrapping).
formatIndicates whether the email was written using HTML (e.g. may have included bolding or active links).
re_subjWhether the subject started with “Re:”, “RE:”, “re:”, or “rE:”
exclaim_subjWhether there was an exclamation point in the subject.
urgent_subjWhether the word “urgent” was in the email subject.
exclaim_messThe number of exclamation points in the email message.
numberFactor variable saying whether there was no number, a small number (under 1 million), or a big number.
David Diez's Gmail Account, early months of 2012. All personally identifiable information has been removed.
~~ OpenIntro Statistics, openintro.org ~~
1 2 3 4 5 6 7 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.