spam7 | R Documentation |
The data consist of 4601 email items, of which 1813 items were identified as spam.
This data frame contains the following columns:
crl.tot
: total length of words in capitals
dollar
: number of occurrences of the $ symbol
bang
: number of occurrences of the ! symbol
money
: number of occurrences of the word ‘money’
n000
: number of occurrences of the string ‘000’
make
: number of occurrences of the word ‘make’
yesno
: outcome variable, a factor with levels n
not spam,
y
spam
spam7
George Forman, Hewlett-Packard Laboratories
These data are available from the University of California at Irvine Repository of Machine Learning Databases and Domain Theories. The address is: https://archive.ics.uci.edu/ml/index.php
Also available in the DAAG R package.
John H. Maindonald and W. John Braun (2019). DAAG: Data Analysis and Graphics Data and Functions. R package version 1.22.1. https://CRAN.R-project.org/package=DAAG
require(rpart)
spam.rpart <- rpart(formula = yesno ~ crl.tot + dollar + bang +
money + n000 + make, data=spam7)
plot(spam.rpart)
text(spam.rpart)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.