Description Usage Format Details Source Examples
The Spambase data set was created by Mark Hopkins, Erik Reeber, George Forman, and Jaap Suermondt at Hewlett-Packard Labs. It includes 4601 observations corresponding to email messages, 1813 of which are spam. From the original email messages, 58 different attributes were computed.
1 |
A data frame with 4601 observations on the following 58 variables.
word_freq_makea numeric vector
word_freq_addressa numeric vector
word_freq_alla numeric vector
word_freq_3da numeric vector
word_freq_oura numeric vector
word_freq_overa numeric vector
word_freq_removea numeric vector
word_freq_interneta numeric vector
word_freq_ordera numeric vector
word_freq_maila numeric vector
word_freq_receivea numeric vector
word_freq_willa numeric vector
word_freq_peoplea numeric vector
word_freq_reporta numeric vector
word_freq_addressesa numeric vector
word_freq_freea numeric vector
word_freq_businessa numeric vector
word_freq_emaila numeric vector
word_freq_youa numeric vector
word_freq_credita numeric vector
word_freq_youra numeric vector
word_freq_fonta numeric vector
word_freq_000a numeric vector
word_freq_moneya numeric vector
word_freq_hpa numeric vector
word_freq_hpla numeric vector
word_freq_georgea numeric vector
word_freq_650a numeric vector
word_freq_laba numeric vector
word_freq_labsa numeric vector
word_freq_telneta numeric vector
word_freq_857a numeric vector
word_freq_dataa numeric vector
word_freq_415a numeric vector
word_freq_85a numeric vector
word_freq_technologya numeric vector
word_freq_1999a numeric vector
word_freq_partsa numeric vector
word_freq_pma numeric vector
word_freq_directa numeric vector
word_freq_csa numeric vector
word_freq_meetinga numeric vector
word_freq_originala numeric vector
word_freq_projecta numeric vector
word_freq_rea numeric vector
word_freq_edua numeric vector
word_freq_tablea numeric vector
word_freq_conferencea numeric vector
char_freq_semicolona numeric vector
char_freq_left_parena numeric vector
char_freq_left_bracketa numeric vector
char_freq_exclamationa numeric vector
char_freq_dollara numeric vector
char_freq_pounda numeric vector
capital_run_length_averagea numeric vector
capital_run_length_longesta numeric vector
capital_run_length_totala numeric vector
is_spama factor with levels 0 1
This data is used as an example in the book "R in a Nutshell," from O'Reilly Media.
This data set is from the UCI Machine Learning Repository. You can find more information about this data set, including the ciation policy, from http://archive.ics.uci.edu/ml/datasets/Spambase
1 2 3 4 5 |
Loading required package: nutshell.bbdb
Loading required package: nutshell.audioscrobbler
0 1
2788 1813
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.