mlr_tasks_spam: Spam Classification Task
In mllg/mlr3: Machine Learning in R - Next Generation

Description Format Construction Source References See Also

Spam data set from the UCI machine learning repository (http://archive.ics.uci.edu/ml/datasets/spambase). Data set collected at Hewlett-Packard Labs to classify emails as spam or non-spam. 57 variables indicate the frequency of certain words and characters in the e-mail. The positive class is set to "spam".

R6::R6Class inheriting from TaskClassif.

1 2	mlr_tasks$get("spam") tsk("spam")

Creators: Mark Hopkins, Erik Reeber, George Forman, Jaap Suermondt. Hewlett-Packard Labs, 1501 Page Mill Rd., Palo Alto, CA 94304

Donor: George Forman (gforman at nospam hpl.hp.com) 650-857-7835

Preprocessing: Columns have been renamed. Preprocessed data taken from the kernlab package.

Dua, D. and Graff, C. (2019). UCI Machine Learning Repository http://archive.ics.uci.edu/ml. Irvine, CA: University of California, School of Information and Computer Science.

Dictionary of Tasks: mlr_tasks

mllg/mlr3 documentation built on Sept. 27, 2019, 9:38 a.m.