breastcancer: Wisconsin Breast Cancer Database

Description Usage Format Details Examples

Description

Formatted subset of mlbench::BreastCancer. See mlbench for original data more context.

Usage

1

Format

Data frame with 675 observations on 10 variables: a factor Id, 9 numeric variables, and target class.

Details

The objective is to identify each of a number of benign or malignant classes. Samples arrive periodically as Dr. Wolberg reports his clinical cases. The database therefore reflects this chronological grouping of the data. This grouping information appears immediately below, having been removed from the data itself. Each variable except for the first was converted into 11 primitive numerical attributes with values ranging from 0 through 10. There are 16 missing attribute values.

Data frame with 675 observations on 10 variables: a factor Id, 9 numeric variables, and target class:

Reproducing this dataset:

1
2
3
4
5
6
7
8
9
library("mlbench")
data(BreastCancer)

d <- BreastCancer
d <- d[!duplicated(d), ]
d <- d[complete.cases(d), ]
mat <- as.matrix(d[ , 2:9])
mat <- apply(mat, 2, as.numeric)
breastcancer <- data.frame(Id = d$Id, mat, Class = d$Class)

Examples

1
2
3
4
5
str(breastcancer)
## Not run: 
play_manual_tour(data = breastcancer[, 2:9], manip_var = 3, rescale_data = TRUE)

## End(Not run)

nspyrison/spinifex documentation built on Aug. 23, 2019, 1:21 p.m.