binarize | R Documentation |
Create a set of variables (columns) with binary values for each column in the input data. For a variable with values of 0 and 1, the column is retained and no new column is created. For a numeric variable, the function binarize.numeric
is called. For a factor column, the function binarize.factor
is called.
binarize( x, y, target = stop("'target' (0 or 1) must be provided"), control = bscontrol() )
x |
a data frame or matrix to be binarized. |
y |
a vector with two unique values (0 and 1). It is the response variable that guides the optimal discretization of variables in |
target |
the level of y (0 or 1) which indicates the boolean rule target |
control |
a list or a |
a data frame containing binary variables, or a character string describing the rule that perfectly split the target.
# Load and prepare data x <- auto[, c('mpg', 'cylinders', 'displacement')] x$cylinders <- as.factor(x$cylinders) y <- ifelse(auto$origin == 'USA', 1L, 0L) # binarize x by y = 1 bx1 <- binarize(x, y, target = 1) head(bx1) # binarize x by y = 0 bx0 <- binarize(x, y, target = 0) head(bx0) # when selecting only one column from a data frame, use drop = FALSE to maintain structure binarize(auto[,'mpg', drop = FALSE], y, target = 1)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.