Description Usage Arguments Details Value
View source: R/range_classifier.R
Use classification models to classify if the response is larger than a series of given values for regression tasks
1 2 3 4 5 | pipe_range_classifier(train, response, exclude_columns = response,
base_temporary_column_name = "base_temporary_column_name",
base_definitive_column_name = paste0(response, "_quantile"),
quantiles = 0, even_spreads = 0, values, model = c("glm",
"xgboost")[1], controls)
|
train |
The train dataset, as a data.frame or data.table. Data.tables may be changed by reference. |
response |
String denoting the name of the column that should be used as the response variable. |
exclude_columns |
Columns that shouldn't be used in the models. Defaults to the response column and will ALWAYS include the response column. |
base_temporary_column_name |
Base name that will be used to create a temporary variable for training the classifier. Use this to ensure no existing columns are overwritten. |
base_definitive_column_name |
Base name that will be used to store the predictions of the created classifiers. Will be appended by the threshold value. Use this to ensure no existing columns are overwritten. |
quantiles |
Number of quantiles to use to generate threshold values. Will actually generate |
even_spreads |
Number of evenly spread thresholds to use. These will be based on the minimum and maximum value of the response in |
values |
Threshold values to use. We will check if these fall in the range of the response in |
model |
Type of model to use. Currently only binomial glm and xgboost are available. |
controls |
Parameters for the models to use. Leave empty or set to NA to use defaults:
|
If multiple values out of quantiles
, even_spreads
, or values
are chosen, all options will be applied.
A list containing the transformed train dataset and a trained pipe.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.