nano_automl | R Documentation |
Creates robust, fast models using H2O
's h2o.automl
function
implemented with nano
objects.
nano_automl(
nano = nano::create_nano(),
response,
data,
test,
train_test = NA,
ignore_vars = c(),
weight_column = NULL,
fold_column = NULL,
nfolds = NA,
max_models = 3,
max_time = 10 * 60,
thresh = 10,
monotone_constraints = NULL,
exclude_algos = c("StackedEnsemble", "DeepLearning"),
include_algos = NULL,
plots = TRUE,
alarm = TRUE,
quiet = FALSE,
save = FALSE,
subdir = NA,
project = "ML Project",
seed = 628,
project_name = paste0("grid_", nano$n_model + 1),
grid_description = "",
...
)
nano |
nano object to store model in. If not specified, a new nano object will be created the results will be stored in the new nano object. |
response |
a character. Target variable for model. |
data |
a data.frame containing data to train model. May also contain testing and
holdout data, in which case, the |
test |
a data.frame containing testing dataset. If this is provided, the |
train_test |
a character. Variable in |
ignore_vars |
vector of characters. Variables in the dataset which should not be used
for modelling. Note, if any of |
weight_column |
a character. Column name in |
fold_column |
a character. Column name in |
nfolds |
a numeric. Number of folds used in cross-validation. If this is provided, the
|
max_models |
a numeric. Maximum number of models to be built. |
max_time |
a numeric. Maximum amount of time spent building models. |
thresh |
a numeric. Cutoff of number of unique values in response variable to determine whether performing classification or regression. Default value is 10. |
monotone_constraints |
a list. Mapping between variable names in |
exclude_algos |
a vector of characters. Algorithms which should be excluding from training process. |
include_algos |
a vector of characters. Algorithms to be included in training process.
Set to |
plots |
a logical. Whether to produce plots. |
alarm |
a logical. Whether to beep when function has finished running. |
quiet |
a logical. Whether to print messages to the console. |
seed |
a numeric. |
grid_description |
a character. Optional description of grid. Can be later accessed by
|
... |
further parameters to pass to |
This function used H2O
's h2o.automl
function to easily and quickly build
several different machine learning models. Importantly, an active H2O connection is
required (i.e. run h2o.init()
)) before using this function.
For more details, please see the documentation for h2o.automl
.
nano object with new entry filled with models produced.
## Not run:
if(interactive()){
library(h2o)
library(nano)
h2o.init()
# import dataset
data(property_prices)
# prepare data for modelling
data_all <- nano::data_prep(data = property_prices,
response = "sale_price",
split_or_fold = 0.7,
holdout_ratio = 0.1)
data <- data_all$data
# create models
nano <- nano_automl(data = data,
response = "sale_price",
train_test = "data_id",
ignore_vars = "data_id")
}
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.