s_H2OGBM | R Documentation |
Trains a Gradient Boosting Machine using H2O (http://www.h2o.ai)
s_H2OGBM(
x,
y = NULL,
x.test = NULL,
y.test = NULL,
x.name = NULL,
y.name = NULL,
ip = "localhost",
port = 54321,
h2o.init = TRUE,
gs.h2o.init = FALSE,
h2o.shutdown.at.end = TRUE,
grid.resample.params = setup.resample("kfold", 5),
metric = NULL,
maximize = NULL,
n.trees = 10000,
force.n.trees = NULL,
max.depth = 5,
n.stopping.rounds = 50,
stopping.metric = "AUTO",
p.col.sample = 1,
p.row.sample = 0.9,
minobsinnode = 5,
min.split.improvement = 1e-05,
quantile.alpha = 0.5,
learning.rate = 0.01,
learning.rate.annealing = 1,
weights = NULL,
ifw = TRUE,
ifw.type = 2,
upsample = FALSE,
downsample = FALSE,
resample.seed = NULL,
na.action = na.fail,
grid.n.cores = 1,
n.cores = rtCores,
imetrics = FALSE,
.gs = FALSE,
print.plot = FALSE,
plot.fitted = NULL,
plot.predicted = NULL,
plot.theme = rtTheme,
question = NULL,
verbose = TRUE,
trace = 0,
grid.verbose = verbose,
save.mod = FALSE,
outdir = NULL,
...
)
x |
Numeric vector or matrix / data frame of features i.e. independent variables |
y |
Numeric vector of outcome, i.e. dependent variable |
x.test |
Numeric vector or matrix / data frame of testing set features
Columns must correspond to columns in |
y.test |
Numeric vector of testing set outcome |
x.name |
Character: Name for feature set |
y.name |
Character: Name for outcome |
ip |
Character: IP address of H2O server. Default = "localhost" |
port |
Integer: Port number for server. Default = 54321 |
h2o.shutdown.at.end |
Logical: If TRUE, run |
n.trees |
Integer: Number of trees to grow. Maximum number of trees if |
max.depth |
[gS] Integer: Depth of trees to grow |
n.stopping.rounds |
Integer: If > 0, stop training if |
stopping.metric |
Character: "AUTO" (Default), "deviance", "logloss", "MSE", "RMSE", "MAE", "RMSLE", "AUC", "lift_top_group", "misclassification", "mean_per_class_error" |
p.col.sample |
[gS] |
p.row.sample |
[gS] |
minobsinnode |
[gS] |
learning.rate |
[gS] |
learning.rate.annealing |
[gS] |
weights |
Numeric vector: Weights for cases. For classification, |
ifw |
Logical: If TRUE, apply inverse frequency weighting
(for Classification only).
Note: If |
ifw.type |
Integer 0, 1, 2 1: class.weights as in 0, divided by min(class.weights) 2: class.weights as in 0, divided by max(class.weights) |
upsample |
Logical: If TRUE, upsample cases to balance outcome classes (for Classification only) Note: upsample will randomly sample with replacement if the length of the majority class is more than double the length of the class you are upsampling, thereby introducing randomness |
downsample |
Logical: If TRUE, downsample majority class to match size of minority class |
resample.seed |
Integer: If provided, will be used to set the seed during upsampling. Default = NULL (random seed) |
na.action |
How to handle missing values. See |
n.cores |
Integer: Number of cores to use |
.gs |
Internal use only |
print.plot |
Logical: if TRUE, produce plot using |
plot.fitted |
Logical: if TRUE, plot True (y) vs Fitted |
plot.predicted |
Logical: if TRUE, plot True (y.test) vs Predicted.
Requires |
plot.theme |
Character: "zero", "dark", "box", "darkbox" |
question |
Character: the question you are attempting to answer with this model, in plain language. |
verbose |
Logical: If TRUE, print summary to screen. |
trace |
Integer: If higher than 0, will print more information to the console. |
save.mod |
Logical: If TRUE, save all output to an RDS file in |
outdir |
Path to output directory.
If defined, will save Predicted vs. True plot, if available,
as well as full model output, if |
... |
Additional arguments |
[gS] denotes tunable hyperparameters
Warning: If you get an HTTP 500 error at random, use h2o.shutdown()
to shutdown the server.
It will be restarted when s_H2OGBM
is called
rtMod
object
E.D. Gennatas
train_cv for external cross-validation
Other Supervised Learning:
s_AdaBoost()
,
s_AddTree()
,
s_BART()
,
s_BRUTO()
,
s_BayesGLM()
,
s_C50()
,
s_CART()
,
s_CTree()
,
s_EVTree()
,
s_GAM()
,
s_GBM()
,
s_GLM()
,
s_GLMNET()
,
s_GLMTree()
,
s_GLS()
,
s_H2ODL()
,
s_H2ORF()
,
s_HAL()
,
s_Isotonic()
,
s_KNN()
,
s_LDA()
,
s_LM()
,
s_LMTree()
,
s_LightCART()
,
s_LightGBM()
,
s_MARS()
,
s_MLRF()
,
s_NBayes()
,
s_NLA()
,
s_NLS()
,
s_NW()
,
s_PPR()
,
s_PolyMARS()
,
s_QDA()
,
s_QRNN()
,
s_RF()
,
s_RFSRC()
,
s_Ranger()
,
s_SDA()
,
s_SGD()
,
s_SPLS()
,
s_SVM()
,
s_TFN()
,
s_XGBoost()
,
s_XRF()
Other Tree-based methods:
s_AdaBoost()
,
s_AddTree()
,
s_BART()
,
s_C50()
,
s_CART()
,
s_CTree()
,
s_EVTree()
,
s_GBM()
,
s_GLMTree()
,
s_H2ORF()
,
s_LMTree()
,
s_LightCART()
,
s_LightGBM()
,
s_MLRF()
,
s_RF()
,
s_RFSRC()
,
s_Ranger()
,
s_XGBoost()
,
s_XRF()
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.