View source: R/kerasOptimizer.R
optimizer_sgd | R Documentation |
Stochastic gradient descent optimizer with support for momentum, learning rate decay, and Nesterov momentum.
optimizer_sgd( learning_rate = 0.01, momentum = 0, decay = 0, nesterov = FALSE, clipnorm = NULL, clipvalue = NULL, ... )
learning_rate |
float >= 0. Learning rate. |
momentum |
float >= 0. Parameter that accelerates SGD in the relevant direction and dampens oscillations. |
decay |
float >= 0. Learning rate decay over each update. |
nesterov |
boolean. Whether to apply Nesterov momentum. |
clipnorm |
Gradients will be clipped when their L2 norm exceeds this value. |
clipvalue |
Gradients will be clipped when their absolute value exceeds this value. |
... |
Unused, present only for backwards compatability |
Based on:
[keras/R/optimizers.R](https://github.com/rstudio/keras/blob/main/R/optimizers.R).
The following code is commented:
backcompat_fix_rename_lr_to_learning_rate(...)
Optimizer for use with compile.keras.engine.training.Model
.
To enable compatibility with the ranges of the learning rates
of the other optimizers, the learning rate learning_rate
is internally mapped to 10 * learning_rate
. That is,
a learning rat of 0.001 will be mapped to 0.01 (which is the default.)
Other optimizers:
optimizer_adadelta()
,
optimizer_adagrad()
,
optimizer_adamax()
,
optimizer_adam()
,
optimizer_nadam()
,
optimizer_rmsprop()
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.