Description Usage Arguments Value See Also
Batch Gradient Q-Learning
1 2 3 4 5 6 7 8 9 10 11 12 13 | BatchGradientQ(
phis,
discount,
method = "FQI",
loss = NULL,
lambda = 0,
alpha = 1,
theta = NULL,
learning_rate = 1,
max_iter = 1000,
tol = 0.001,
accelerate = TRUE
)
|
phis |
a list of processed outcome from |
discount |
a numeric number between 0 and 1. |
method |
Q-learning method, choice of "FQI", "GGQ", and "BEM" |
loss |
loss function for evaluation, choice of "MSPBE" and "MSBE" |
lambda |
regularization coefficient |
alpha |
elastic net mixing parameter between 0 (ridge) and 1 (lasso) |
theta |
a numeric vector as model parameter. |
learning_rate |
learning rate for gradient descent |
max_iter |
maximum number of iteration |
tol |
tolerance level for convergence |
accelerate |
if |
a list of model fitting results
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.