Description Usage Arguments Value See Also
Batch Gradient Q-Learning
| 1 2 3 4 5 6 7 8 9 10 11 12 13 | BatchGradientQ(
  phis,
  discount,
  method = "FQI",
  loss = NULL,
  lambda = 0,
  alpha = 1,
  theta = NULL,
  learning_rate = 1,
  max_iter = 1000,
  tol = 0.001,
  accelerate = TRUE
)
 | 
| phis | a list of processed outcome from  | 
| discount | a numeric number between 0 and 1. | 
| method | Q-learning method, choice of "FQI", "GGQ", and "BEM" | 
| loss | loss function for evaluation, choice of "MSPBE" and "MSBE" | 
| lambda | regularization coefficient | 
| alpha | elastic net mixing parameter between 0 (ridge) and 1 (lasso) | 
| theta | a numeric vector as model parameter. | 
| learning_rate | learning rate for gradient descent | 
| max_iter | maximum number of iteration | 
| tol | tolerance level for convergence | 
| accelerate | if  | 
a list of model fitting results
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.