BatchGradientQ: Batch Gradient Q-Learning

Description Usage Arguments Value See Also

View source: R/learn.R

Description

Batch Gradient Q-Learning

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
BatchGradientQ(
  phis,
  discount,
  method = "FQI",
  loss = NULL,
  lambda = 0,
  alpha = 1,
  theta = NULL,
  learning_rate = 1,
  max_iter = 1000,
  tol = 0.001,
  accelerate = TRUE
)

Arguments

phis

a list of processed outcome from SARS2Phis().

discount

a numeric number between 0 and 1.

method

Q-learning method, choice of "FQI", "GGQ", and "BEM"

loss

loss function for evaluation, choice of "MSPBE" and "MSBE"

lambda

regularization coefficient

alpha

elastic net mixing parameter between 0 (ridge) and 1 (lasso)

theta

a numeric vector as model parameter.

learning_rate

learning rate for gradient descent

max_iter

maximum number of iteration

tol

tolerance level for convergence

accelerate

if TRUE, use accelerated proximal gradient method; otherwise use proximal gradient method.

Value

a list of model fitting results

See Also

SARS2Phis, MSPBE, GradientFQI


XiaoqiLu/PhD-Thesis documentation built on March 1, 2021, 10:49 a.m.