Man pages for XiaoqiLu/PhD-Thesis
Regularized Q-Learning

BatchGradientQBatch Gradient Q-Learning
BindSARSBind a List of SARS Objects
CartPoleCartPole Object
CumRewardCumulative Reward
EnvEnvironment Object
EpsilonGreedyEpsilon-Greedy Action
GaussianRadial Kernels
gibbsGibbs Action
GradientFQIGradient functions for Q-learning
greedyGreedy Action
LogSumExpLogSumExp
MSPBEObjective functions for Q-learning
ObserveObserve from Environment
PolyPolynomial Basis
ProximalElasticProximal Mapping for Elastic Net
randomRandom Action
RBFRadial Basis Function
ResetReset the Environment
RowWiseKroneckerRow-Wise Kronecker
sarsSARS Object
SARS2PhisConvert SARS to Basis Representation
SeedSet Seed for Environment
SoftSoft Thresholding
StepEvolve the Environment
TrajTrajectory Object
Traj2SARSConvert Trajectory to SARS
XiaoqiLu/PhD-Thesis documentation built on March 1, 2021, 10:49 a.m.