BatchGradientQ | Batch Gradient Q-Learning |
BindSARS | Bind a List of SARS Objects |
CartPole | CartPole Object |
CumReward | Cumulative Reward |
Env | Environment Object |
EpsilonGreedy | Epsilon-Greedy Action |
Gaussian | Radial Kernels |
gibbs | Gibbs Action |
GradientFQI | Gradient functions for Q-learning |
greedy | Greedy Action |
LogSumExp | LogSumExp |
MSPBE | Objective functions for Q-learning |
Observe | Observe from Environment |
Poly | Polynomial Basis |
ProximalElastic | Proximal Mapping for Elastic Net |
random | Random Action |
RBF | Radial Basis Function |
Reset | Reset the Environment |
RowWiseKronecker | Row-Wise Kronecker |
sars | SARS Object |
SARS2Phis | Convert SARS to Basis Representation |
Seed | Set Seed for Environment |
Soft | Soft Thresholding |
Step | Evolve the Environment |
Traj | Trajectory Object |
Traj2SARS | Convert Trajectory to SARS |