| BatchGradientQ | Batch Gradient Q-Learning |
| BindSARS | Bind a List of SARS Objects |
| CartPole | CartPole Object |
| CumReward | Cumulative Reward |
| Env | Environment Object |
| EpsilonGreedy | Epsilon-Greedy Action |
| Gaussian | Radial Kernels |
| gibbs | Gibbs Action |
| GradientFQI | Gradient functions for Q-learning |
| greedy | Greedy Action |
| LogSumExp | LogSumExp |
| MSPBE | Objective functions for Q-learning |
| Observe | Observe from Environment |
| Poly | Polynomial Basis |
| ProximalElastic | Proximal Mapping for Elastic Net |
| random | Random Action |
| RBF | Radial Basis Function |
| Reset | Reset the Environment |
| RowWiseKronecker | Row-Wise Kronecker |
| sars | SARS Object |
| SARS2Phis | Convert SARS to Basis Representation |
| Seed | Set Seed for Environment |
| Soft | Soft Thresholding |
| Step | Evolve the Environment |
| Traj | Trajectory Object |
| Traj2SARS | Convert Trajectory to SARS |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.