Description Usage Arguments Value Functions
Objective functions for Q-learning
1 2 3 |
theta |
a numeric vector as model parameter. |
phis |
a list of processed outcome from |
discount |
a numeric number between 0 and 1. |
a (non-negative) number
MSPBE
: Mean Squared Projected Bellman Error
MSBE
: Mean Squared Bellman Error
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.