Description Usage Arguments Value
Compute backward recursion to obtain the optimal policy, which has the form at time t -log(psi_t) = x_t' A_t x_t + x_t'b_t + c_t.
1 | LQG_policy_recursion(RNderiv, transitions)
|
RNderiv |
list with keys:
|
transitions |
list with keys:
|
list with keys:
As
bs
cs
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.