policyIteAve: Perform policy iteration (average criterion) on the MDP.
In MDP: Markov Decision Processes (MDPs) in R

Description Usage Arguments Value Author(s) See Also

View source: R/loadMDP.R

The policy can afterwards be recieved using functions getPolicy and getPolicyW.

1	policyIteAve(mdp, w, dur, maxIte = 100)

`mdp`	The MDP loaded using loadMDP.
`w`	The label of the weight we optimize.
`dur`	The label of the duration/time such that discount rates can be calculated.
`maxIte`	Max number of iterations. If the model does not satisfy the unichain assumption the algorithm may loop.

The optimal gain (g) calculated.

Lars Relund lars@relund.dk

getPolicy, getPolicyW.

MDP documentation built on May 2, 2019, 6:48 p.m.

MDP index

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Tweet to @rdrrHQ

GitHub issue tracker

ian@mutexlabs.com