calcRPO: Calculate the retention payoff (RPO) or opportunity cost for...
In MDP: Markov Decision Processes (MDPs) in R

The RPO of a state is defined as the difference between the weight of the state when using action `iA` and the maximum weight of the state (node) when using any predecessor other than `iA`.
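
Read literally, the definition can be written as the formula below. This is a sketch of the description, not a formula taken from the package: the symbols $w(s, a)$ (the weight `w` of state $s$ when action $a$ is used) and $A(s)$ (the actions available in state $s$) are notation introduced here for illustration.

```latex
\[
  \mathrm{RPO}_{iA}(s) \;=\; w(s, iA) \;-\; \max_{a \in A(s),\; a \neq iA} w(s, a)
\]
```

Under this reading, a positive value means that action `iA` gives a higher weight in state $s$ than the best alternative, and a negative value means that some other action gives a higher weight.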

  calcRPO(mdp, w, iA, sId = 1:mdp$states - 1,
    criterion = "expected", dur = 0, rate = 0.1,
    rateBase = 1, g = 0)
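
The default for `sId` depends on R operator precedence: `:` binds more tightly than binary `-`, so `1:mdp$states - 1` is evaluated as `(1:mdp$states) - 1`. The minimal sketch below illustrates this, assuming `mdp$states` holds the number of states; `n_states` is a hypothetical stand-in for that value, so no model needs to be loaded.

```r
# Hypothetical stand-in for mdp$states (assumed to be the number of states).
n_states <- 5

# Same form as the default: parsed as (1:n_states) - 1, i.e. zero-based ids.
ids <- 1:n_states - 1
print(ids)   # 0 1 2 3 4

# With explicit parentheses around the subtraction the result differs.
other <- 1:(n_states - 1)
print(other) # 1 2 3 4
```

The default therefore selects every state id from 0 to `mdp$states - 1`. Pass a shorter vector as `sId` to restrict the calculation to particular states.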

`mdp`	The MDP loaded using loadMDP.
`w`	The label of the weight we calculate RPO for.
`iA`	The action index we calculate the RPO with respect to.
`sId`	Vector of ids of the states we want to retrieve.
`criterion`	The criterion used: `expected` uses expected rewards, `discount` uses discounted rewards, and `average` uses average rewards.
`dur`	The label of the duration/time such that discount rates can be calculated.
`rate`	The interest rate.
`rateBase`	The time-horizon the rate is valid over.
`g`	The optimal gain (g) calculated (used if `criterion = "average"`).