dynOAB (R Documentation)
Dynamic programming of the optimal One-Armed Bernoulli Bandits process
dynOAB(N, al)
N: number of trials.
al: the known probability of reward on arm A.
dynOAB returns the matrix of maximal predicted rewards. dynOAB2 returns the optimal predicted reward.
Shelemyahu Zacks
simOAB
dynOAB(10, 0.5)
dynOAB2(10, 0.5)
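To illustrate the kind of backward induction that the title refers to, the following is a minimal sketch only. It is not the package's implementation and is not guaranteed to reproduce the output of dynOAB or dynOAB2. The function name oab_value_sketch is hypothetical. The sketch assumes that the second arm has an unknown success probability with a uniform prior, that the player never returns to the unknown arm after switching to arm A, and that entry [n + 1, x + 1] of the result holds the value after n pulls of the unknown arm with x successes.

```r
# Hypothetical helper (not part of any package): backward induction for a
# one-armed Bernoulli bandit. ASSUMPTIONS: uniform prior on the unknown arm,
# and switching to the known arm A is permanent.
oab_value_sketch <- function(N, al) {
  stopifnot(N >= 1, al >= 0, al <= 1)
  # V[n + 1, x + 1]: maximal expected reward over the remaining N - n trials
  # after n pulls of the unknown arm with x successes. Row N + 1 stays 0
  # because no trials remain.
  V <- matrix(0, nrow = N + 1, ncol = N + 1)
  for (n in (N - 1):0) {
    for (x in 0:n) {
      # Posterior predictive success probability under the uniform prior
      p_hat <- (x + 1) / (n + 2)
      # Option 1: play the known arm A for all remaining trials
      stay_known <- (N - n) * al
      # Option 2: pull the unknown arm once more, then continue optimally
      try_unknown <- p_hat * (1 + V[n + 2, x + 2]) +
        (1 - p_hat) * V[n + 2, x + 1]
      V[n + 1, x + 1] <- max(stay_known, try_unknown)
    }
  }
  V
}

V <- oab_value_sketch(10, 0.5)
V[1, 1]  # value at the start, before any trial
```

Under these assumptions the top-left entry is the expected total reward of the optimal policy over all N trials, and it can never fall below N * al, the reward from playing arm A throughout.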