dynOAB: Dynamic programming of the optimal One-Armed Bandits

View source: R/dynOAB.R

dynOABR Documentation

Dynamic programming of the optimal One-Armed Bandits

Description

Dynamic programming of the optimal One-Armed Bernoulli Bandits process

Usage

dynOAB(N, al)

Arguments

N

number of trials.

al

the known probability of reward on arm A.

Value

For dynOAB the matrix of maximal predicted rewards. For dynOAB2 the optimal predicted reward.

Author(s)

Shelemyahu Zacks

See Also

simOAB

Examples

dynOAB(10, 0.5)
dynOAB2(10, 0.5)

mistat documentation built on March 7, 2023, 6:43 p.m.