Thompson sampling algorithm is used in order to make decisions. With this function you can choose the mean-reward functions, as well as the basis used to model them.
1 |
nsteps |
The number of steps in simulation. |
sd |
A vector which contains the standard deviation of the reward from each lever. |
posteriors |
A list of posterior distributions. Each posterior distribution is itself a list of model parameters. |
bas_type |
Either "fourier" or "poly". |
mean |
A list, each element should be a function representing the mean for one of the levers available to be interacted with. |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.