This is a package containing functions that allow the user to simulate finite trajectories of the (contextual) multi-armed bandit problem. The key focus of this package is the performance of the nonparametric approach used to model the reward function.
Package details |
|
---|---|
Author | Douglas Corbin |
Maintainer | Douglas Corbin <doug.corbin@bristol.ac.uk> |
License | GPL (>= 2) |
Version | 1.0 |
Package repository | View on GitHub |
Installation |
Install the latest version of this package by entering the following in R:
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.