dfcorbin/MABsim: Run (contextual) Multi-Armed Bandit Simulations

This package contains functions that allow the user to investigate the performance of a particular non-parametric approach to modelling expected reward functions in the contextual multi-armed bandit (MAB) setting. We use Thompson sampling in order to explore and choose the actions, and we partition the context space in order to better approximate the true expected-reward functions.

Getting started

Package details

AuthorDouglas Corbin
MaintainerDouglas Corbin <doug.corbin@bristol.ac.uk>
LicenseGPL (>= 2)
Version0.1.0
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("dfcorbin/MABsim")
dfcorbin/MABsim documentation built on April 26, 2020, 8:26 a.m.