| Agent | Agent |
| Bandit | Bandit: Superclass |
| BasicBernoulliBandit | Bandit: BasicBernoulliBandit |
| BasicGaussianBandit | Bandit: BasicGaussianBandit |
| BootstrapTSPolicy | Policy: Thompson sampling with the online bootstrap |
| clipr | Clip vectors |
| ContextualBernoulliBandit | Bandit: Naive Contextual Bernouilli Bandit |
| ContextualBinaryBandit | Bandit: ContextualBinaryBandit |
| ContextualEpochGreedyPolicy | Policy: A Time and Space Efficient Algorithm for Contextual... |
| ContextualEpsilonGreedyPolicy | Policy: ContextualEpsilonGreedyPolicy with unique linear... |
| ContextualHybridBandit | Bandit: ContextualHybridBandit |
| ContextualLinearBandit | Bandit: ContextualLinearBandit |
| ContextualLinTSPolicy | Policy: Linear Thompson Sampling with unique linear models |
| ContextualLogitBandit | Bandit: ContextualLogitBandit |
| ContextualLogitBTSPolicy | Policy: ContextualLogitBTSPolicy |
| ContextualPrecachingBandit | Bandit: ContextualPrecachingBandit |
| ContextualTSProbitPolicy | Policy: ContextualTSProbitPolicy |
| ContextualWheelBandit | Bandit: ContextualWheelBandit |
| ContinuumBandit | Bandit: ContinuumBandit |
| data_table_factors_to_numeric | Convert all factor columns in data.table to numeric |
| dec-set | Decrement |
| EpsilonFirstPolicy | Policy: Epsilon First |
| EpsilonGreedyPolicy | Policy: Epsilon Greedy |
| Exp3Policy | Policy: Exp3 |
| FixedPolicy | Policy: Fixed Arm |
| formatted_difftime | Format difftime objects |
| get_arm_context | Return context vector of an arm |
| get_full_context | Get full context matrix over all arms |
| GittinsBrezziLaiPolicy | Policy: Gittins Approximation algorithm for choosing arms in... |
| GradientPolicy | Policy: Gradient |
| History | History |
| inc-set | Increment |
| ind | On-the-fly indicator function for use in formulae |
| inv | Inverse from Choleski (or QR) Decomposition. |
| invgamma | The Inverse Gamma Distribution |
| invlogit | Inverse Logit Function |
| is_rstudio | Check if in RStudio |
| LifPolicy | Policy: Continuum Bandit Policy with Lock-in Feedback |
| LinUCBDisjointOptimizedPolicy | Policy: LinUCB with unique linear models |
| LinUCBDisjointPolicy | Policy: LinUCB with unique linear models |
| LinUCBGeneralPolicy | Policy: LinUCB with unique linear models |
| LinUCBHybridOptimizedPolicy | Policy: LinUCB with hybrid linear models |
| LinUCBHybridPolicy | Policy: LinUCB with hybrid linear models |
| mvrnorm | Simulate from a Multivariate Normal Distribution |
| OfflineBootstrappedReplayBandit | Bandit: Offline Bootstrapped Replay |
| OfflineDirectMethodBandit | Bandit: Offline Direct Methods |
| OfflineDoublyRobustBandit | Bandit: Offline Doubly Robust |
| OfflineLookupReplayEvaluatorBandit | Bandit: Offline Replay with lookup tables |
| OfflinePropensityWeightingBandit | Bandit: Offline Propensity Weighted Replay |
| OfflineReplayEvaluatorBandit | Bandit: Offline Replay |
| one_hot | One Hot Encoding of data.table columns |
| ones_in_zeroes | A vector of zeroes and ones |
| OraclePolicy | Policy: Oracle |
| Plot | Plot |
| plot.history | Plot Method for Contextual History |
| Policy | Policy: Superclass |
| print.history | Print Method for Contextual History |
| prob_winner | Binomial Win Probability |
| RandomPolicy | Policy: Random |
| sample_one_of | Sample one element from vector or list |
| set_external | Change Default Graphing Device from RStudio |
| sherman_morrisson | Sherman-Morrisson inverse |
| sim_post | Binomial Posterior Simulator |
| Simulator | Simulator |
| SoftmaxPolicy | Policy: Softmax |
| summary.history | Summary Method for Contextual History |
| sum_of | Sum of list |
| ThompsonSamplingPolicy | Policy: Thompson Sampling |
| UCB1Policy | Policy: UCB1 |
| UCB2Policy | Policy: UCB2 |
| value_remaining | Potential Value Remaining |
| var_welford | Welford's variance |
| which_max_list | Get maximum value in list |
| which_max_tied | Get maximum value randomly breaking ties |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.