Agent | Agent |
Bandit | Bandit: Superclass |
BasicBernoulliBandit | Bandit: BasicBernoulliBandit |
BasicGaussianBandit | Bandit: BasicGaussianBandit |
BootstrapTSPolicy | Policy: Thompson sampling with the online bootstrap |
clipr | Clip vectors |
ContextualBernoulliBandit | Bandit: Naive Contextual Bernoulli Bandit |
ContextualBinaryBandit | Bandit: ContextualBinaryBandit |
ContextualEpochGreedyPolicy | Policy: A Time and Space Efficient Algorithm for Contextual... |
ContextualEpsilonGreedyPolicy | Policy: ContextualEpsilonGreedyPolicy with unique linear... |
ContextualHybridBandit | Bandit: ContextualHybridBandit |
ContextualLinearBandit | Bandit: ContextualLinearBandit |
ContextualLinTSPolicy | Policy: Linear Thompson Sampling with unique linear models |
ContextualLogitBandit | Bandit: ContextualLogitBandit |
ContextualLogitBTSPolicy | Policy: ContextualLogitBTSPolicy |
ContextualPrecachingBandit | Bandit: ContextualPrecachingBandit |
ContextualTSProbitPolicy | Policy: ContextualTSProbitPolicy |
ContextualWheelBandit | Bandit: ContextualWheelBandit |
ContinuumBandit | Bandit: ContinuumBandit |
data_table_factors_to_numeric | Convert all factor columns in data.table to numeric |
dec-set | Decrement |
EpsilonFirstPolicy | Policy: Epsilon First |
EpsilonGreedyPolicy | Policy: Epsilon Greedy |
Exp3Policy | Policy: Exp3 |
FixedPolicy | Policy: Fixed Arm |
formatted_difftime | Format difftime objects |
get_arm_context | Return context vector of an arm |
get_full_context | Get full context matrix over all arms |
get_global_seed | Lookup .Random.seed in global environment |
GittinsBrezziLaiPolicy | Policy: Gittins Approximation algorithm for choosing arms in... |
GradientPolicy | Policy: Gradient |
History | History |
inc-set | Increment |
ind | On-the-fly indicator function for use in formulae |
inv | Inverse from Choleski (or QR) Decomposition |
invgamma | The Inverse Gamma Distribution |
invlogit | Inverse Logit Function |
is_rstudio | Check if in RStudio |
LifPolicy | Policy: Continuum Bandit Policy with Lock-in Feedback |
LinUCBDisjointOptimizedPolicy | Policy: LinUCB with unique linear models |
LinUCBDisjointPolicy | Policy: LinUCB with unique linear models |
LinUCBGeneralPolicy | Policy: LinUCB with unique linear models |
LinUCBHybridOptimizedPolicy | Policy: LinUCB with hybrid linear models |
LinUCBHybridPolicy | Policy: LinUCB with hybrid linear models |
mvrnorm | Simulate from a Multivariate Normal Distribution |
OfflineBootstrappedReplayBandit | Bandit: Offline Bootstrapped Replay |
OfflineDirectMethodBandit | Bandit: Offline Direct Methods |
OfflineDoublyRobustBandit | Bandit: Offline Doubly Robust |
OfflineLookupReplayEvaluatorBandit | Bandit: Offline Replay with lookup tables |
OfflinePropensityWeightingBandit | Bandit: Offline Propensity Weighted Replay |
OfflineReplayEvaluatorBandit | Bandit: Offline Replay |
one_hot | One Hot Encoding of data.table columns |
ones_in_zeroes | A vector of zeroes and ones |
OraclePolicy | Policy: Oracle |
Plot | Plot |
plot.history | Plot Method for Contextual History |
Policy | Policy: Superclass |
print.history | Print Method for Contextual History |
prob_winner | Binomial Win Probability |
RandomPolicy | Policy: Random |
sample_one_of | Sample one element from vector or list |
set_external | Change Default Graphing Device from RStudio |
set_global_seed | Set .Random.seed to a pre-saved value |
sherman_morrisson | Sherman-Morrison inverse |
sim_post | Binomial Posterior Simulator |
Simulator | Simulator |
SoftmaxPolicy | Policy: Softmax |
summary.history | Summary Method for Contextual History |
sum_of | Sum of list |
ThompsonSamplingPolicy | Policy: Thompson Sampling |
UCB1Policy | Policy: UCB1 |
UCB2Policy | Policy: UCB2 |
value_remaining | Potential Value Remaining |
var_welford | Welford's variance |
which_max_list | Get maximum value in list |
which_max_tied | Get maximum value randomly breaking ties |