Man pages for MartinKies/USLR
Reinforcement Learning with R

Act.A3CDetermines which action the algorithm takes
Action.2.Choice.PDAction to Array for Prisoners Dilemma
Action.Encoding.Info.PDGet Info of Action Encoding
Action.Encoding.Info.Simple.GameGet Info of Action Encoding
Act.QLearningPersExpPathDetermines which action to take
Advantage.functionCalculates N-Step Returns or weighted Temporal Difference...
Alphabet3A student strategy
a.MindA student strategy
a.MindHighDA student strategy
a.net.nice0A student strategy
answer.strat1A student strategy
answer.strat2A student strategy
answer.strat3A student strategy
answer.to.undertaker.2A student strategy
a.ntft.1A student strategy
AntimindA student strategy
Antimind2A student strategy
antiprof2A student strategy
antistrat2A student strategy
antistrat2.High.errA student strategy
Antitiktak1A student strategy
Antitiktak1.improvedAn improved Version of AntiTikTak1
Antitiktak1.simplifiedA simplified Version of AntiTikTak1
Antitiktak2A student strategy
a.tadaaa.1A student strategy
a.td.2A student strategy
a.TikTak1A student strategy
a.tiktak.2A student strategy
Calc.R.phideprecated
Choice.2.Action.PDArray to Action for Prisoners Dilemma
Choice.2.Action.Simple.GameArray to Action for Simple Game
compare.explorationStrategy highlighting differences in Exploration states are...
Convert.2.trainConverts stored Memory into arrays.
counter.da.taaaA student strategy
counter.MindA student strategy
counter.MindHighDA student strategy
counter.nice.tit.for.tatA student strategy
counter.nice.tit.for.tat.2A student strategy
counter.rainbow.unicorns.twoA student strategy
counter.strat2A student strategy
counter.ta.daaaA student strategy
counter.tft.forgive.fastA student strategy
counter.tft.forgive.slowA student strategy
counter.the.undertaker.1A student strategy
counter.the.undertaker.2A student strategy
da.taaaA student strategy
da.taaa.counterA student strategy
Define_GraphGraph for Network Loss according to A3C.
Define_Graph_Gradient_UpdateGraph to update Network weights
destab.strat2.0A student strategy
destab.ta.daaaA student strategy
Discounted.Reward.PDUpdate Score based on expected Value of reward
dont.forgiveA student strategy
dont.mindA student strategy
dont.mind.High.errA student strategy
eat.mindfreaks.2A student strategy
eat.unicorns.48A student strategy
eat.unicorns.50A student strategy
eat.unicorns.65A student strategy
el.majestro.incredibileA student strategy
Encode.Game.States.PDTransforms List of Gamestates to std encoding form
Encoding.Harper.PDEncoding based on Reinforcement Learning Produces Dominant...
Encoding.last.X.rounds.PDFlexible Encoding Function which expects the following...
Encoding.Manager.PDInternal Function to make working with different encodings...
Extend.Memory.QLearningPersExpPathExtend Memory by specified experiences
External.Eval.PDEvaluate the current strategy using StratTourn
false.friendA student strategy
fix.price.locExample srategy for the Hotelling game
forgive.slowerA student strategy
Generate.Start.State.PDGenerates Start State for Prisoners Dilemma Game
Generate.Start.State.Simple.GameGenerates Start State for Simple Game
get.against.itself.benchmarkPayoff of strategy against itself
get.antistratGet vector of names of counter strategy
get.benchmarkPayoff of best answer against the strategy
get.conversionConversion factor
Get.Def.Par.A3CGet Default Parameters of A3C.
Get.Def.Par.Neural.NetworkDefine default Parameters of the Neural Network Function
Get.Def.Par.Neural.Network.A3CGet Default Parameters of the Feed-Forward Neural Network for...
Get.Def.Par.Neural.Network.A3C.LSTMGet Default Parameters of the LSTM Neural Network for the A3C...
Get.Def.Par.QLearningPersExpPathDefault Parameters for (improved) Q-Learning
Get.Def.Par.QLearningPersExpPath.Legacy.ThesisOpt.XGBDefault Parameters QLearningPersExpPath of the thesis of...
Get.Def.Par.QLearningPersExpPath.Legacy.v.0.1.6Default Parameters QLearningPersExpPath v.0.1.6
Get.Def.Par.RNNDefine default Parameters of the RNN Function
Get.Def.Par.XGBoostDefault Parameters for XGBoost
Get.Game.Object.PDGet Game Object which fully defines Prisoners Dilemma.
Get.Game.Object.Simple.GameGet Game Object which fully defines simple game.
Get.Game.Param.PDDefines the game environment of the IPD
Get.Par.PDDefines model parameters for 'Prisoners Dilemma'
Get.Par.Simple.GameDefines model parameters for 'Simple Game'
getrichA student strategy
Globaler.Tit.4.TatA student strategy
harryA student strategy
Initialise.A3CSet changeable A3C Parameters.
Initialise.QLearningPersExpPathSet changeable model variables
into.spaaaaceA grad student strategy
mean.tit.for.tatA student strategy
meineStrat2A student strategy
Memory.Random.Play.PDGenerate Memory where strategies play against a random...
Memory.Self.Play.PDGenerate Memory where strategies play against themselves
MindA student strategy
MindHighDA student strategy
Model.strat.maximum.full.TenA strategy to be used after model has been trained
nashtag1A student strategy
net.nice0A prof strategy
net.nice1A variant to net.nice0
net.nice.minus1A variant to net.nice0
net.nice.start1A variant to net.nice0
nice.tit.for.tatA student strategy
nice.tit.for.tat.High.errA student strategy
NN.strat.full.zeroA strategy to be used after model has been trained
NN.strat.mainThe actual strategy after model has been trained
NN.strat.Slim.TenTenA strategy to be used after model has been trained
NN.strat.Slim.TenTen.QLearningA strategy to be used after model has been trained
NN.strat.static.end.TenA strategy to be used after model has been trained
nottitfortatA student strategy
overtakerA student strategy
overtaker.High.errA student strategy
phasesA student strategy
Play.Multiple.Games.QLearningPersExpPathTrain multiple games
Play.On.Strategy.QLearningPersExpPathPlay the game based on strategy
Predict.Neural.NetworkEvaluate Neural Network
Predict.Neural.Network.A3CPredict Neural Network
Predict.RNNEvaluate Recurrent RNN
prep.data.4.shinyPrepare Worker Memory to visualize with shiny
probably.nice.BaseA student strategy
probably.nice.High.errA student strategy
probably.not.so.niceA student strategy
probably.not.that.niceA student strategy
prof.stratA student strategy
pudb.strat2A student strategy
Q.on.hist.PD.QLearningQ-values based on history of IPD
Rainbow.Unicorns.oneA student strategy
Rainbow.Unicorns.one.killerA student strategy
Rainbow.Unicorns.twoA student strategy
Rainbow.Unicorns.two.killerA student strategy
redim.stateChange dimensionality of the state array.
regenbogenA student strategy
Replay.QLearningPersExpPathTrain model of Q learning
ronA student strategy
schachmatt_tournamentA student strategy
screams.in.spaceA grad student strategy
seda.strat2A student strategy
Setup.Neural.NetworkSetup a Neural Network
Setup.Neural.Network.A3CSetup a Feed-Forward Neural Network for the...
Setup.Neural.Network.A3C.LSTMSetup a Neural Network with an LSTM-Layer for the...
Setup.QLearningPersExpPathSets up a model based on model parameters
Setup.RNNSetup a RNN
smooth.averageCalculates a sensible moving average based on smoothing...
smooth.triangleCalculates a sensible moving average based on smoothing...
squishy.the.octopusA student strategy
State.2.Array.PDState to Array for Prisoners Dilemma
State.2.Array.Simple.GameState to Array for Simple Game
State.Transition.PDGet next State of Prisoners Dilemma Game
State.Transition.Simple.GameGet next State of Simple Game
strat1A student strategy
strat2A student strategy
strat2.killerA student strategy
strat3A student strategy
strat4A student strategy
strategoA student strategy
strat.fot.tik.takA student strategy
ta.daaaA student strategy
TakTik1A student strategy
TakTik2A student strategy
tatadaA student strategy
tft.forgive.fastA student strategy
tft.forgive.fast.2A student strategy
tft.forgive.fast.killerA student strategy
tft.forgive.slowA student strategy
tft.forgive.slow.killerA student strategy
the.overtaker.1A student strategy
the.undertaker.1A student strategy
the.undertaker.2A student strategy
TikTak1A student strategy
TikTak2A student strategy
TokTokA student strategy
Train.A3cUse the A3C algorithm to train a model
Train.Neural.NetworkTrain Neural Network
Train.On.Memory.QLearningPersExpPathTrains model based on memory
Train.QLearningPersExpPathTrain a model based on Q-Learning
Train.RNNTrain RNN
Update.Evaluator.QLearningPersExpPathControlled Copying of Models
Update.Memory.QLearningPersExpPathAdd historic Q-Values and Curiosity to memory
viva.PD.StrategyA student strategy
Weighted.DiscountCalculates a weighted Mean
Worker.A3CDefines an Agent based on the A3C-Algorithm
MartinKies/USLR documentation built on Nov. 10, 2019, 5:24 a.m.