Man pages for nproellochs/ReinforcementLearning
Model-Free Reinforcement Learning

computePolicyComputes the reinforcement learning policy
epsilonGreedyActionSelectionPerforms \varepsilon-greedy action selection
experienceReplayPerforms experience replay
gridworldEnvironmentDefines an environment for a gridworld example
lookupActionSelectionConverts a name into an action selection function
lookupLearningRuleLoads reinforcement learning algorithm
policyComputes the reinforcement learning policy
randomActionSelectionPerforms random action selection
ReinforcementLearningPerforms reinforcement learning
replayExperiencePerforms experience replay
sampleExperienceSample state transitions from an environment function
sampleGridSequenceSample grid sequence
selectEpsilonGreedyActionPerforms \varepsilon-greedy action selection
selectRandomActionPerforms random action selection
stateCreates a state representation for arbitrary objects
tictactoeGame states of 100,000 randomly sampled Tic-Tac-Toe games.
nproellochs/ReinforcementLearning documentation built on March 3, 2020, 12:22 a.m.