Files in markdumke/reinforcelearn
Reinforcement Learning

.Rbuildignore
.gitignore
.travis.yml
DESCRIPTION
LICENSE
NAMESPACE
NEWS.md
R/accessor_functions.R
R/agent.R
R/algorithm.R
R/eligibility.R
R/environment.R
R/environment_gridworld.R
R/environment_gym.R
R/environment_mdp.R
R/environment_mountaincar.R
R/experience_replay.R
R/interact.R
R/policy.R
R/reinforcelearn.R
R/tiles.R
R/valuefunction.R
README.Rmd
README.md
_pkgdown.yml
benchmark/Images/qlearning_windygrid-1.png
benchmark/Images/qlearning_windygrid_elig-1.png
benchmark/Images/qlearning_windygrid_expreplay-1.png
benchmark/Images/qlearning_windygrid_neuralnetwork-1.png
benchmark/benchmark_windy_gridworld.Rmd
benchmark/benchmark_windy_gridworld.md
codecov.yml
cran-comments.md
docs/LICENSE.html
docs/articles/agents.R
docs/articles/agents.html
docs/articles/environments.R
docs/articles/environments.html
docs/articles/gridworld.JPG
docs/articles/index.html
docs/articles/mountaincar.JPG
docs/articles/references.bib
docs/authors.html
docs/index.html
docs/jquery.sticky-kit.min.js
docs/link.svg
docs/news/index.html
docs/pkgdown.css
docs/pkgdown.js
docs/pkgdown.yml
docs/reference/CliffWalking.html
docs/reference/Eligibility.html
docs/reference/Environment.html
docs/reference/EpsilonGreedyPolicy.html
docs/reference/GymEnvironment.html
docs/reference/MdpEnvironment.html
docs/reference/QLearning.html
docs/reference/RandomPolicy.html
docs/reference/SoftmaxPolicy.html
docs/reference/ValueNetwork.html
docs/reference/ValueTable.html
docs/reference/figures/logo.png
docs/reference/getEligibilityTraces.html
docs/reference/getReplayMemory.html
docs/reference/getStateValues.html
docs/reference/getValueFunction.html
docs/reference/gridworld.html
docs/reference/index.html
docs/reference/interact.html
docs/reference/makeAgent.html
docs/reference/makeAlgorithm.html
docs/reference/makeEnvironment.html
docs/reference/makePolicy.html
docs/reference/makeReplayMemory.html
docs/reference/makeValueFunction.html
docs/reference/mountainCar.html
docs/reference/nHot.html
docs/reference/reinforcelearn.html
docs/reference/tilecoding.html
docs/reference/windyGridworld.html
docs/reinforcelearn.png
docs/session_info.txt
examples/user_interface.R
man/CliffWalking.Rd
man/Eligibility.Rd
man/Environment.Rd
man/EpsilonGreedyPolicy.Rd
man/GymEnvironment.Rd
man/MdpEnvironment.Rd
man/MountainCar.Rd
man/QLearning.Rd
man/RandomPolicy.Rd
man/SoftmaxPolicy.Rd
man/ValueNetwork.Rd
man/ValueTable.Rd
man/figures/logo.png
man/getEligibilityTraces.Rd
man/getReplayMemory.Rd
man/getStateValues.Rd
man/getValueFunction.Rd
man/gridworld.Rd
man/interact.Rd
man/makeAgent.Rd
man/makeAlgorithm.Rd
man/makeEnvironment.Rd
man/makePolicy.Rd
man/makeReplayMemory.Rd
man/makeValueFunction.Rd
man/nHot.Rd
man/reinforcelearn.Rd
man/tilecoding.Rd
man/windyGridworld.Rd
session_info.txt
tests/testthat.R
tests/testthat/test_accessor_functions.R
tests/testthat/test_agent.R
tests/testthat/test_environment.R
tests/testthat/test_policy.R
vignettes/agents.R
vignettes/agents.Rmd
vignettes/agents.html
vignettes/environments.R
vignettes/environments.Rmd
vignettes/environments.html
vignettes/gridworld.JPG
vignettes/mountaincar.JPG
vignettes/references.bib
markdumke/reinforcelearn documentation built on Nov. 17, 2022, 12:53 a.m.