sampleGridSequence: Sample grid sequence
In nproellochs/ReinforcementLearning: Model-Free Reinforcement Learning

Description Usage Arguments Value See Also

View source: R/gridworld.R

Function uses an environment function to generate sample experience in the form of state transition tuples.

1 2	sampleGridSequence(N, actionSelection = "random", control = list(alpha = 0.1, gamma = 0.1, epsilon = 0.1), model = NULL, ...)

`N`	Number of samples.
`actionSelection`	(optional) Defines the action selection mode of the reinforcement learning agent. Default: `random`.
`control`	(optional) Control parameters defining the behavior of the agent. Default: `alpha = 0.1`; `gamma = 0.1`; `epsilon = 0.1`.
`model`	(optional) Existing model of class `rl`. Default: `NULL`.
`...`	Additional parameters passed to function.

An dataframe containing the experienced state transition tuples s,a,r,s_new. The individual columns are as follows:

State: The current state.
Action: The selected action for the current state.
Reward: The reward in the current state.
NextState: The next state.

gridworldEnvironment

ReinforcementLearning

nproellochs/ReinforcementLearning documentation built on March 3, 2020, 12:22 a.m.

nproellochs/ReinforcementLearning index

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Tweet to @rdrrHQ

GitHub issue tracker

ian@mutexlabs.com