This function uses an environment function to generate sample experience in the form of state transition tuples.
sampleGridSequence(N, actionSelection = "random",
  control = list(alpha = 0.1, gamma = 0.1, epsilon = 0.1),
  model = NULL, ...)
N
Number of samples.
actionSelection
(optional) Defines the action selection mode of the reinforcement learning agent. Default: "random".
control
(optional) Control parameters defining the behavior of the agent. Default: list(alpha = 0.1, gamma = 0.1, epsilon = 0.1).
model
(optional) Existing model of class
...
Additional parameters passed to function.
A data frame containing the experienced state transition tuples (s, a, r, s_new).
The individual columns are as follows:
State: The current state.
Action: The selected action for the current state.
Reward: The reward in the current state.
NextState: The next state.
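
To make the shape of the returned data concrete, the following base R sketch builds a data frame with the same four columns from a toy environment. It is not the package's implementation: the 2x2 grid, the reward values, and the names `transitions`, `toyEnvironment`, and `sampleToySequence` are all hypothetical and chosen only for illustration. It assumes purely random state and action selection.

```r
# Hypothetical 2x2 grid used only for illustration:
#   s1 s4
#   s2 s3
transitions <- list(
  s1 = c(up = "s1", down = "s2", left = "s1", right = "s4"),
  s2 = c(up = "s1", down = "s2", left = "s2", right = "s3"),
  s3 = c(up = "s4", down = "s3", left = "s2", right = "s3"),
  s4 = c(up = "s4", down = "s3", left = "s1", right = "s4")
)

# Hypothetical environment function: given a state and an action,
# return the next state and a reward (10 when landing on s4, otherwise -1).
toyEnvironment <- function(state, action) {
  nextState <- unname(transitions[[state]][action])
  reward <- if (nextState == "s4") 10 else -1
  list(NextState = nextState, Reward = reward)
}

# Draw N transitions with uniformly random states and actions and
# collect them as (State, Action, Reward, NextState) rows.
sampleToySequence <- function(N) {
  states <- sample(names(transitions), N, replace = TRUE)
  actions <- sample(c("up", "down", "left", "right"), N, replace = TRUE)
  results <- Map(toyEnvironment, states, actions)
  data.frame(
    State = states,
    Action = actions,
    Reward = vapply(results, function(r) r$Reward, numeric(1), USE.NAMES = FALSE),
    NextState = vapply(results, function(r) r$NextState, character(1), USE.NAMES = FALSE),
    stringsAsFactors = FALSE,
    row.names = NULL
  )
}

set.seed(1)
experience <- sampleToySequence(5)
print(experience)
```

Each row is one state transition tuple, so the result can be inspected with the usual data frame tools such as `head()` or `table(experience$Action)`.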