Description Usage Arguments Value
Choose an Action
1 |
state |
The state the agent is currently in. |
epsilon |
The epsilon to be used in epsilon-greedy policy choices. |
tau |
The tau (temperature) to be used in softmax policy choices. |
softmax |
Logical: TRUE if softmax policy decisions should be used; FALSE if epsilon-greedy policy decisions should be used. By default, softmax is used. |
A string depicting an action (left or right) to take in the current state.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.