choice: Choose an Action

Description Usage Arguments Value

View source: R/choice.R

Description

Choose an Action

Usage

1
choice(state, epsilon = 0.1, tau = 0.08, softmax = TRUE)

Arguments

state

The state the agent is currently in.

epsilon

The epsilon to be used in epsilon-greedy policy choices.

tau

The tau (temperature) to be used in softmax policy choices.

softmax

Logical: TRUE if softmax policy decisions should be used; FALSE if epsilon-greedy policy decisions should be used. By default, softmax is used.

Value

A string depicting an action (left or right) to take in the current state.


jdtrat/dynaq documentation built on July 24, 2020, 7:18 a.m.