rl_arms_get_outcome: Get Arm's Outcome based on its Probability and Reward...
In jdtrat/rlsims: Simulate Reinforcement Learning Agents in R

rl_arms_get_outcome

R Documentation

Get Arm's Outcome based on its Probability and Reward Structure

Description

This function defines the reinforcement delivery for an individual arm, and is used internaly by RL Bandit Agents. With probability prob, it an arm will yield a reinforcement of magnitude; with probability 1 - prob, an arm will yield a reinforcement of alternative (default of zero).

Usage

rl_arms_get_outcome(arm_definitions, action, trial)

Arguments

`arm_definitions`	A list of arm definitions where each element contains a data frame with columns 'probability', 'magnitude', 'alternative', and 'trial' describing, respectively, the `probability` of receiving a reward `magnitude` with the `alternative` for each `trial`.
`action`	A numeric scalar representing which action was selected on a given trial.
`trial`	The trial in which an action was selected.