Traj: Trajectory Object
In XiaoqiLu/PhD-Thesis: Regularized Q-Learning

Description Usage Arguments Details Value Examples

The function Traj() creates a trajectory object for discrete-time RL data.

1	Traj(observations, actions)

`observations`	a list for observations, the number of elements should be one more than that of `actions`.
`actions`	a list for actions, each element for each time step.

The trajectory object is designed to represent typical RL trajectory data:

O_1, A_1, O_2, A_2, …, O_n, A_n, O_{n+1}

This representation is compatible with many RL data generator (for example the data structure of OpenAI Gym library). With user-defined interpreter, observations and actions (or their history) can be encoded/converted to states, rewards and actions under Markov Decision Process (MDP) framework.

a trajectory object (class = "Traj")

observations <- list(0, -1, -1, 0, 1, 1)
actions <- list(-1, 1, 1, 0, -1)
tj <- Traj(observations, actions)
tj

XiaoqiLu/PhD-Thesis documentation built on March 1, 2021, 10:49 a.m.

XiaoqiLu/PhD-Thesis index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

XiaoqiLu/PhD-Thesis
Regularized Q-Learning

Traj: Trajectory Object
In XiaoqiLu/PhD-Thesis: Regularized Q-Learning

Description

Usage

Arguments

Details

Value

Examples

Related to Traj in XiaoqiLu/PhD-Thesis...

R Package Documentation

Browse R Packages

We want your feedback!

XiaoqiLu/PhD-Thesis Regularized Q-Learning

Traj: Trajectory Object In XiaoqiLu/PhD-Thesis: Regularized Q-Learning

Description

Usage

Arguments

Details

Value

Examples

Related to Traj in XiaoqiLu/PhD-Thesis...

R Package Documentation

Browse R Packages

We want your feedback!

XiaoqiLu/PhD-Thesis
Regularized Q-Learning

Traj: Trajectory Object
In XiaoqiLu/PhD-Thesis: Regularized Q-Learning