Description Usage Arguments Details Value Examples
The function Traj() creates a trajectory object for discrete-time RL data.
| 1 | Traj(observations, actions)
 | 
| observations | a list for observations, the number of elements should be
one more than that of  | 
| actions | a list for actions, each element for each time step. | 
The trajectory object is designed to represent typical RL trajectory data:
O_1, A_1, O_2, A_2, …, O_n, A_n, O_{n+1}
This representation is compatible with many RL data generator (for example the data structure of OpenAI Gym library). With user-defined interpreter, observations and actions (or their history) can be encoded/converted to states, rewards and actions under Markov Decision Process (MDP) framework.
a trajectory object (class = "Traj")
| 1 2 3 4 | 
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.