The function Traj() creates a trajectory object for discrete-time reinforcement learning (RL) data.
Traj(observations, actions)
observations | a list of observations; it should contain one more element than actions.
actions | a list of actions, one element per time step.
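As a minimal sketch of the length requirement, the following R code builds the two input lists for a three-step episode. The observation and action values are invented for illustration, and the call to Traj() is guarded because the package that provides it is assumed to be loaded separately.

```r
# Hypothetical three-step episode: n = 3 actions, n + 1 = 4 observations.
observations <- list(0.0, 0.5, 1.2, 0.9)        # O_1, ..., O_4 (made-up values)
actions      <- list("left", "right", "left")   # A_1, ..., A_3 (made-up values)

# The documented requirement: one more observation than actions.
stopifnot(length(observations) == length(actions) + 1)

# Build the trajectory only if Traj() is available in the current session
# (assumption: the package providing it has already been attached).
if (exists("Traj", mode = "function")) {
  traj <- Traj(observations, actions)
}
```

The extra observation is the one recorded after the final action, so an episode with n actions always supplies n + 1 observations.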
The trajectory object is designed to represent typical RL trajectory data:
O_1, A_1, O_2, A_2, …, O_n, A_n, O_{n+1}
This representation is compatible with many RL data generators (for example, the data structure used by the OpenAI Gym library). With a user-defined interpreter, observations and actions (or their histories) can be encoded or converted into states, rewards, and actions under the Markov decision process (MDP) framework.
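The interpreter idea can be illustrated with a small sketch. The function interpret() below is hypothetical and not part of the package: it works on the raw input lists rather than on the Traj object, treats each observation as the state, and uses an assumed reward (the negative absolute value of the next observation) purely for illustration.

```r
# Hypothetical interpreter (not part of the package).
# Converts O_1, A_1, ..., O_n, A_n, O_{n+1} into one row per transition.
interpret <- function(observations, actions) {
  n <- length(actions)
  stopifnot(length(observations) == n + 1)
  next_obs <- unlist(observations[seq_len(n) + 1])    # O_2, ..., O_{n+1}
  data.frame(
    state      = unlist(observations[seq_len(n)]),    # O_1, ..., O_n
    action     = unlist(actions),                     # A_1, ..., A_n
    next_state = next_obs,
    reward     = -abs(next_obs),                      # assumed reward rule
    stringsAsFactors = FALSE
  )
}

# Made-up episode with three time steps.
observations <- list(0.0, 0.5, 1.2, 0.9)
actions      <- list("left", "right", "left")
transitions  <- interpret(observations, actions)
```

Each row of transitions pairs a state and action with the state that follows, which is the form most MDP-based estimators expect; a real interpreter would replace the state and reward rules with ones suited to the problem.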
a trajectory object (class = "Traj")