View source: R/AsynchronousAdvantageActorCritic.R
Defines a TensorFlow graph specifying the loss calculation according to the A3C (Asynchronous Advantage Actor-Critic) algorithm.
Define_Graph(model, model.par, game.object)
| model | A Neural Network e.g. as given by  | 
| model.par | A list with parameters to set up the Network e.g. as given by  | 
| game.object | A Game Object (list) as defined by  | 
Returns a list of tensors with the following items:
gradient.clipped: Calculated gradients of the network weights.
loss.policy: Calculated loss of the policy.
loss.value: Calculated value loss.
loss: Calculated total loss of the network.
s_: Placeholder for states.
a_: Placeholder for actions.
r_: Placeholder for rewards.
adv_: Placeholder for calculated advantages.
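For orientation, the returned loss tensors can be read against the usual textbook form of the A3C objective. The sketch below is an assumption and is not taken from the package source. The symbols \theta (network weights), R_t (the return fed through r_), A_t (the advantage fed through adv_), and the weighting coefficients c_v and c_H are illustrative names. This page does not say whether the graph includes an entropy term, which coefficients it uses, or whether clipping is done by norm or by value.

```latex
% Hedged sketch of the standard A3C loss; the package's actual graph may differ.
\begin{align}
  % Advantage supplied via adv_ (assumed: return minus value estimate)
  A_t &= R_t - V(s_t;\theta) \\
  % loss.policy: policy-gradient loss, with A_t treated as a constant
  L_{\text{policy}} &= -\,\mathbb{E}_t\!\left[\log \pi(a_t \mid s_t;\theta)\, A_t\right] \\
  % loss.value: squared error between return and value estimate
  L_{\text{value}} &= \mathbb{E}_t\!\left[\bigl(R_t - V(s_t;\theta)\bigr)^2\right] \\
  % loss: weighted total; c_v, c_H and the entropy bonus H are assumptions
  L &= L_{\text{policy}} + c_v\, L_{\text{value}}
       - c_H\, \mathbb{E}_t\!\left[H\bigl(\pi(\cdot \mid s_t;\theta)\bigr)\right] \\
  % gradient.clipped: one common choice is clipping by global norm with threshold c
  g_{\text{clipped}} &= g \cdot \min\!\left(1,\ \frac{c}{\lVert g \rVert_2}\right),
  \qquad g = \nabla_\theta L
\end{align}
```

Under this reading, s_ and a_ supply the states and actions at which \pi and V are evaluated, while r_ and adv_ supply R_t and A_t, which are computed outside the graph.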