View source: R/AsynchronousAdvantageActorCritic.R
Defines a TensorFlow graph specifying the loss calculation according to the A3C algorithm.
Define_Graph(model, model.par, game.object)
model A Neural Network e.g. as given by
model.par A list with parameters to set up the Network e.g. as given by
game.object A Game Object (list) as defined by
Returns a list of tensors with the following items:
gradient.clipped Calculated gradients of the network weights.
loss.policy Calculated loss of the policy.
loss.value Calculated value loss.
loss Calculated total loss of the network.
s_ Placeholder for states.
a_ Placeholder for actions.
r_ Placeholder for rewards.
adv_ Placeholder for calculated advantages.
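To make the relationship between the returned loss items concrete, the following is a minimal sketch in plain R (no TensorFlow) of how A3C-style losses are commonly combined. It is not the package's implementation: the helper `a3c_losses`, its arguments, the entropy term, and the coefficients `value.coef` and `entropy.coef` are all assumptions made for illustration. The inputs `actions`, `returns`, and `adv` only loosely mirror the roles of the placeholders `a_`, `r_`, and `adv_`.

```r
# Hypothetical helper, not part of the package: computes A3C-style losses
# for a small batch using plain R vectors instead of TensorFlow tensors.
a3c_losses <- function(probs, actions, returns, values, adv,
                       value.coef = 0.5, entropy.coef = 0.01) {
  # probs:   matrix (batch x n.actions) of policy probabilities
  # actions: integer vector of chosen action indices (1-based)
  # returns: numeric vector of discounted returns
  # values:  numeric vector of critic value estimates
  # adv:     numeric vector of advantages, computed outside this function
  eps <- 1e-10  # avoids log(0)

  # Probability the policy assigned to each action actually taken
  chosen <- probs[cbind(seq_along(actions), actions)]

  # Policy loss: negative log-likelihood weighted by the advantage
  loss.policy <- -mean(log(chosen + eps) * adv)

  # Value loss: mean squared error between returns and value estimates
  loss.value <- mean((returns - values)^2)

  # Entropy of the policy (assumed regularizer that encourages exploration)
  entropy <- -mean(rowSums(probs * log(probs + eps)))

  # Total loss with assumed weighting coefficients
  loss <- loss.policy + value.coef * loss.value - entropy.coef * entropy

  list(loss.policy = loss.policy, loss.value = loss.value,
       entropy = entropy, loss = loss)
}

# Toy batch of two steps with two possible actions
probs   <- matrix(c(0.7, 0.3,
                    0.4, 0.6), nrow = 2, byrow = TRUE)
returns <- c(1.0, 0.5)
values  <- c(0.8, 0.7)
adv     <- returns - values

res <- a3c_losses(probs, actions = c(1L, 2L), returns = returns,
                  values = values, adv = adv)
print(res$loss)
```

In this sketch the advantages are passed in as data rather than derived inside the loss, so they act as fixed weights on the policy term. This matches the idea of a separate placeholder for calculated advantages.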