Define_Graph: Graph for Network Loss according to A3C.
In MartinKies/RLR: Reinforcement Learning with R


View source: R/AsynchronousAdvantageActorCritic.R

Defines a TensorFlow graph specifying the loss calculation according to the A3C algorithm.

Define_Graph(model, model.par, game.object)

`model`	A neural network, e.g. as given by `Setup.Neural.Network.A3c` or `Setup.Neural.Network.A3c.LSTM`.
`model.par`	A list of parameters used to set up the network, e.g. as given by `Get.Def.Par.Neural.Network.A3C` or `Get.Def.Par.Neural.Network.A3C.LSTM`.
`game.object`	A game object (list) as defined by `Get.Game.Object.<NAME>`.

Returns a list of tensors with the following items:

`gradient.clipped`	Calculated gradients of the network weights.
`loss.policy`	Calculated loss of the policy.
`loss.value`	Calculated value loss.
`loss`	Calculated total loss of the network.
`s_`	Placeholder for states.
`a_`	Placeholder for actions.
`r_`	Placeholder for rewards.
`adv_`	Placeholder for calculated advantages.
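
To make the relationship between the returned items more concrete, the following is a minimal base-R sketch of how A3C-style losses are commonly computed from actions, return targets and advantages. It does not use TensorFlow and is not taken from the RLR source. The toy numbers, the names `probs`, `values`, `value.coef` and `entropy.coef`, the use of an entropy bonus, and the reading of `r_` as a discounted return target are all assumptions made for illustration.

```r
# Hypothetical toy batch: 3 time steps, 2 possible actions (values are made up).
probs <- matrix(c(0.7, 0.3,
                  0.4, 0.6,
                  0.5, 0.5), ncol = 2, byrow = TRUE)  # policy output pi(a | s)
values <- c(0.5, 0.2, 0.9)   # value output V(s)

a_   <- c(1, 2, 1)           # actions taken (1-based column indices)
r_   <- c(1.0, 0.0, 1.5)     # assumed to be discounted return targets
adv_ <- r_ - values          # advantages, treated as constants in the policy loss

# Probability the policy assigned to the action actually taken in each step.
p_taken <- probs[cbind(seq_along(a_), a_)]

# Policy loss: negative log-likelihood of the taken actions, weighted by advantage.
loss.policy <- -mean(log(p_taken) * adv_)

# Value loss: mean squared error between return targets and value estimates.
loss.value <- mean((r_ - values)^2)

# Mean policy entropy, often subtracted from the loss to encourage exploration.
entropy <- -mean(rowSums(probs * log(probs)))

# Assumed weighting coefficients (hypothetical, not the package defaults).
value.coef   <- 0.5
entropy.coef <- 0.01

loss <- loss.policy + value.coef * loss.value - entropy.coef * entropy
```

In the graph returned by `Define_Graph`, the corresponding quantities are tensors evaluated by feeding `s_`, `a_`, `r_` and `adv_`; the exact weighting and clipping used there should be checked against the source file.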

MartinKies/RLR documentation built on Dec. 24, 2019, 10:02 p.m.
