Description Usage Arguments Details
View source: R/AsynchronousAdvantageActorCritic.R
Internal function which is called from Train.A3C
.
1 2 | Worker.A3C(worker.number, game.state, model.par, algo.par, algo.var.worker,
game.object, weights, Worker.Episode, model)
|
worker.number |
Number to identify the given worker. |
game.state |
Game State from the last round. |
model.par |
A list as given by |
algo.par |
A list as given by |
algo.var.worker |
A list as given by |
game.object |
A Game Object (list) as defined by |
weights |
Network weights from the Master Network. |
Worker.Episode |
The actual Episode of the Worker. |
model |
A Model as given by |
Returns a list with the following items
training A list containing calculated gradients.
reward Last obtained reward during the current n-steps.
finished Boolean specifiying wether episode is finished or not.
algo.var.Worker List containing the stored experience of the given worker.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.