library(rlR)
set.seed(123)
knitr::opts_chunk$set(cache = TRUE, collapse = FALSE, dev = "svg", fig.height = 3.5)
knitr::knit_hooks$set(document = function(x){
  gsub("```\n*```r*\n*", "", x)
})
library(reticulate)
os = import("os")
os$environ[["TF_CPP_MIN_LOG_LEVEL"]] = "3"  # silence TensorFlow C++ log messages

rlR: play Atari games

Atari Environment

For Atari games it makes more sense to stack several recent frames: the agent needs to know what is currently happening, and a single frame is not enough to judge the situation. This is what the observ_stack_len parameter controls.

library(rlR)
# Stack the 4 most recent frames and downsample each raw frame before it is passed to the agent.
env = makeGymEnv("Seaquest-v0", observ_stack_len = 4L, state_preprocess = list(fun = rlR:::subsample))

Since the input state is an RGB image, we downsample the state space with the following function:

rlR:::subsample                               # the preprocessing function shipped with rlR
env$overview()                                # summary of observation space and actions
env$snapshot(preprocess = TRUE)               # show a preprocessed frame
env$snapshot(steps = 500, preprocess = FALSE) # raw frame after 500 steps
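
For reference, any function that maps a raw frame to a smaller state can be plugged in through the state_preprocess argument in the same way. A minimal sketch of such a custom function (the grayscale weights and the stride-2 subsampling here are illustrative assumptions, not the body of rlR:::subsample):

my_preprocess = function(x) {
  # x is assumed to be a raw RGB frame (height x width x 3 array)
  gray = 0.299 * x[, , 1] + 0.587 * x[, , 2] + 0.114 * x[, , 3]  # convert to grayscale
  gray[seq(1, nrow(gray), by = 2), seq(1, ncol(gray), by = 2)]   # keep every 2nd pixel
}
# env = makeGymEnv("Seaquest-v0", observ_stack_len = 4L,
#   state_preprocess = list(fun = my_preprocess))
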
conf = getDefaultConf("AgentFDQN")

The rlR package has been optimized to handle the replay memory efficiently. To ensure good performance, you can also use the following parameters, which configure a larger replay memory.

conf$set(replay.batchsize = 32, 
  replay.freq = 1L, 
  console = TRUE, 
  agent.lr.decay = 1, 
  agent.lr = 0.00025, 
  agent.update.target.freq = 1e4,
  replay.memname = "Png", 
  render = FALSE, 
  policy.minEpsilon = 0.1, 
  agent.start.learn = 5e4L, 
  policy.aneal.steps = 1e6,
  replay.mem.size = 1e6, 
  log = FALSE, 
  agent.clip.td = TRUE, 
  policy.decay.type = "decay_linear")
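
With policy.decay.type = "decay_linear", exploration is annealed linearly from its starting value down to policy.minEpsilon over policy.aneal.steps environment steps. A rough sketch of what such a schedule computes (illustrative only, not the package's internal implementation; the starting epsilon of 1.0 is an assumption):

epsilon_at = function(step, eps_start = 1.0, eps_min = 0.1, aneal_steps = 1e6) {
  # linear interpolation from eps_start to eps_min, clipped at eps_min
  max(eps_min, eps_start - (eps_start - eps_min) * step / aneal_steps)
}
epsilon_at(0)     # 1.0  - fully random at the beginning
epsilon_at(5e5)   # 0.55 - halfway through the annealing
epsilon_at(2e6)   # 0.1  - clipped at policy.minEpsilon afterwards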
Convolutional Neural Network Structure

To learn from raw pixels we replace the default network with a convolutional one. A custom brain is a function that takes the state dimension and the number of actions and returns a compiled keras model:

makeCnnCritic = function(state_dim, act_cnt) {
  require("keras")
  # Q-network: two convolutional layers followed by a dense layer and one
  # linear output unit per action.
  model = keras_model_sequential()
  model %>%
    layer_conv_2d(filters = 16, kernel_size = c(8, 8), strides = c(4, 4),
      padding = "same", input_shape = state_dim) %>%
    layer_activation("relu") %>%
    layer_conv_2d(filters = 32, kernel_size = c(4, 4), strides = c(2, 2)) %>%
    layer_activation("relu") %>%
    layer_flatten() %>%
    layer_dense(256) %>%
    layer_activation("relu") %>%
    layer_dense(act_cnt) %>%
    layer_activation("linear")
  opt = optimizer_rmsprop(lr = 0.00025)
  model %>% compile(loss = "mse", optimizer = opt, metrics = "accuracy")
  return(model)
}
agent = initAgent("AgentFDQN", env, conf, custom_brain = TRUE)  # DQN agent with a frozen target network
agent$customizeBrain(list(value_fun = makeCnnCritic))           # plug in the CNN defined above
agent$learn(1L)
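
Learning for a single episode as above mainly verifies that the customized brain is wired up and trains without errors; a real Atari experiment needs far more episodes. The number below is arbitrary and only for illustration:

agent$learn(500L)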

