README.md
In benjaminguinaudeau/executoR:

Executor orchestrates scripts: it can run, schedule and restart the execution of scripts. Scripts are executed in the background (similarly to rstudio jobs). Executor (unlike rstudio jobs) can be used in any environment, even if rstudio is not running.

To create an instance, you simply need to specify a folder, where all skript logs will be saved and an executor_id. If you don’t specify an id, a random one will be automatically attributed.

# Random ID
exec_model <- executor$new(folder = "exec_test")
exec_model$ex_id
#> [1] "d3568bcf"

# Specified id
exec_model <- executor$new(folder = "exec_test", exec_id = "xg15")
exec_model$ex_id
#> [1] "xg15"

To execute a task, you need to:

Register the task in the executor
Start the task

A task is basically a script and a name. Within one executof two tasks cannot have the same name.

To add a task to an existing executor, you need to specify at least two arguments:

a name: general description of the task, that will be used as an id internally
a script path: a path to the script that should be executed

You can eventually specify further arguments

wd: the working directory for the script
env: environment variables that should be provided to the script. This can be used to parametrize scripts.
infinite_loop: when True (default), the task will be restarted each times an errors occurs
period: a string specifying how often the task should executed. “1 hour” to execute the task every hour ; “1 day” for every day ; etc..
start: a timestamp specifying when the interval should start. To execute a task once a day at 13:00, use period = “1 day” and start = lubridate::as_datetime(“2021-02-27 13:00:00”)

## Adding one task
exec_model$add_task(name = "LOL", script = "jobs/test.R", wd = getwd(), env = c("SYMBOL" = "LOL"))
exec_model$add_task(name = "LOL_Schedule", script = "jobs/test.R", wd = getwd(), env = c("SYMBOL" = "LOL_schedule"), infinite_loop = F, period = "2 min")
exec_model$add_task(name = "LOL_Schedule_day", script = "jobs/test.R", wd = getwd(), env = c("SYMBOL" = "LOL_schedule"), infinite_loop = F, period = "day", start = lubridate::as_datetime("2021-02-27 12:50:00", tz = "EST"))
## Adding a serie of task
c("LMFAO", "LOL") %>%
  purrr::walk(~{
    exec_model$add_task(name = .x, script = "jobs/test.R", wd = getwd(), env = c("SYMBOL" = .x))
  })
#> Warning in exec_model$add_task(name = .x, script = "jobs/test.R", wd =
#> getwd(), : Skipping LOL because a task named LOL already exists

Once a task is added, it can be start/stop. Remember that the name provided to the task it the internal id, hence it should be unique.

# Start task
exec_model$start_task(name = "LOL")
# Start all exising tasks
exec_model$start_all()
#> # A tibble: 4 x 5
#>   exec_id name             running infinite_loop period 
#>   <chr>   <chr>            <lgl>   <lgl>         <chr>  
#> 1 xg15    LOL              TRUE    TRUE          ""     
#> 2 xg15    LOL_Schedule     FALSE   FALSE         "2 min"
#> 3 xg15    LOL_Schedule_day FALSE   FALSE         "day"  
#> 4 xg15    LMFAO            FALSE   TRUE          ""
#> ℹ Scheduled: LOL_Schedule
#> ℹ Scheduled: LOL_Schedule_day
#> ℹ Already running: LOL
#> ℹ Starting LMFAO

# Stop task
exec_model$stop_task(name = "LOL")
# Stop all tasks
exec_model$stop_all()

To make sure, the scheduled script are correctly executed or breaking scripts are restarted, the executor needs to be runned. This will basically check every 60 seconds, if a script that should be running is running. In doing so, it will run script that were scheduled for the past 60 seconds and restart scripts that are forever loops.

sleep <- 60 # in seconds
exec_model$keep_restarting(sleep = sleep) # forever loop, you'll need to stop this, once this is started

An overall log is stored in a log vector

exec_model$log
#>  [1] "[ 2021-05-24 12:25:00 ] Initializing"                           
#>  [2] "[ 2021-05-24 12:25:00 ] Adding LOL jobs/test.R"                 
#>  [3] "[ 2021-05-24 12:25:00 ] Adding LOL_Schedule jobs/test.R"        
#>  [4] "[ 2021-05-24 12:25:00 ] Adding LOL_Schedule_day jobs/test.R"    
#>  [5] "[ 2021-05-24 12:25:00 ] Adding LMFAO jobs/test.R"               
#>  [6] "[ 2021-05-24 12:25:00 ] Starting LOL jobs/test.R (pid: 94491)"  
#>  [7] "[ 2021-05-24 12:25:01 ] Starting LMFAO jobs/test.R (pid: 94511)"
#>  [8] "[ 2021-05-24 12:25:01 ] Stopping LOL"                           
#>  [9] "[ 2021-05-24 12:25:01 ] Stopping LOL"                           
#> [10] "[ 2021-05-24 12:25:01 ] Stopping LOL_Schedule"                  
#> [11] "[ 2021-05-24 12:25:01 ] Stopping LOL_Schedule_day"              
#> [12] "[ 2021-05-24 12:25:01 ] Stopping LMFAO"

To know the existing tasks in an executor, you can take a look at exec_model$task

exec_model$tasks
#> # A tibble: 4 x 11
#>   exec_id name       script    wd        stamp               status   pid env   
#>   <chr>   <chr>      <chr>     <chr>     <dttm>              <chr>  <dbl> <list>
#> 1 xg15    LOL        jobs/tes… /bgr/exe… 2021-05-24 12:25:00 stopp… 94491 <chr …
#> 2 xg15    LOL_Sched… jobs/tes… /bgr/exe… 2021-05-24 12:25:00 stopp…    NA <chr …
#> 3 xg15    LOL_Sched… jobs/tes… /bgr/exe… 2021-05-24 12:25:00 stopp…    NA <chr …
#> 4 xg15    LMFAO      jobs/tes… /bgr/exe… 2021-05-24 12:25:00 stopp… 94511 <chr …
#> # … with 3 more variables: infinite_loop <lgl>, period <chr>, start <dttm>

If you’re interested in which scripts are running, you can use the following function:

exec_model$list_running_task()
#> # A tibble: 4 x 5
#>   exec_id name             running infinite_loop period 
#>   <chr>   <chr>            <lgl>   <lgl>         <chr>  
#> 1 xg15    LOL              FALSE   TRUE          ""     
#> 2 xg15    LOL_Schedule     FALSE   FALSE         "2 min"
#> 3 xg15    LOL_Schedule_day FALSE   FALSE         "day"  
#> 4 xg15    LMFAO            FALSE   TRUE          ""

exec_model$list_running_task(next_run = T)
#> # A tibble: 4 x 7
#>   exec_id name             running infinite_loop period  start              
#>   <chr>   <chr>            <lgl>   <lgl>         <chr>   <dttm>             
#> 1 xg15    LOL              FALSE   TRUE          ""      2021-05-24 11:25:00
#> 2 xg15    LOL_Schedule     FALSE   FALSE         "2 min" 2021-05-24 11:25:00
#> 3 xg15    LOL_Schedule_day FALSE   FALSE         "day"   2021-02-27 12:50:00
#> 4 xg15    LMFAO            FALSE   TRUE          ""      2021-05-24 11:25:00
#> # … with 1 more variable: next_run <dttm>

Each script produces one output, saved in the log_folder specified when creating the executor.

If you navigate to "test", you will see that each script has one output file LOL.txt. If the script breaks and is restarted, the previous output is archived with a timestamp, so that we can know which error happened. So LOL_111111111.txt is the output of a previous run of the task LOL. Using this output, you can communicate with what’s happening in the process.

exec_model$read_out(name = "LOL", n_tail = 50) %>% glimpse # Read the last 50 lines of the stream output of the task LOL
#>  chr [1:4] "[1] \"xg15\"" "[1] \"LOL\"" "[1] \"2021-05-24 12:25:00 EDT\"" ...

benjaminguinaudeau/executoR documentation built on June 11, 2022, 12:33 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Tweet to @rdrrHQ

GitHub issue tracker

ian@mutexlabs.com