generate_slurm_calls: Generate present/absent calls on slurm queuing system

View source: R/runCallsSlurm.R

generate_slurm_callsR Documentation

Generate present/absent calls on slurm queuing system

Description

This function is meant to be used with a cluster where the Slurm queuing system is installed. It processes all steps to generate present/absent calls at RNA-Seq library level. This function does not generate the kallisto indexes. If they are not already generated please run function “'generate_slurm_indexes“' first. Steps of present/absent gene expression calls generation are :

  • Quantifying abundances of transcripts from RNA-Seq libraries

  • Summarizing abundance at gene level

  • generate present/absent expression calls

Usage

generate_slurm_calls(
  kallistoMetadata = new("KallistoMetadata"),
  bgeeMetadata = new("BgeeMetadata"),
  userMetadata = new("UserMetadata"),
  userFile,
  submit_sh_template = NULL,
  slurm_options = NULL,
  rscript_path = NULL,
  modules = NULL,
  submit = TRUE,
  nodes = 10,
  checkTxVersion = FALSE
)

Arguments

kallistoMetadata

A Reference Class KallistoMetadata object (optional) allowing to tune your gene quantification abundance analyze. If no object is provided a new one will be created with default values.

bgeeMetadata

A Reference Class BgeeMetadata object (optional) allowing to choose the version of reference intergenic sequences. If no object is provided a new one will be created with default values.

userMetadata

A Class UserMetadata object (optional). If no object is provided a new one will be created with default values.

userFile

Path to the file where each line corresponds to one abundance quantification to be run. The structure of the file is the same than the 'userFile' used as input of the 'generate_calls_workflow' function. A template of this file can be loaded with the command : “'inputFile <- read.table(system.file("userMetadataTemplate.tsv", package = "BgeeCall"), header = TRUE)“' It is important to keep the same column names.

submit_sh_template

A template of the bash script used to submit the jobs. By default the submition script provided by rslurm is used. Modify only if module dependancies have to be added (like kallisto or R)

slurm_options

A named list of options recognized by sbatch. More details in the documentation of the rslurm::slurm_apply function

rscript_path

The location of the Rscript command. If not specified, defaults to the location of Rscript within the R installation being run.

modules

A list of modules you want to load in the invironment. Should stay empty except if you need to load R and/or kallisto (e.g module add R)

submit

Whether or not to submit the job to the cluster with sbatch. Default value is TRUE

nodes

The (maximum) number of cluster nodes to spread the calculation over. slurm_apply automatically divides params in chunks of approximately equal size to send to each node. Less nodes are allocated if the parameter set is too small to use all CPUs on the requested nodes. By default this number is 10.

checkTxVersion

boolean used to define if BgeeCall check rather transcript version should be removed. Default value is FALSE

Value

generate calls

Examples

## Not run: 
# use function with all default values
userFile <- "/path/to/userList.tsv"
sjobs <- generate_slurm_calls(userFile = userFile)

## End(Not run)

BgeeDB/BgeeCall documentation built on Nov. 10, 2023, 5:40 a.m.