Introduction to seeker

knitr::opts_chunk$set(collapse = TRUE, comment = '#>')

RNA-seq data

The seeker package is designed to be a wrapper around various command-line and R-based tools. The main function is, well, seeker(), which is targeted at processing bulk RNA-seq data. seeker()'s main argument is a list of parameters specifying which steps of RNA-seq data processing to perform and how to perform them. The list of parameters can come from a yaml file, an example of which is shown below.


An empty template yaml file is available at system.file('extdata', 'params_template.yml', package = 'seeker'). You can copy these yaml files to your working directory like so:

for (filename in c('PRJNA600892.yml', 'params_template.yml')) {
  file.copy(system.file('extdata', filename, package = 'seeker'), '.')}

If you've already installed the system dependencies, such as with installSysDeps(), a basic way to run seeker() is then:

library('seeker')
doParallel::registerDoParallel()

yamlPath = 'PRJNA600892.yml'
params = yaml::read_yaml(yamlPath)
seeker(params)

Beware even this minimal example could take some time.

Microarray data

Here you can use the seekerArray() function, which can process data from NCBI GEO and ArrayExpress, and can process raw Affymetrix data stored locally. The main arguments are study and geneIdType. For example:

library('seeker')

study = 'GSE25585'
geneIdType = 'entrez'
seekerArray(study, geneIdType)


Try the seeker package in your browser

Any scripts or data that you put into this service are public.

seeker documentation built on Sept. 11, 2024, 7:54 p.m.