import_peaks_geo: Import peaks: GEO

View source: R/import_peaks_geo.R

import_peaks_geoR Documentation

Import peaks: GEO

Description

Import narrow/broad/generic peaks from GEO, or compute peaks with call_peaks.

Usage

import_peaks_geo(
  ids,
  build,
  query_granges,
  query_granges_build,
  split_chromosomes = FALSE,
  method = "MACSr",
  cutoff = NULL,
  searches = construct_searches(),
  condense_queries = TRUE,
  peaks_dir = tempdir(),
  timeout = 3 * 60,
  nThread = 1,
  verbose = TRUE
)

Arguments

ids

IDs from one of the supported databases. IDs can be at any level: file, sample, or experiment.

query_granges

[Optional] GRanges object indicating which genomic regions to extract from each sample.

query_granges_build

[Optional] Genome build that query_granges is aligned to.

split_chromosomes

Split single-threaded query into multi-threaded query across chromosomes. This is can be helpful especially when calling peaks from large bigWig/bedGraph files. The number of threads used is set by the nThread argument.

method

Method to call peaks with:

  • "MACSr" : Uses MACS3 via bdgpeakcall.

  • "SEACR" : Uses SEACR via find_packages.

cutoff
  • when method="MACSr" : Passed to cutoff argument. Cutoff depends on which method you used for score track. If the file contains pvalue scores from MACS3, score 5 means pvalue 1e-5. If NULL, a reasonable cutoff value will be inferred through a cutoff_analysis.

  • when method="SEACR" : Passed to control argument. Control (IgG) data bedgraph file to generate an empirical threshold for peak calling. Alternatively, a numeric threshold n between 0 and 1 returns the top n fraction of peaks based on total signal within peaks (default: 0.05).

searches

Named list of regex queries.

condense_queries

Condense query_granges by taking the min/max position per chromosome (default: TRUE). This helps to reduce the total number of queries, which can cause memory allocation problems due to repeated calls to the underlying C libraries.

peaks_dir

Directory to save peaks to (only used when calling peaks from bedGraph files).

nThread

When nThread>1, accelerates file importing and peak calling using multi-core parallelisation.

verbose

Print messages.

gsm

GEO GSM id (e.g. "GSM4271282").

Details

Must import new in my function because it seems getGEO forgot to do this (only works when you load the entire GEOquery package first).

Value

Named list of peak files in GRanges format.

Source

curl connection timeout issues

Issue with importing bigWig files in parallel


neurogenomics/PeakyFinders documentation built on March 24, 2024, 4:28 p.m.