BedGC.chr: Compute GC-content of genomic regions defined by a BED file...

Description Usage Arguments Details Value Author(s)

Description

This computes the GC-content of a BED file, and possibly extensions of these regions, using (samtools-indexed) FASTA files for each chromosome.

Usage

1
2
3
EaCoN.BedGC.chr(binned.bed.file = NULL, genome = "hg19", fasta.dir = NULL, na.to0 = TRUE,
nt.add = c(0,50,100, 250, 500, 1000, 2500, 5000), out.dir = getwd(), return.data = FALSE,
nthread = 2)

Arguments

binned.bed.file

BED file with binned region (ie, neither the WES capture BED, nor its cleaned version. See Details).

genome

Genome build (one supported by datasets in the chromosomes package).

fasta.dir

A directory containing (samtools-indexed) FASTA files corresponding the nucleotidic sequence of the given genome chromosomes.

na.to0

Replace putative NA values of the computed GC-content by zero.

nt.add

A numeric vector corresponding to Window(s) of extension of the BED regions (see Details).

out.dir

Output directory.

return.data

Return the computed GC-content as a data.frame (and still write it as a BED-like file).

nthread

Number of threads for multithreading (see Details).

Details

Value

A new BED-like file (with a ".gc" extension) containing the computed GC-content, and returned as a data.frame if return.data is set to TRUE.

Author(s)

Bastien Job


gustaveroussy/EaCoN documentation built on Oct. 20, 2021, 2:41 a.m.