Description Usage Arguments Details Value Author(s)
View source: R/BED_functions.R
This computes the GC-content of a BED file, and possibly extensions of these regions.
1 2 3 |
binned.bed.file |
BED file with binned region (ie, neither the WES capture BED, nor its cleaned version. See |
genome |
Genome build (one supported by datasets in the |
fasta |
FASTA file corresponding the nucleotidic sequence of the given |
na.to0 |
Replace putative NA values of the computed GC-content by zero. |
nt.add |
A numeric vector corresponding to Window(s) of extension of the BED regions (see |
out.dir |
Output directory. |
return.data |
Return the computed GC-content as a data.frame (and still write it as a BED-like file). |
nthread |
Number of threads for multithreading (see |
binned.bed.file
: This BED file corresponds to the BED generated by A2p::A2p.BAMConv()
, thus corresponding to binning of the capture BED after being processed through A2p::A2p.BedCheck()
.
fasta
: Single FASTA file containing the genomic sequence corresponding to the given genome
. This has to be a samtools-indexed FASTA file.
nt.add
: To simulate different states of degradation of DNA, GC-content can be computed on several tracks corresponding to different sizes of extension of the regions defined by te binned.bed.file
. The nt.add
parameter corresponds to a numeric vector containing the different extension size(s) desired. Please note that these sizes are expressed in nucleotides unit, and that the extension will be performed on both sides of the regions. Example : with an extension size of 50, the genomic region chr1:1000-1100 will get extended to chr1:950-1150. Default value for this parameter is recommended for WES data.
nthread
: When nthread
> 1, multithreading is performed using one track (one extension) per thread, so please do not use a value greater than the length of nt.add
. Please also note that this function can consume large amounts of RAM (example : 3 GB per thread is require for human hg19, to add to the current session used RAM !)
A new BED-like file (with a ".gc" extension) containing the computed GC-content, and returned as a data.frame
if return.data
is set to TRUE
.
Bastien Job
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.