View source: R/fitAlpineBiasModel.R
fitAlpineBiasModel | R Documentation |
This function provides a wrapper around some of the functions from the
alpine
package. Given a gtf file and a bam file with reads aligned to
the genome, it will find single-isoform genes (with lengths and expression
levels within given ranges) and use the observed read coverages to fit a
fragment bias model.
fitAlpineBiasModel(gtf, bam, organism, genome, genomeVersion, version,
minLength = 600, maxLength = 7000, minCount = 500, maxCount = 10000,
subsample = TRUE, nbrSubsample = 200, seed = 1, minSize = NULL,
maxSize = NULL, verbose = FALSE)
gtf |
Path to gtf file with genomic features. Preferably in Ensembl format. |
bam |
Path to bam file with read alignments to the genome. |
organism |
The organism (e.g., 'Homo_sapiens'). This argument will be
passed to |
genome |
A |
genomeVersion |
Genome version (e.g., 'GRCh38'). This argument will be
passed to |
version |
The version of the reference annotation (e.g., 90). This
argument will be passed to |
minLength , maxLength |
Minimum and maximum length of single-isoform genes used to fit fragment bias model. |
minCount , maxCount |
Minimum and maximum read coverage of single-isoform genes used to fit fragment bias model. |
subsample |
Whether to subsample the set of single-isoform genes
satisfying the |
nbrSubsample |
If |
seed |
If |
minSize , maxSize |
Smallest and largest fragment size to consider. One or
both of these can be |
verbose |
Logical, whether to print progress messages. |
A list with three elements:
biasModel
:The fitted fragment bias model.
exonsByTx
:A GRangesList
object with exons grouped by transcript.
transcripts
:A
GRanges
object with all the reference transcripts.
Charlotte Soneson, Michael I Love
Soneson C, Love MI, Patro R, Hussain S, Malhotra D, Robinson MD: A junction coverage compatibility score to quantify the reliability of transcript abundance estimates and annotation catalogs. bioRxiv doi:10.1101/378539 (2018).
Love MI, Hogenesch JB, Irizarry RA: Modeling of RNA-seq fragment sequence bias reduces systematic errors in transcript abundance estimation. Nature Biotechnology 34(12):1287-1291 (2016).
## Not run:
gtf <- system.file("extdata/Homo_sapiens.GRCh38.90.chr22.gtf.gz",
package = "jcc")
bam <- system.file("extdata/reads.chr22.bam", package = "jcc")
biasMod <- fitAlpineBiasModel(gtf = gtf, bam = bam,
organism = "Homo_sapiens",
genome = Hsapiens, genomeVersion = "GRCh38",
version = 90, minLength = 230,
maxLength = 7000, minCount = 10,
maxCount = 10000, subsample = TRUE,
nbrSubsample = 30, seed = 1, minSize = NULL,
maxSize = 220, verbose = TRUE)
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.