annotatePTC.R: PTC and NMD-sensitivity detection from assembled RNA-seq data

Description Usage Arguments Details Value Author(s) References Examples

Description

PTC and NMD-sensitivity detection from assembled RNA-seq data.

Usage

1
annotatePTC(transcriptData, cds, genomeObject, PTCDistance=50)

Arguments

transcriptData

A SpliceRList object, containing transcript and exon information.

cds

A CDSSet object, containing CDS information.

genomeObject

A BSgenome object, containing sequence for the relevant genome. Contained in BSGenome objects, downloadable from BioConductor.

PTCDistance

A numeric giving the premature termination codon-distance: The minimum distance from a STOP to the final exon-exon junction, for a transcript to be marked as NMD-sensitive.

Details

annotatePTC retrieves sequence data for all exons given in transcriptData, uses the CDS-information in cds to scan for the most upstream reading frame, and translates the mRNA, storing information about the first codon in relation to distance from TTS, distance to the final exon-exon junction, etc. If the STOP distance to the final exon-exon junction is larger than the threshold given in PTCDistance (and the STOP does not fall in the last exon), the STOP is considered premature and the transcript is marked as NMD (nonsense mediated decay) sensitive. For a review of the PTC and NMD mechanism, see Weischenfeldt et al. 2012.

Value

A SpliceRList, with the transcript_features object containing additional columns:

spliceR.cdsPosGenomic, the genomic position of the used START codon.

spliceR.stopPosGenomic, the genomic position of the identified STOP codon,

spliceR.ExonWithStart, the exon which the used START codon falls within,

spliceR.ExonWithStop, the exon which the STOP codon falls within.

spliceR.cdsPosTranscipt, the position relative to transcript start where the used START codon is (measured in nucleotides).

spliceR.stopPosTranscipt, the position relative to transcript start where the found STOP codon is (measured in nucleotides).

spliceR.stopDistance, the distance from the found STOP codon to the last exon-exon junction, realative to transcript start.

spliceR.junctionIndex, the exon number in which the found STOP codon falls when compared to the last exon-exon junction, where 0 is the last exon of the transcript, -1 is the second-last, etc NA, if annotatePTC was not able to find a ORF.

spliceR.PTC, a boolean, indicating whether the transcript is (theoretically) susceptible to nonsense mediated decay. annotatePTC sets this value to TRUE if the stop codon falls if any exon other than the last, and the distance to the final downstream exon-exon junction is larger than PTCDistance (default 50 nt).

Author(s)

Kristoffer Vitting-Seerup, Johannes Waage

References

Vitting-Seerup K , Porse BT, Sandelin A, Waage J. (2014) spliceR: an R package for classification of alternative splicing and prediction of coding potential from RNA-seq data. BMC Bioinformatics 15:81.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
## Not run: 

#Rebuild cummeRbund's internal dataset
cuffDB <- readCufflinks(dir=system.file("extdata", package="cummeRbund"), gtf=system.file("extdata/chr1_snippet.gtf", package="cummeRbund"), genome='hg19' ,rebuild=TRUE)

#Generate SpliceRList from cufflinks data
cuffDB_spliceR <- prepareCuff(cuffDB)

# Require BSgenome object, containing genomic sequence
require("BSgenome.Hsapiens.UCSC.hg19",character.only = TRUE)

#Get CDS from UCSC
ucscCDS <- getCDS(selectedGenome="hg19", repoName="UCSC")

#Annotate with PTCs
cuffDB_spliceR_PTC <- annotatePTC(cuffDB_spliceR, cds=ucscCDS, Hsapiens, PTCDistance=50)


## End(Not run)

spliceR documentation built on May 2, 2018, 2:50 a.m.