Plot_Coverage | R Documentation |
Generate plotly / ggplot RNA-seq genome and coverage plots from command line. For some quick working examples, see the Examples section below.
Plot_Coverage( se, Event, Gene, seqname, start, end, coordinates, strand = c("*", "+", "-"), zoom_factor, bases_flanking = 100, tracks, track_names = tracks, condition, selected_transcripts, condense_tracks = FALSE, stack_tracks = FALSE, t_test = FALSE, norm_event ) Plot_Genome( se, reference_path, Gene, seqname, start, end, coordinates, zoom_factor, bases_flanking = 100, selected_transcripts, condense_tracks = FALSE ) as_egg_ggplot(p_obj)
se |
A NxtSE object, created by MakeSE.
COV files must be linked to the NxtSE object. To do this, see the example
in MakeSE. Required by |
Event |
The |
Gene |
Whether to use the range for the given |
seqname, start, end |
The chromosome (string) and genomic |
coordinates |
A string specifying genomic coordinates can be given
instead of |
strand |
Whether to show coverage of both strands "*" (default), or from the "+" or "-" strand only. |
zoom_factor |
Zoom out from event. Each level of zoom zooms out by a
factor of 3. E.g. for a query region of chr1:10000-11000, if a
|
bases_flanking |
(Default = |
tracks |
The names of individual samples,
or the names of the different conditions to be plotted. For the latter, set
|
track_names |
The names of the tracks to be displayed. If omitted, the
track_names will default to the input in |
condition |
To display normalised coverage per condition, set this to
the condition category. If omitted, |
selected_transcripts |
(Optional) A vector containing transcript ID or transcript names of transcripts to be displayed on the gene annotation track. Useful to remove minor isoforms that are not relevant to the samples being displayed. |
condense_tracks |
(default |
stack_tracks |
(default |
t_test |
(default |
norm_event |
Whether to normalise by an event different to that given
in "Event". The difference between this and Event is that the genomic
coordinates can be centered around a different |
reference_path |
The path of the reference generated by
BuildReference. Required by |
p_obj |
In |
In RNA sequencing, alignments to spliced transcripts will "skip" over genomic regions of introns. This can be illustrated in a plot using a horizontal genomic axis, with the vertical axis representing the number of alignments covering each nucleotide. As a result, the coverage "hills" represent the expression of exons, and "valleys" to introns.
Different alternatively-spliced isoforms thus produce different coverage patterns. The change in the coverage across an alternate exon relative to its constitutively-included flanking exons, for example, represents its alternative inclusion or skipping. Similarly, elevation of intron valleys represent increased intron retention.
With multiple replicates per sample, coverage is dependent on library size and gene expression. To compare alternative splicing ratios, normalisation of the coverage of the alternate exon (or alternatively retained intron) relative to their constitutive flanking exons, is required. There is no established method for this normalisation, and can be confounded in situations where flanking elements are themselves alternatively spliced.
NxtIRF performs this coverage normalisation using the same method as its
estimate of spliced / intronic transcript abundance using the SpliceOverMax
method (see details section in CollateData). This normalisation can be
applied to correct for library size and gene expression differences between
samples of the same experimental condition. After normalisation, mean and
variance of coverage can be computed as ratios relative to total transcript
abundance. This method can visualise alternatively included genomic regions
including casette exons, alternate splice site usage, and intron retention.
Plot_Coverage
generates plots showing depth of alignments to
the genomic axis. Plots can be generated for individual samples or samples
grouped by experimental conditions. In the latter, mean and 95% confidence
intervals are shown.
Plot_Genome
generates genome transcript tracks only. Protein-coding
regions are denoted by thick rectangles, whereas non-protein coding
transcripts or untranslated regions are denoted with thin rectangles.
Introns are denoted as lines.
A list containing two objects. final_plot is the plotly object. ggplot is a list of ggplot tracks, with:
ggplot[[n]]
is the nth track (where n = 1, 2, 3 or 4).
ggplot[[5]]
contains the T-test track if one is generated.
ggplot[[6]]
always contains the genome track.
Plot_Coverage
: generates plots showing depth of alignments to
the genomic axis. Plots can be generated for individual samples or samples
grouped by experimental conditions. In the latter, mean and 95
intervals are shown.
Plot_Genome
: Generates a plot of transcripts within a given
genomic region, or belonging to a specified gene
as_egg_ggplot
: Coerce the 'Plot_Coverage()' output as a vertically
stacked ggplot, using egg::ggarrange
se <- NxtIRF_example_NxtSE() # Plot the genome track only, with specified gene: p <- Plot_Genome(se, Gene = "SRSF3") p$ggplot # View the genome track, specifying a genomic region via coordinates: p <- Plot_Genome(se, coordinates = "chrZ:10000-20000") p$ggplot # Assign annotation re experimental conditions colData(se)$treatment <- rep(c("A", "B"), each = 3) # Verify that the COV files are linked to the NxtSE object: covfile(se) # Return a list of ggplot and plotly objects p <- Plot_Coverage( se = se, Event = rowData(se)$EventName[1], tracks = colnames(se)[1:4] ) # Display a static ggplot / egg::ggarrange stacked plot: as_egg_ggplot(p) # Display the plotly-based interactive Coverage plot: p$final_plot # Plot the same event but by condition "treatment" p <- Plot_Coverage( se, rowData(se)$EventName[1], tracks = c("A", "B"), condition = "treatment" ) as_egg_ggplot(p)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.