View source: R/read_length_plot.R
rlength_distr | R Documentation |
This function generates read length distributions, displayed as bar plots. Multiple samples and replicates can be handled..
rlength_distr(
data,
sample,
multisamples = "average",
plot_style = "split",
scale_factors = "auto",
transcripts = NULL,
length_range = NULL,
cl = 100,
colour = NULL
)
data |
Either list of data tables or GRangesList object from
|
sample |
Either character string, character string vector or named list
of character string(s)/character string vector(s) specifying the name of
the sample(s) and replicate(s) of interest. If a list is provided, each
element of the list is considered as an independent sample associated with
one ore multiple replicates. Multiple samples and replicates are handled
and visualised according to |
multisamples |
Either "average" or "independent". It specifies how to
handle multiple samples and replicates stored in
|
plot_style |
Either "split", "facet", "dodge" or "mirror". It specifies how to organize and display multiple bar plots:
|
scale_factors |
Either "auto", a named numeric vector or "none". It specifies how read length distributions should be scaled before merging multiple samples (if any):
|
transcripts |
Character string vector listing the name of transcripts to be included in the analysis. Default is NULL, i.e. all transcripts are used. |
length_range |
Integer or integer vector for restricting the plot to a
chosen range of read lengths. Default is NULL, i.e. all read lengths are
used. If specified, this parameter prevails over |
cl |
Integer value in 1,100 specifying a confidence level for restricting the plot to an automatically-defined range of read lengths. The new range is computed according to the most frequent read lengths, which accounts for the cl% of the sample and is defined by discarding the (100-cl)% of read lengths falling in the tails of the read lengths distribution. If multiple samples are analysed, a single range of read lengths is computed such that at least the cl% of all samples is represented. Default is 100. |
colour |
Character string or character string vector specifying the
colour of the bar plot(s). If |
List containing: one or more ggplot object(s) and the data table with the corresponding x- and y-axis values ("plot_dt"); an additional data table with raw and scaled number of reads per length in each sample ("count_dt").
data(reads_list)
## Generate fake samples and replicates
for(i in 2:6){
samp_name <- paste0("Samp", i)
set.seed(i)
reads_list[[samp_name]] <- reads_list[["Samp1"]][sample(.N, 5000)]
}
## Define the list of samples and replicate to use as input
input_samples <- list("S1" = c("Samp1", "Samp2"),
"S2" = c("Samp3", "Samp4", "Samp5"),
"S3" = c("Samp6"))
## Generate the length distribution for a sub-range of read lengths:
example_length_dist <- rlength_distr(reads_list,
sample = input_samples,
multisamples = "average",
plot_style = "facet",
cl = 99,
colour = c("#333f50", "#39827c", "gray70"))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.