View source: R/rnaseq_workflow_qc.R
load_paired_end_qc_data | R Documentation |
Load QC data files generated by MultiQC for various stages of paired-end RNA-seq data processing using the automated RNA-seq data processing WDL workflow.
load_paired_end_qc_data(data_dir)
data_dir |
A string denoting the local path where all the data files are stored. |
The automated RNA-seq data processing workflow produces several tables with various metrics. Additionally, several tables can be exported from the MultiQC HTML report. This function will import all of these data into a list for further processing in R. The assumed data files are
multiqc_fastqc.txt
: An auto-generated workflow file with general FASTQC statistics.
multiqc_hisat2.txt
: An auto-generated workflow file with HISAT2 alignment statistics.
multiqc_rseqc_bam_stat.txt
: An auto-generated workflow file with alignment statistics
generated by RSeQC.
multiqc_rseqc_read_distribution.txt
: An auto-generated workflow file with alignment
categorizations to different types of genomic regions (introns, exons, integenic, etc.).
multiqc_salmon.txt
: An auto-generated workflow file with mapping statistics from Salmon.
multiqc_trimmomatic.txt
: An auto-generated workflow file with trimming and read quality
filtering statistics using Trimmomatic.
rseqc_inner_distance_plot.tsv
: Table exported from the MultiQC HTML report with inner
distance statistics between pairs of reads.
salmon_plot.tsv
: Table exported from the MultiQC HTML report with RNA fragment length
distribution statistics as estimated by Salmon.
fastqc_per_base_sequence_quality_plot.tsv
: Table exported from the MultiQC HTML report with
per base PHRED scores statistics generated by FASTQC.
fastqc_per_sequence_quality_scores_plot.tsv
: Table exported from the MultiQC HTML report
with per sequence PHRED score statistics generated by FASTQC.
fastqc_per_sequence_gc_content_plot.tsv
: Table exported from the MultiQC HTML report with
per sequence GC content statistics generated by FASTQC.
fastqc_sequence_duplication_levels_plot.tsv
: Table exported from the MultiQC HTML report
with sequence duplication statistics generated by FASTQC.
fastqc_adapter_content_plot.tsv
: Table exported from the MultiQC HTML report with adapter
content statistics generated by FASTQC.
rseqc_known_junction_saturation_plot.tsv
: Table exported from the MultiQC HTML report with
junction saturation statistics for known splice junctions.
rseqc_novel_junction_saturation_plot.tsv
: Table exported from the MultiQC HTML report with
junction saturation statistics for novel splice junctions.
A list with several elements:
fastqc
: Data frame imported from multiqc_fastqc.txt
.
hisat2
: Data frame imported from multiqc_hisat2.txt
.
rseqc_bam
: Data frame imported from multiqc_rseqc_bam_stat.txt
.
rseqc_alignment_category
: Data frame imported from multiqc_rseqc_read_distribution.txt
.
salmon
: Data frame imported from multiqc_salmon.txt
.
trimmomatic
: Data frame imported from multiqc_trimmomatic.txt
.
inner_dist
: Data frame imported from rseqc_inner_distance_plot.tsv
.
frag_length
: Data frame imported from salmon_plot.tsv
.
phred_bp
: Data frame imported from fastqc_per_base_sequence_quality_plot.tsv
.
phred_seq
: Data frame imported from fastqc_per_sequence_quality_scores_plot.tsv
.
gc_content
: Data frame imported from fastqc_per_sequence_gc_content_plot.tsv
.
seq_duplication
: Data frame imported from fastqc_sequence_duplication_levels_plot.tsv
.
adapter_content
: Data frame imported from fastqc_adapter_content_plot.tsv
.
known_junction
: Data frame imported from rseqc_known_junction_saturation_plot.tsv
.
novel_junction
: Data frame imported from rseqc_novel_junction_saturation_plot.tsv
.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.