BiocStyle::markdown()
pngnox11 = function(filename, width, height){
   bitmap(file=filename, width=width, height=height, res=300)
}

Rqc r packageVersion("Rqc") - Quality Control Report

File Information

This table describes input files. reads column can be total number of reads (sample=FALSE) or sample size.

kable(perFileInformation(rqcResultSet))

Per Read Mean Quality Distribution of Files

This plot describe an overview of per read mean quality distribution of all files

rqcReadQualityBoxPlot(rqcResultSet)

Average Quality

This plot describes the average quality pattern by showing on the X-axis quality thresholds and on the Y-axis the percentage of reads that exceed that quality level.

groups <- unique(perFileInformation(rqcResultSet)$group)
for (group in groups) {
  rqcResultSet.sub <- subsetByGroup(rqcResultSet, group)
  print(rqcReadQualityPlot(rqcResultSet.sub) +
          ggtitle(paste("Group", group)))
}

Cycle-specific Average Quality

This plot describes the average quality scores for each cycle of sequencing.

for (group in groups) {
  rqcResultSet.sub <- subsetByGroup(rqcResultSet, group)
  print(rqcCycleAverageQualityPlot(rqcResultSet.sub) +
          ggtitle(paste("Group", group)))
}

PCA Biplot (cycle-specific read average quality)

Biplot from Principal Component Analysis (PCA) of cycle-specific read average quality.

for (group in groups) {
  rqcResultSet.sub <- subsetByGroup(rqcResultSet, group)
  print(rqcCycleAverageQualityPcaPlot(rqcResultSet.sub) +
          ggtitle(paste("Group", group)))
}

Read Frequency

This plot shows the proportion of reads that appeared many times.

for (group in groups) {
  rqcResultSet.sub <- subsetByGroup(rqcResultSet, group)
  print(rqcReadFrequencyPlot(rqcResultSet.sub) +
          ggtitle(paste("Group", group)))
}

Read Length Distribution

Barplot that presents the distribuition of the lengths of the reads available in the FASTQ file.

pairs <- unique(perFileInformation(rqcResultSet)$pair)
for(pair in pairs) {
  rqcResultSet.sub <- subsetByPair(rqcResultSet, pair)
  print(rqcReadWidthPlot(rqcResultSet.sub))
}

Cycle-specific GC Content

Line plot showing the average GC content for every cycle of sequencing.

for (group in groups) {
  rqcResultSet.sub <- subsetByGroup(rqcResultSet, group)
  print(rqcCycleGCPlot(rqcResultSet.sub) +
          ggtitle(paste("Group", group)))
}

Cycle-specific Quality Distribution

Bar plot showing the proportion of quality calls per cycle. Colors are presented in a gradient Red-Blue, where red identifies calls of lower quality. This visualization is preferred as it is cleaner than the boxplots described below.

for(pair in pairs) {
  rqcResultSet.sub <- subsetByPair(rqcResultSet, pair)
  print(rqcCycleQualityPlot(rqcResultSet.sub))
}

Cycle-specific Quality Distribution - Boxplot

Boxplots describing empirical patterns of quality distribution on each cycle of sequencing.

for(pair in pairs) {
  rqcResultSet.sub <- subsetByPair(rqcResultSet, pair)
  print(rqcCycleQualityBoxPlot(rqcResultSet.sub))
}

Cycle-specific Base Call Proportion

This stacked bar plot describes the proportion of each nucleotide called for every cycle of sequencing.

for(pair in pairs) {
  rqcResultSet.sub <- subsetByPair(rqcResultSet, pair)
  print(rqcCycleBaseCallsPlot(rqcResultSet.sub))
}

The line plot shown below contains the same information as the plot above. However, some may find this easier to read when comparing the calling rates for each of the nucleotides.

for(pair in pairs) {
  rqcResultSet.sub <- subsetByPair(rqcResultSet, pair)
  print(rqcCycleBaseCallsLinePlot(rqcResultSet.sub))
}


labbcb/Rqc documentation built on Aug. 2, 2021, 8:43 a.m.