knitr::opts_chunk$set( collapse = TRUE, comment = "#>" )
Variant calling format (VCF) is the final file format that results from various steps of transforming sequencing data into high quality annotated genetic variants. Variants in a VCF file are prone to errors, so, different quality measurements are taken to validate the information in the file. If these measurements are not met, a further step of hard-filtering can be done to remove low quality calls. Genome Analysis Toolkit (GTAK) is an established, well-documented variant calling pipeline published by the broad institute. It provides various quality annotations to detect errors in the different steps of variant calling. This package visualizes these annotations and prints out GATK guidelines on hard-filtering based on the chosen parameter.
This tool aims to guide the user on filtering GATK VCF files by visualizing the quality distribution of variants and providing GTAK filtering threshold values and tips.
The package functions on GATK annotated VCF files. A GATK user needs to convert the VCF into table format using the following bash command:
```{bash, eval=FALSE, tidy = TRUE, size="tiny"} gatk VariantsToTable -V $file\ -F CHROM -F POS -F REF -F ALT \ -F FILTER -F QUAL -F QD -F FS \ -F SOR -F MQ -F MQRankSum -F ReadPosRankSum\ -O /home/alihh/freq/stat/ksu/$file.table
# How to use Install the latest version of vcfqc package by copying the following commands into R console: ```r library(devtools) devtools::install_github("Hatoonli/vcfqc", build_vignettes = TRUE)
Load the package
library(vcfqc)
Then choose a quality parameter from the list below and run it on your VCF file. QD FS SOR MQ MQRankSum ReadPosRankSum
You can read more about each parameter by inquiring about it with a question mark, for example:
?QC
You can preview the performance of the package on the example file provided in /extdata
, to get the file pathway run the following command:
system.file("extdata", "HG001_GRCh38.table", package = "vcfqc")
and to learn more about the file, read the accompanying documentation by running the following command:
?extdata/HG001_GRCh38.table
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.