gatkDNA: Running realignment and recalibration, GATK

View source: R/gatkDna.R

gatkDNAR Documentation

Running realignment and recalibration, GATK

Description

This function executes the docker container snv.1 where GATK software is used to do INDEL realignment and quality recalibration. This analysis is required only to run mutect1. The bwa index has to be prepared with bwaIndex

Usage

gatkDNA(
  group = c("sudo", "docker"),
  bam.folder = getwd(),
  scratch.folder = "/data/scratch",
  gatk.filename,
  genome.folder,
  threads = 1
)

Arguments

group

a character string. Two options: "sudo" or "docker", depending to which group the user belongs

bam.folder

a character string indicating where bam files generated with bwa.R are located. In this folder should be loacted also the GATK file GenomeAnalysisTK-X.X-0.tar.bz2.

scratch.folder

a character string indicating the scratch folder where docker container will be mounted

gatk.filename

a character string for GenomeAnalysisTK-X.X-0.tar.bz2.

genome.folder

a character string indicating the folder where the indexed reference genome for bwa is located

threads

a number indicating the number of cores to be used from the application

Value

three files: dedup_reads.bam, which is sorted and duplicates marked bam file, dedup_reads.bai, which is the index of the dedup_reads.bam, and dedup_reads.stats, which provides mapping statistics

Examples

## Not run: 
    #downloading fastq files
    system("wget http://130.192.119.59/public/test_R1.fastq.gz")
    system("wget http://130.192.119.59/public/test_R2.fastq.gz")
    #running bwa
    gatkDNA(group="sudo",bam.folder=getwd(), scratch.folder="/data/scratch",
    gatk.filename="GenomeAnalysisTK-3.7.tar.bz2"
    genome.folder="/data/scratch/hg19_bwa", threads=24)

## End(Not run)

kendomaniac/docker4seq documentation built on April 8, 2024, 5:39 p.m.