filterVcfBasic | R Documentation |
Function to remove artifacts and low confidence/quality variant calls.
filterVcfBasic(
vcf,
tumor.id.in.vcf = NULL,
use.somatic.status = TRUE,
snp.blacklist = NULL,
af.range = c(0.03, 0.97),
contamination.range = c(0.01, 0.075),
min.coverage = 15,
min.base.quality = 25,
max.base.quality = 50,
base.quality.offset = 1,
min.supporting.reads = NULL,
error = 0.001,
target.granges = NULL,
remove.off.target.snvs = TRUE,
model.homozygous = FALSE,
interval.padding = 50,
DB.info.flag = "DB"
)
vcf |
|
tumor.id.in.vcf |
The tumor id in the |
use.somatic.status |
If somatic status and germline data is available, then use this information to remove non-heterozygous germline SNPs or germline SNPS with biased allelic fractions. |
snp.blacklist |
A file with blacklisted genomic regions. Must
be parsable by |
af.range |
Exclude variants with allelic fraction smaller or greater than the two values, respectively. The higher value removes homozygous SNPs, which potentially have allelic fractions smaller than 1 due to artifacts or contamination. If a matched normal is available, this value is ignored, because homozygosity can be confirmed in the normal. |
contamination.range |
Count variants in germline databases with allelic fraction in the specified range. If the number of these putative contamination variants exceeds an expected value and if they are found on almost all chromosomes, the sample is flagged as potentially contaminated and extra contamination estimation steps will be performed later on. |
min.coverage |
Minimum coverage in tumor. Variants with lower coverage are ignored. |
min.base.quality |
Minimium base quality in tumor. Requires a |
max.base.quality |
Maximum base quality in tumor. Requires a |
base.quality.offset |
Subtracts the specified value from the base quality score.
Useful to add some cushion for too optimistically calibrated scores.
Requires a |
min.supporting.reads |
Minimum number of reads supporting the alt
allele. If |
error |
Estimated sequencing error rate. Used to calculate minimum
number of supporting reads using |
target.granges |
|
remove.off.target.snvs |
If set to a true value, will remove all SNVs outside the covered regions. |
model.homozygous |
If set to |
interval.padding |
Include variants in the interval flanking regions of
the specified size in bp. Requires |
DB.info.flag |
Flag in INFO of VCF that marks presence in common
germline databases. Defaults to |
A list with elements
vcf |
The filtered |
flag |
A flag ( |
flag_comment |
A comment describing the flagging. |
Markus Riester
calculatePowerDetectSomatic
# This function is typically only called by runAbsolute via
# fun.filterVcf and args.filterVcf.
vcf.file <- system.file("extdata", "example.vcf.gz", package="PureCN")
vcf <- readVcf(vcf.file, "hg19")
vcf.filtered <- filterVcfBasic(vcf)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.