RunFIREVAT | R Documentation |
Runs FIREVAT using configuration data. Filters point mutations in the user-specified vcf file based on mutational signature identification and outputs the refined and artifact vcf files as well as metadata related to the refinement process.
RunFIREVAT( vcf.file, vcf.file.genome = "hg19", check.chromosome.name = TRUE, config.file, df.ref.mut.sigs = GetPCAWGMutSigs(), target.mut.sigs = GetPCAWGMutSigsNames(), df.ref.mut.sigs.groups.colors = GetPCAWGMutSigsEtiologiesColors(), sequencing.artifact.mut.sigs = PCAWG.All.Sequencing.Artifact.Signatures, num.cores = 2, output.dir, mode = "ga", init.artifact.stop = 0.05, objective.fn = Default.Obj.Fn, use.suggested.soln = TRUE, ga.type = "real-valued", ga.pop.size = 100, ga.max.iter = 100, ga.run = 50, ga.pmutation = 0.1, ga.preemptive.killing = FALSE, ga.seed = NULL, mutalisk = TRUE, mutalisk.method = "all", mutalisk.must.include.sigs = NULL, mutalisk.random.sampling.count = 20, mutalisk.random.sampling.max.iter = 10, perform.strand.bias.analysis = FALSE, filter.by.strand.bias.analysis = TRUE, filter.by.strand.bias.analysis.cutoff = 0.25, strand.bias.perform.fdr.correction = TRUE, strand.bias.fdr.correction.method = "BH", ref.forward.strand.var = NULL, ref.reverse.strand.var = NULL, alt.forward.strand.var = NULL, alt.reverse.strand.var = NULL, annotate = FALSE, df.annotation.db = NULL, annotated.columns.to.display = NULL, annotation.filter.key.value.pairs = NULL, annotation.filter.condition = "AND", write.vcf = TRUE, report = TRUE, save.rdata = TRUE, save.tsv = TRUE, report.format = "html", verbose = TRUE )
vcf.file |
String value corresponding to input .vcf file. Please provide the full path. |
vcf.file.genome |
Genome assembly of the input .vcf file. The genome should be supported by BSgenome. |
check.chromosome.name |
Boolean value. If TRUE, FIREVAT checks chromosome names |
config.file |
String value corresponding to input configuration file. For more details please refer to ... |
df.ref.mut.sigs |
A data.frame of the reference mutational signatures |
target.mut.sigs |
A character vector of the target mutational signatures from reference mutational signatures. |
df.ref.mut.sigs.groups.colors |
A data.frame of the reference mutational signatures groups and colors |
sequencing.artifact.mut.sigs |
A character vector of the sequencing artifact mutational signatures from reference mutational signatures. |
num.cores |
Number of cores to allocate |
output.dir |
String value of the desired output directory |
mode |
String value. The value should be either 'ga' or 'manual'. |
init.artifact.stop |
Numeric value less than 1. If the sum of sequencing artifact weights in the user-specified original VCF file (i.e. vcf.file) is less than or equal to this value then FIREVAT does not perform variant refinement. Default value is 0.05. Note that this option does not apply if 'mode' is 'manual'. |
objective.fn |
Objective value derivation function. Default: Default.Obj.Fn. |
use.suggested.soln |
Boolean value. If TRUE, then FIREVAT passes the default values of filter variables declared as 'use_in_filter' in the config file to the 'suggestions' parameter of the Genetic Algorithm package. If FALSE, then FIREVAT supplies NULL to the GA package 'suggestions' parameter. FIREVAT also computes baseline performance of each filter variable and uses fittest population from each variable as a suggested solution. |
ga.type |
String value. The value should be either 'binray' or 'real-valued'. |
ga.pop.size |
Integer value of the Genetic Algorithm 'population size' parameter. Default: 100. This value should be set based on the number of filter parameters. Recommendation: 40 per filter parameter. |
ga.max.iter |
Integer value of the Genetic Algorithm 'maximum iterations' parameter. Default: 100. This value should be set based on the number of filter parameters. Recommendation: same as 'ga.pop.size'. |
ga.run |
Integer value of the Genetic Algorithm 'run' parameter. Default: 50. This value should be set based on the 'ga.max.iter' parameter. Recommendation: 25 percent of 'ga.max.iter'. |
ga.pmutation |
Float value of the Genetic Algorithm 'mutation probability' parameter. Default: 0.1. |
ga.preemptive.killing |
If TRUE, then preemptively kills populations that yield greater sequencing artifact weights sum compared to the original mutatational signatures analysis |
ga.seed |
Integer value of the Genetic Algorithm 'seed' parameter. Default: NULL. |
mutalisk |
If TRUE, confirm mutational signature analysis with Mutalisk. Default: TRUE. |
mutalisk.method |
Mutalisk signature identification method. Default: 'random.sampling'. The value can be either 'all' or 'random.sampling'. 'all' uses all target.mut.sigs to identify mutational signatures. 'random.sampling' randomly samples from target.mut.sigs to identify mutational signatures. |
mutalisk.must.include.sigs |
Signatures that must be included in the Mutalisk signature identification A character vector corresponding to the signature names. |
mutalisk.random.sampling.count |
Mutalisk random sampling count. Default: 20. The number of signatures to sample from target.mut.sigs |
mutalisk.random.sampling.max.iter |
Mutalisk random sampling maximum iteration. Default: 10. The number of times Mutalisk randomly samples from target.mut.sigs before determining the candidate signatures. |
perform.strand.bias.analysis |
If TRUE, then performs strand bias analysis. |
filter.by.strand.bias.analysis |
If TRUE, then filters out variants in refined vcf based on strand bias analysis results |
filter.by.strand.bias.analysis.cutoff |
The p.value or q value cutoff for filtering out variants. |
strand.bias.perform.fdr.correction |
If TRUE, then performs false discovery rate correction for strand bias analysis. |
strand.bias.fdr.correction.method |
A string value. Default value is 'BH'. Refer to 'p.adjust()' function method. |
ref.forward.strand.var |
A string value. |
ref.reverse.strand.var |
A string value, |
alt.forward.strand.var |
A string value, |
alt.reverse.strand.var |
A string value, |
annotate |
A boolean value. Default value is TRUE. |
df.annotation.db |
A data.frame. Please refer to |
annotated.columns.to.display |
A character vector. |
annotation.filter.key.value.pairs |
A list. |
annotation.filter.condition |
'AND' or 'OR'. |
write.vcf |
If TRUE, write original/refined/artifact vcfs. Default: TRUE. |
report |
If TRUE, generate report. Default: TRUE. |
save.rdata |
If TRUE, save rdata. Default: TRUE. |
save.tsv |
If TRUE, save tsv. Default: TRUE. |
report.format |
The format of FIREVAT report. We currently only support 'html'. |
verbose |
If TRUE, provides process detail. Default: TRUE. |
A list with the following elements
f = A ggarrange object
graphs = A list of length 3; each element is a ggplot histogram
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.