Description Usage Arguments Value Examples
View source: R/computeReport.R
The function take a path to a genome fasta file and its FAS annotation file (if the annotation file is not provided, the funtion will compute it) and compute the detailed report of the completeness of the genome
1 2 3 4 5 6 |
genome |
The path to the genome fasta file |
fasAnno |
The path to the fas annotation file. It can equal NULL |
root |
The path to the core directory, where the core set is stored within weight_dir, blast_dir, etc. |
coreSet |
The name of the interested core set. The core directory can contains more than one core set and the user must specify the interested core set. The core set will be stored in the folder core_orthologs in subfolder, specify them by the name of the subfolder |
extend |
The output of the function is a phylogenetic profile of the interested genome. It contains 4 files, .phyloprofile, .extended.fa, _reverse.domains and _forward.domains. If extend = TRUE, the files will be appended into the old files in the folder output of the core directory or in the inputed folder by the user with the argument ppDir. If there is no old files in the folder, the output files of the function will be writen in the new files. |
scoreMode |
the mode determines the method to scoring the founded ortholog and how to classify them. Choices: 1, 2, 3, "busco" |
priorityList |
A list contains one or many genome ID of the genomes, which were used to build the core set. The genome ID of this list will be stored with an priority order, the tool look at into the fasta file of each core group and determine with the priority order to determine the references species for each core group. |
cpu |
determines the cores that fDOG and fdogFAS will uses to be run parallel |
blastDir |
The user can replace the blast_dir folder in the core directory by specifying it in this argument. By default is NULL |
weightDir |
The user can replace the weight_dir folder in the core directory by specifying it in this argument. By default is NULL |
output |
The directory which contains the output directory |
outDir |
The user can specify the directory to save the output report file of the completeness of the interested genome by specifying the path to the folder in this argument. By default is NULL |
cleanup |
The fDOG's output is a set of phylogenetic profile of each core group to the interested genome. The phylogenetic profile will be stored into a folder in the core set. The function will merge all the small phylogenetic profile, calculate the FAS score or length to have the whole phylogenetic profile of the interested genome to the core set. This fDOG's output can be reused for all score modes. When cleanup is set to TRUE, the fDOG's output will not be stored to be reused but to be removed |
a report of the completeness of the interested genome with detailed information of every core genes in the core set. Which core gene is "similar" , which is "dissimilar", "duplicated" and "missing"
1 2 3 4 5 6 7 8 | coreFolder <- system.file("extdata", "sample", package = "fCAT")
genome <- system.file("extdata", "HUMAN@9606@3.fa", package = "fCAT")
fasAnno <- system.file("extdata", "HUMAN@9606@3.json", package = "fCAT")
report <- computeReport(genome, fasAnno, coreFolder, "test",
scoreMode = 2, priorityList = c("HUMAN@9606@1"), cpu = 4,
output = getwd()
)
print.data.frame(report)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.