| pre_gtdb_tk | R Documentation |
This function reads and processes the output files from a GTDB-Tk classify workflow.
It combines bacterial (bac120) and archaeal (ar53) classification summaries and phylogenetic trees (if available) into a unified format.
pre_gtdb_tk(classify_dir)
classify_dir |
A character string specifying the path to the GTDB-Tk |
The function performs the following steps:
Checks if the provided directory exists and contains the necessary *.summary.tsv files.
Reads the bacterial backbone tree.
If an archaeal tree file exists, it binds it to the bacterial tree.
Reads and combines all *.summary.tsv files in the directory.
Parses the semicolon-separated classification string into separate columns for each taxonomic rank.
Ensures the resulting taxonomy table has standard ranks (Domain, Phylum, Class, Order, Family, Genus, Species).
A list with two components:
A data frame containing the combined taxonomic classification for all genomes.
The classification column is parsed into standard taxonomic ranks (Domain to Species).
A phylogenetic tree (phylo object) combining the bacterial and (if present) archaeal trees.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.