getDomain | R Documentation |
Extracting taxonomic information from FASTA headers.
getDomain(header)
getPhylum(header)
getClass(header)
getOrder(header)
getFamily(header)
getGenus(header)
getTag(header)
getTaxonomy(header)
header |
A vector of texts, the |
The ConTax data sets (see package microcontx
) are FASTA files
where the Header
contains texts according to a strict format inherited from QIIME:
It always starts with a short text, a Tag, which is a unique identifier for every sequence.
The function getTag
will extract this from the header
.
After the Tag follows one or more tokens. One of these tokens is a string with the following format, inherited from QIIME:
"k__<...>;p__<...>;c__<...>;o__<...>;f__<...>;g__<...>;"
where <...> is some proper text. Here is an example of a proper string:
"k__Bacteria;p__Firmicutes;c__Bacilli;o__Bacillales;f__Staphylococcaceae;g__Staphylococcus;"
The functions getDomain
, ..., getGenus
extracts the
corresponding information from the header
. getTaxonomy
combines all taxonomy extractors, combines these in a table
and imputes missing taxa with parent taxa.
A vector containing the sub-texts extracted from each header
text, but
getTaxonomy
returns a table with the full taxonomy, one row for each input header
Lars Snipen.
contax.trim
, medoids
.
data(contax.trim)
getTag(contax.trim$Header)
getGenus(contax.trim$Header)
getPhylum(contax.trim$Header)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.