| unwanted_tax_patterns | R Documentation |
A named character vector of regular expressions used to identify common problematic values in taxonomy tables. Each element is a regex pattern; names provide human-readable descriptions.
Used as the default replace_to_NA argument in verify_tax_table() and
can be reused by other pqverse packages (e.g. dbpq::count_unwanted_tax()).
unwanted_tax_patterns
A named character vector with 17 elements:
"^[Nn][Aa][Nn]?$"
"^[Nn]/[Aa]$"
"^[Nn]one$"
"^$"
"^\\\\s+$"
"[Uu]nclassified"
"[Uu]nknown"
"[Uu]nidentified"
"[Uu]ncultured"
"[Ii]ncertae[_\\\\s]?[Ss]edis"
"^[Mm]etagenome$"
"^[Ee]nvironmental"
"^[kpcofgs]__$"
"^_sp"
"^_species"
"_uc$"
"__X+$"
verify_tax_table()
unwanted_tax_patterns
# Use with grepl to check a value
any(vapply(
unwanted_tax_patterns,
\(pat) grepl(pat, "unclassified"),
logical(1)
))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.