taxonomy-methods: Functions for accessing taxonomic data stored in 'rowData'.
In microbiome/mia: Microbiome analysis

taxonomyRanks

R Documentation

Functions for accessing taxonomic data stored in `rowData`.

Description

These function work on data present in rowData and define a way to represent taxonomic data alongside the features of a SummarizedExperiment.

Usage

taxonomyRanks(x)

taxonomyRankEmpty(
  x,
  rank = taxonomyRanks(x)[1L],
  empty.fields = c(NA, "", " ", "\t", "-", "_")
)

checkTaxonomy(x, ...)

getTaxonomyLabels(x, ...)

mapTaxonomy(x, ...)

## S4 method for signature 'SummarizedExperiment'
taxonomyRanks(x)

## S4 method for signature 'SummarizedExperiment'
taxonomyRankEmpty(
  x,
  rank = taxonomyRanks(x)[1],
  empty.fields = c(NA, "", " ", "\t", "-", "_")
)

## S4 method for signature 'SummarizedExperiment'
checkTaxonomy(x)

setTaxonomyRanks(ranks)

getTaxonomyRanks()

## S4 method for signature 'SummarizedExperiment'
getTaxonomyLabels(
  x,
  empty.fields = c(NA, "", " ", "\t", "-", "_"),
  with.rank = with_rank,
  with_rank = FALSE,
  make.unique = make_unique,
  make_unique = TRUE,
  resolve.loops = resolve_loops,
  resolve_loops = FALSE,
  ...
)

## S4 method for signature 'SummarizedExperiment'
mapTaxonomy(
  x,
  taxa = NULL,
  from = NULL,
  to = NULL,
  use.grepl = use_grepl,
  use_grepl = FALSE
)

IdTaxaToDataFrame(from)

Arguments

`x`	`TreeSummarizedExperiment`.
`rank`	`Character scalar`. Defines a taxonomic rank. Must be a value of `taxonomyRanks()` function.
`empty.fields`	`Character vector`. Defines which values should be regarded as empty. (Default: `c(NA, "", " ", "\t")`). They will be removed if `na.rm = TRUE` before agglomeration.
`...`	additional arguments `lowest.rank`: A lowest taxonomy level to be considered in `getTaxonomyLabels`. Ranks lower than this will be collapsed into rank specified by `lowest.rank`. For example, if genus level is specified, species will be collapsed into genus. If `NULL`, the data is not collapsed. (Default: `NULL`)
`ranks`	`Character vector`. A vector of ranks to be set.
`with.rank`	`Logical scalar`. Should the level be add as a suffix? For example: "Phylum:Crenarchaeota". (Default: `FALSE`)
`with_rank`	Deprecated. Use `with.rank` instead.
`make.unique`	`Logical scalar`. Should the labels be made unique, if there are any duplicates? (Default: `TRUE`)
`make_unique`	Deprecated. Use `make.unique` instead.
`resolve.loops`	`Logical scalar`. Should `resolveLoops` be applied to the taxonomic data? Please note that has only an effect, if the data is unique. (Default: `TRUE`)
`resolve_loops`	Deprecated. Use `resolve.loops` instead.
`taxa`	`Character vector`. Used for subsetting the taxonomic information. If no information is found,`NULL` is returned for the individual element. (Default: `NULL`)
`from`	For `mapTaxonomy`: `character scalar`. A value which must be a valid taxonomic rank. (Default: `NULL`) otherwise a `Taxa` object as returned by `IdTaxa`
`to`	`Character Scalar`. Must be a valid taxonomic rank. (Default: `NULL`)
`use.grepl`	`Logical`. Should pattern matching via `grepl` be used? Otherwise literal matching is used. (Default: `FALSE`)
`use_grepl`	Deprecated. Use `use.grepl` instead.

Details

taxonomyRanks returns, which columns of rowData(x) are regarded as columns containing taxonomic information.

taxonomyRankEmpty checks, if a selected rank is empty of information.

checkTaxonomy checks, if taxonomy information is valid and whether it contains any problems. This is a soft test, which reports some diagnostic and might mature into a data validator used upon object creation.

getTaxonomyLabels generates a character vector per row consisting of the lowest taxonomic information possible. If data from different levels, is to be mixed, the taxonomic level is prepended by default.

IdTaxaToDataFrame extracts taxonomic results from results of IdTaxa.

mapTaxonomy maps the given features (taxonomic groups; taxa) to the specified taxonomic level (to argument) in rowData of the SummarizedExperiment data object (i.e. rowData(x)[,taxonomyRanks(x)]). If the argument to is not provided, then all matching taxonomy rows in rowData will be returned. This function allows handy conversions between different

Taxonomic information from the IdTaxa function of DECIPHER package are returned as a special class. With as(taxa,"DataFrame") the information can be easily converted to a DataFrame compatible with storing the taxonomic information a rowData. Please note that the assigned confidence information are returned as metatdata and can be accessed using metadata(df)$confidence.

Value

taxonomyRanks: a character vector with all the taxonomic ranks found in colnames(rowData(x))
taxonomyRankEmpty: a logical value
mapTaxonomy: a list per element of taxa. Each element is either a DataFrame, a character or NULL. If all character results have the length of one, a single character vector is returned.

Examples

data(GlobalPatterns)
GlobalPatterns
taxonomyRanks(GlobalPatterns)

checkTaxonomy(GlobalPatterns)

table(taxonomyRankEmpty(GlobalPatterns,"Kingdom"))
table(taxonomyRankEmpty(GlobalPatterns,"Species"))

getTaxonomyLabels(GlobalPatterns[1:20,])
# Taxonomy labels represent the lowest taxonomy name that identifies each
# taxa. For instance, they can represent OTUs which does no necessarily
# tell much. In this case, you might want to get the labels with higher
# taxonomy rank
getTaxonomyLabels(GlobalPatterns[1:20,], lowest.rank = "Class")

# mapTaxonomy
## returns the unique taxonomic information
mapTaxonomy(GlobalPatterns)
# returns specific unique taxonomic information
mapTaxonomy(GlobalPatterns, taxa = "Escherichia")
# returns information on a single output
mapTaxonomy(GlobalPatterns, taxa = "Escherichia",to="Family")

# setTaxonomyRanks
tse <- GlobalPatterns
colnames(rowData(tse))[1] <- "TAXA1"

setTaxonomyRanks(colnames(rowData(tse)))
# Taxonomy ranks set to: taxa1 phylum class order family genus species

# getTaxonomyRanks is to get/check if the taxonomic ranks is set to "TAXA1"
getTaxonomyRanks()

microbiome/mia documentation built on April 17, 2025, 7:33 p.m.