differential_abundance: Differential abundance analysis

Description Usage Arguments Value Author(s) References Examples

Description

This function finds the features that are significantly differentially abundant in the provided data, using DESeq implementation which models taxa abundance as a negative binomial distribution. See DESeq for more details. The significance of differentially abundant taxa is defined by log2 fold change and pvalue thresholds. These features are then assigned importance using random forest classifer. The measure of importance used in this implementation is mean decrease in accuracy.

Usage

1
2
differential_abundance(physeq, grouping_column, pvalue.threshold = 0.05,
  lfc.threshold = 0, filename = "NB_significant", output_norm = NULL)

Arguments

physeq

(Required). A link[phyloseq]phyloseq object containing merged information of abundance, taxonomic assignment, sample data including the measured variables and categorical information of the samples, and / or phylogenetic tree if available.

grouping_column

(Required). Character string specifying name of a categorical variable that is preffered for grouping the. information, this should be one of the components of grouping vector.

output_norm

(optional). A character string specifying method to be used for transforming abundance data to be used for plotting. note that, this normalisation occurs after DESeq analysis. Therefore, it is strictly for purposes of the output data and not for differential expression analysis.

pvalue.threshold.

Cut off p-value for significance of differentially abundance taxa, default is 0.05.

lfc.threshold.

Threshold for log2 fold change over which significance of differentially expressed taxa is considered.

Value

Returns a ggplot object. This can further be manipulated as preferred by user.

Returns a list of three items:

Author(s)

Alfred Ssekagiri assekagiri@gmail.com, Umer Zeeshan Ijaz Umer.Ijaz@glasgow.ac.uk

References

http://userweb.eng.gla.ac.uk/umer.ijaz/, Umer Ijaz, 2015

Examples

1
2
3
4
5
data(pitlatrine)
physeq <- taxa_level(pitlatrine,"Phylum")
deseq_sig  <- differential_abundance(physeq, grouping_column = "Country")
#plot the significant features
plot_signif(deseq_sig$plotdata) #see function \link[microbiomeSeq]{plot_signif}

umerijaz/microbiomeSeq documentation built on May 30, 2019, 3:13 p.m.