In TeamMacLean/atacr: Analysing Capture Seq Count Data

knitr::opts_chunk$set(
  message = FALSE,
  warning = FALSE
)

Finding windows that correspond to differentially expressed or accessible windows is possible with two related functions in atacr - estimate_fdr() which implements bootstrap t-tests, via the boot package and estimate_bayes_factor() which implements a Bayes factor ANOVA using the BayesFactor package. A tidy dataframe of results is returned in each case.

library(atacr)
normalized_counts <- simulate_counts()
result <- estimate_fdr(normalized_counts,
             treatment_a =  "treatment",
              treatment_b = "control")

Bootstrap t-tests

For simple comparison of two treatments with bootstrap t tests, provide treatment 'a' and 'b' names and the number of bootstrap iterations (default is 10, which is fast for testing code, but useless analytically). You can set the threshold for marking as significant with fdr_level.

 result <- estimate_fdr(normalized_counts,
              treatment_a =  "treatment",
              treatment_b = "control",
              iterations = 100000,
              fdr_level = 0.01)

head(result)

The output has columns as follows:

window - the name of the window with data on this row
t - the value of the t statistic for the first (non-bootstrap) iteration
p_value - the computed p value for the window
fdr - the false detection rate at this window
mean_count_a - the mean count for treatment 'a'
mean_count_b - the mean count for treatment 'b'
sd_a - standard deviation for treatment 'a'
sd_b - standard deviation for treatment 'b'
log2_fc - log 2 of the ratio of the mean counts
is_sig - flag showing whether window was significant according to the level set in the function with parameter fdr_level

To analyse all treatments against a common comparison at once you can use the wrapper function estimate_fdr_multiclass() which requires the name of the common comparison treatment

multi_result <-  estimate_fdr_multiclass(normalized_counts,
              common_control = "control",
              iterations = 100000,
              fdr_level = 0.01)

head(multi_result)

multi_result <-  estimate_fdr_multiclass(normalized_counts,
              common_control = "control")
head(multi_result)

The results here has two extra columns:

a - the name of the treatment
b - the name of the common control

Bayes Factor Analysis

A similar pair of functions is available for Bayes factor analysis. estimate_bayes_factor() for the two-way comparison. The factor argument sets the Bayes factor at which to mark the window as having different counts.

result_bf <-  estimate_bayes_factor(normalized_counts,
                           treatment_a =  "treatment",
                           treatment_b = "control",
                                factor = 2.0)

head(result_bf)

Again, a estimate_bayes_factor_multiclass() function works for all comparisons to a common control.

The results data frame is similar to that from the Bootstrap t methods, with a factor column in place of the t and fdr columns.

EdgeR analysis

The single comparison edgeR analysis returns a dataframe similar to the above methods.

In all the runs of edgeR the estimateDisp() function is used. This means that the edgeR_exact() methods will be increasingly less useful as a greater proportion of windows show differential counts. edgeR is the most powerful method when only a few genes are showing differential counts, use the other methods in other cases.

You can tell edgeR to ignore data with zero counts in all samples using remove_zeros

result_edger <-  edgeR_exact(normalized_counts,
                           treatment_a =  "treatment",
                           treatment_b = "control",
                           remove_zeros = TRUE)

head(result_edger)

The edgeR multiclass variant, edgeR_multiclass() also uses the estimateDisp() function in all cases. The edgeR_multiclass() function does not return a dataframe, instead it returns the native DGELRT objects (see the DGELRT manual for more information) from each comparison in a list() object with names as per the treatment used.

edgeR_multiclass(normalized_counts,"mock", 
  remove_zeros = TRUE)

TeamMacLean/atacr documentation built on May 9, 2019, 4:24 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

TeamMacLean/atacr
Analysing Capture Seq Count Data

In TeamMacLean/atacr: Analysing Capture Seq Count Data

Bootstrap t-tests

Bayes Factor Analysis

EdgeR analysis

R Package Documentation

Browse R Packages

We want your feedback!

TeamMacLean/atacr Analysing Capture Seq Count Data

In TeamMacLean/atacr: Analysing Capture Seq Count Data

Bootstrap t-tests

Bayes Factor Analysis

EdgeR analysis

R Package Documentation

Browse R Packages

We want your feedback!

TeamMacLean/atacr
Analysing Capture Seq Count Data