getEISAcompR: getEISAcompR

getEISAcompR    R Documentation

getEISAcompR

Description

This is the main function used to calculate exon/intron split estimates and to compute the transcriptional and post-transcriptional components of each gene, as well as to infer the significance of each regulatory component independently. The pipeline is implemented for a two-group contrast (typically control vs treated) and allows user-defined batch correction. Users must prepare their exon/intron counts and design matrix (optionally including batch information), all covering the same set of selected samples. If the design matrix includes an additional column apart from sample name (1st) and group assignment (2nd), it is treated as an independent batch effect.

Further details: If no batch correction is needed, please use a two-column design matrix only. For the transcriptional (Tc) and post-transcriptional (PTc) components, the larger their absolute values (whether the regulatory influence is negative or positive), the stronger the regulatory effect that can generally be inferred. Please be aware that the absence of significance in the PTc component might indicate the presence of mixed transcriptional and post-transcriptional components affecting the same gene. Any PTc component showing a significant interaction with a transcriptional (Tc) influence detected at the intronic level will be reported as not significant.

Users should compare the canonical differential expression (DE) results and significance for their genes of interest with the corresponding Tc and PTc components in order to extract meaningful information about the putative regulatory influence affecting those genes.

Output interpretation:

Significant PTc + Significant DE = Gene showing post-transcriptional regulatory signal

Significant Tc + Significant DE = Gene showing transcriptional regulatory signal

Non-significant PTc + Significant Tc + Significant DE = Gene showing mixed transcriptional and post-transcriptional regulatory signal
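The interpretation rules above can be written as plain logical expressions. The following is a minimal sketch in base R, assuming the user has already reduced each result table to one TRUE/FALSE significance call per gene; the column names sig_DE, sig_Tc and sig_PTc and the gene labels are hypothetical and are not part of the package output. Because the rules as stated are not mutually exclusive, the sketch reports one flag per rule and does not assume any precedence between them.

```r
# Hypothetical per-gene significance calls (TRUE = significant). In practice
# these would be derived by the user from the resDE, resTc and resPTc tables.
calls <- data.frame(
  gene    = c("geneA", "geneB", "geneC", "geneD"),
  sig_DE  = c(TRUE,  TRUE,  TRUE,  FALSE),
  sig_Tc  = c(FALSE, TRUE,  TRUE,  TRUE),
  sig_PTc = c(TRUE,  FALSE, TRUE,  TRUE),
  stringsAsFactors = FALSE
)

# One logical column per documented rule; a gene may satisfy more than one.
calls$post_transcriptional <- calls$sig_PTc & calls$sig_DE
calls$transcriptional      <- calls$sig_Tc & calls$sig_DE
calls$mixed                <- !calls$sig_PTc & calls$sig_Tc & calls$sig_DE

print(calls)
```

Genes without a significant DE result (geneD here) satisfy none of the rules.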

Usage

getEISAcompR(
  Exons,
  Introns,
  design,
  model = "QLF",
  filterExpr = TRUE,
  percent = 0.5,
  cpm = 1
)

Arguments

Exons

Exonic raw counts (genes in rows and samples in columns).

Introns

Intronic raw counts (genes in rows and samples in columns).

design

Design matrix (1st column = sample names; 2nd column = group assignment; optionally, one additional column with the batch effect).

filterExpr

Boolean to perform filtering based on expression criteria to remove lowly expressed genes (TRUE/FALSE).

percent

Proportion of samples that must reach the minimum expression threshold for a gene to pass filtering (0.5, i.e. 50%, by default).

cpm

Counts-per-million (CPM) expression threshold for filtering lowly expressed genes (1 CPM by default).

model

Type of test to perform for inferring significance in EISA and DE results: quasi-likelihood F-test ("QLF") or likelihood ratio test ("LRT"). Default is "QLF".
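To illustrate how filterExpr, percent and cpm relate to each other, here is a minimal base R sketch of a CPM-based expression filter. It is an assumption about the general technique, not the package's internal code: the exact comparison (>= versus >), the library-size normalization and whether the filter is applied to exonic counts, intronic counts or both may differ in getEISAcompR. The toy counts and the helper names lib_sizes, cpm_matrix and keep are made up for the example.

```r
# Toy exonic counts: 4 genes x 4 samples (values are illustrative only).
exon_counts <- matrix(
  c(500, 450, 520, 480,   # geneA: expressed in every sample
      0,   1,   0,   0,   # geneB: detected in a single sample
    300,   0, 280,   0,   # geneC: expressed in half of the samples
      0,   0,   0,  10),  # geneD: detected in a single sample
  nrow = 4, byrow = TRUE,
  dimnames = list(paste0("gene", LETTERS[1:4]), paste0("S", 1:4))
)

percent <- 0.5  # proportion of samples that must reach the threshold
cpm     <- 1    # minimum counts-per-million

# Counts-per-million: scale each sample (column) by its library size.
lib_sizes  <- colSums(exon_counts)
cpm_matrix <- t(t(exon_counts) / lib_sizes) * 1e6

# Keep genes reaching the CPM threshold in at least `percent` of the samples.
keep     <- rowMeans(cpm_matrix >= cpm) >= percent
filtered <- exon_counts[keep, , drop = FALSE]

print(rownames(filtered))
```

With libraries this small any non-zero count exceeds 1 CPM, so in this toy case the outcome is driven entirely by the proportion of samples in which a gene is detected.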

Value

Object with five tables:

resDE = Differential expression analysis for exonic counts using glmQLFTest from the edgeR package.

resTc = EISA for the transcriptional component effect on each analyzed gene.

resPTc = EISA for the post-transcriptional component effect on each analyzed gene.

Expr_Int = Normalized log2 expression matrix for intronic counts.

Expr_Ex = Normalized log2 expression matrix for exonic counts.
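As a rough illustration of what the Tc and PTc components represent, the sketch below follows the exon/intron split idea of Gaidatzis et al. (reference 1): the change in intronic signal between groups is taken as the transcriptional component, and the change in exonic signal minus the change in intronic signal as the post-transcriptional component. This is an assumption about the underlying concept, not the package's implementation, which additionally performs statistical testing. The toy matrices only mimic the layout of Expr_Int and Expr_Ex, and the groups vector and delta helper are hypothetical.

```r
# Toy normalized log2 expression matrices (genes in rows, samples in columns).
groups   <- c("control", "control", "treated", "treated")
dn       <- list(c("geneA", "geneB"), paste0("S", 1:4))
Expr_Int <- matrix(c(5.0, 5.5, 6.0, 6.5,
                     4.0, 4.0, 4.0, 4.0), nrow = 2, byrow = TRUE, dimnames = dn)
Expr_Ex  <- matrix(c(7.0, 7.5, 8.0, 8.5,
                     6.0, 6.0, 5.0, 5.0), nrow = 2, byrow = TRUE, dimnames = dn)

# Difference of group means (treated minus control) for each gene.
delta <- function(m) {
  rowMeans(m[, groups == "treated", drop = FALSE]) -
    rowMeans(m[, groups == "control", drop = FALSE])
}

d_int <- delta(Expr_Int)
d_ex  <- delta(Expr_Ex)

Tc  <- d_int          # transcriptional component: change in intronic signal
PTc <- d_ex - d_int   # post-transcriptional component: exonic minus intronic change

print(data.frame(Tc = Tc, PTc = PTc))
```

In this toy case geneA changes equally in introns and exons (purely transcriptional pattern), while geneB changes only in exons (post-transcriptional pattern).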

Author(s)

Emilio Mármol Sánchez

References

  1. Gaidatzis D et al. (2015) Analysis of intronic and exonic reads in RNA-seq data characterizes transcriptional and post-transcriptional regulation. Nature Biotechnology, 33, 722–729.

  2. Lawrence M et al. (2013) Software for computing and annotating genomic ranges. PLoS Computational Biology, 9, e1003118.

  3. Liao Y et al. (2019) The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Research, 47, e47.

  4. Robinson MD et al. (2010) edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics, 26, 139–140.

Examples

{
## Not run: 
eisa <- getEISAcompR(Exons=exon_counts, Introns=intron_counts, design=design_matrix,
                     filterExpr=TRUE, percent=0.5, cpm=1)

## End(Not run)

}
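The objects passed in the example are not defined on this page. The following base R sketch shows one way such inputs could be laid out, following the argument descriptions: genes in rows and samples in columns for both count matrices, and a design with sample names first, group assignment second and an optional third batch column. The simulated counts, the column names and the use of a data.frame for the design are assumptions made for illustration.

```r
set.seed(1)
samples <- paste0("S", 1:6)
genes   <- paste0("gene", 1:5)

# Simulated raw counts (illustrative only): genes in rows, samples in columns.
exon_counts   <- matrix(rpois(30, lambda = 200), nrow = 5,
                        dimnames = list(genes, samples))
intron_counts <- matrix(rpois(30, lambda = 50), nrow = 5,
                        dimnames = list(genes, samples))

# Two-column design: sample names (1st) and group assignment (2nd).
design_matrix <- data.frame(
  sample = samples,
  group  = rep(c("control", "treated"), each = 3),
  stringsAsFactors = FALSE
)

# Optional three-column design: the extra column is treated as a batch effect.
design_batch <- cbind(design_matrix, batch = rep(c("b1", "b2", "b3"), times = 2))

# Sanity checks: the same genes and the same samples everywhere.
stopifnot(
  identical(rownames(exon_counts), rownames(intron_counts)),
  identical(colnames(exon_counts), colnames(intron_counts)),
  identical(colnames(exon_counts), design_matrix$sample)
)

# The call itself requires the EISAcompR package and is therefore not run here:
# eisa <- getEISAcompR(Exons = exon_counts, Introns = intron_counts,
#                      design = design_matrix, model = "QLF")
```

Real analyses need far more genes than this toy layout; the sketch only demonstrates the expected shape of the inputs.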

emarmolsanchez/EISAcompR documentation built on May 22, 2024, 2:07 a.m.