califano-lab/NaRnEA: Nonparametric analytical Rank-based Enrichment Analysis - An Information Theoretic Framework for Protein Activity Measurement

Nonparametric analytical Rank-based Enrichment Analysis (NaRnEA) is a newly developed gene set analysis method which leverages an analytical null model derived under the Principle of Maximum Entropy. NaRnEA critically improves over two widely used methods – Gene Set Enrichment Analysis (GSEA) and analytical Rank-based Enrichment Analysis (aREA) – as shown by differential activity measurement of ~2,500 transcriptional regulatory proteins across three cohorts in The Cancer Genome Atlas (TCGA) based on the enrichment of their transcriptional targets in differentially expressed genes. Independent phenotype-matched proteomic data from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) was used to compare differential protein abundance and differential protein activity, confirming the accuracy of measurements made by NaRnEA. Our analysis crucially demonstrates that the sample-shuffling empirical null models leveraged by GSEA and aREA for evaluating gene set enrichment are overly conservative, a shortcoming that is overcome by NaRnEA’s optimal analytical null model.

Getting started

Package details

Maintainer
License`use_mit_license()`, `use_gpl3_license()` or friends to pick a license
Version0.2.0
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("califano-lab/NaRnEA")
califano-lab/NaRnEA documentation built on March 10, 2023, 7:18 p.m.