README.md

geneSA v0.1.8

I. Introduction

The package geneSA is built to serve as a support tool for the paper "Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data". A log-rank test in univariate Cox regression analysis with a proportional hazards model is performed to examine an association between each gene and the survival rates of patients separately, and then adjust identified log-rank P-values following Benjamini-Hochberg FDR. Genes with adjusted log-rank P-values (also known as Q-values) <= 0.05 are preserved.

II. Understanding the tool

The following are parameters provided by geneSA:

Please see & download data data_n_code as examples to well grasp the GeneSA's requirement on data structure and its usage.

III. Pipeline

Figure Figure: Pipeline of the package geneSA.

IV. Implementation

Use the following command to install directly from GitHub;

devtools::install_github("huynguyen250896/geneSA")

Call the library;

library(geneSA)

running example:

# exp is a matrix whose rows are samples and columns are genomic features
#>median is up-regulated genes and <median is down regulated genes
exp1 <- apply(exp,2, function(x) ifelse(x > median(x),"up","down")) %>% as.data.frame()

#Make sure samples that in rows of exp1 are also included in rows of clinical_exp and in exactly the same order
all(rownames(exp1) == rownames(clinical_exp))
#[1] FALSE
exp1 = exp1[rownames(clinical_exp),]

#RUN!!!
geneSA(data = exp1, time = clinical_exp$OS_MONTHS, status = clinical_exp$status, Pcut = 0.05, Qcut= 0.05, univariate = T) #univariate survival
geneSA(data = exp1, time = clinical_exp$OS_MONTHS, status = clinical_exp$status, Pcut = 0.05, Qcut= 0.05, univariate = F) #multivariate survival

V. What's new

VI. Citation

Please kindly cite the following paper (and Star this Github repository if you find this tool of interest) if you use the tool in this repo:

Reference Type: Journal Article
Author: Nguyen, Quang-Huy
Le, Duc-Hau
Year: 2020
Title: Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
Journal: Scientific Reports
Volume: 10
Issue: 1
Pages: 20521
Date: 2020/11/25
ISSN: 2045-2322
DOI: 10.1038/s41598-020-77318-1

Feel free to contact Quang-Huy Nguyen for any questions about the code and results.



huynguyen250896/geneSA documentation built on Aug. 3, 2022, 6:57 p.m.