example: Example data of HER2-positive breast cancer

Description Usage Format References Examples

Description

We downloaded a set of RNA-seq gene expression data from The Cancer Genome Atlas (TCGA). The data are sequenced on the Illumina GA platform with tissues collected from breast cancer subjects. In particular, we selected 28 samples from the tissue source site “BH”, which are controlled for white female subjects with the HER2-positive biomarkers. After data preprocessing based on nonspecific filtering, a total number of 11,453 genes are kept for subsequent analyses. Among these data are 12 pairs of matched tumor and normal samples, and 4 unmatched tumor samples. The sequencing depths of the selected samples range from 23.80 million reads to 76.08 million reads. In the proposed analysis, we inversely weigh the sequencing samples by their sequencing depths (per million reads).

Usage

1
data("HER2")

Format

The object HER2 is a list with the following elements:

dat

Data matrix with samples in rows and genes in columns. Column names are gene symbols.

Grp

Vector of group labels.

seq.depth

Vector of sequencing depth, corresponding to the rows of dat.

Subj

Vector of subject IDs, corresponding to the rows of dat.

References

The Cancer Genome Atlas (TCGA) https://cancergenome.nih.gov/.

Zhang et al. (2019) Highly efficient hypothesis testing methods for regression-type tests with correlated observations and heterogeneous variance structure. BMC Bioinformatics, 20:185.

Examples

1
2
3
4
5
6
7
8
library(PBtest)
data(HER2)

## attach list elements to the global environment
list2env(HER2,globalenv())

## the PB-transformed t-test, with Sigma estimation
pvec.PB <- PBtest(YY=dat, xx=Grp, test="t", id=Subj, weights=seq.depth)$p.value

yunzhang813/PBtest-R-Package documentation built on March 18, 2020, 5:29 p.m.