tsea.expression.normalization: RNA-Seq expression profiles normalization

Description Usage Arguments Details Value Note Author(s) References See Also Examples

Description

To avoid the data bias and adapt better data heterogeneity, before tsea.expression.decode() analysis, the raw discrete RPKM value have to normalized to continuous variable meet the normal distribution before t-test.

Usage

1
2
tsea.expression.normalization(query_mat, 
correction_factor, normalization = "abundance")

Arguments

query_mat

a RNA-seq RPKM object, row name should be gene symbol, and column name should be sample name.

correction_factor

correction_factor, a gene table object contain genes average expression level and standard variance in GTEx database, can be loaded by data(correction_factor).

normalization

normalization methods, c("z-score", "abundance")

Details

As RNA-Seq samples are often heterogeneous, before in-depth analysis, it is necessary to decode tissue heterogeneity to avoid samples with confounding effects. However, the raw discrete RPKM value have to normalized to continuous variable meet the normal distribution before t-test.

Value

A data frame with normalized RNA-Seq expression profiles.

Rows stand for tissue names and columns stand for sample names.

Note

nothing

Author(s)

Guangsheng Pei

References

Pei G., Dai Y., Zhao Z., Jia P. (2019) deTS: Tissue-Specific Enrichment Analysis to decode tissue specificity. Bioinformatics, In submission.

See Also

https://github.com/bsml320/deTS

Examples

1
2
3
4
5
data(query_GTEx)
query_matrix = query_GTEx[,1:2]
data(correction_factor)
query_mat_zscore_nor = tsea.expression.normalization(query_matrix, 
	correction_factor, normalization = "z-score")

deTS documentation built on May 2, 2019, 4:51 a.m.