DCp: Identify DCGs (Differential Coexpressed Genes) based on the...

Description Usage Arguments Details Value Author(s) References Examples

Description

A method to pick out DCGs from microarray data based on a novel concept of 'Differential Coexpression Profiles' (DCp) (Yu et al. 2010).

Usage

1
2
3
4
5
6
7
DCp(exprs.1, exprs.2, 
	r.method = c("pearson", "spearman")[1], 
	link.method = c("qth", "rth", "percent")[1], 
	cutoff = 0.25, 
	N = 0, 
	N.type = c("pooled", "gene_by_gene")[1], 
	q.method = c("BH", "holm", "hochberg", "hommel", "bonferroni", "BY", "fdr")[1])

Arguments

exprs.1

a data frame or matrix for condition A, with rows as variables (genes) and columns as samples.

exprs.2

a data frame or matrix for condition B, with rows as variables (genes) and columns as samples.

r.method

a character string indicating which correlation coefficient (or covariance) is to be computed. One of "pearson" (default) or "spearman", can be abbreviated.

link.method

a character string indicating link filtering method, default is 'qth'.

cutoff

cutoff used for link filtration, can be rth, qth, or percent depending on link.method. must be within [0,1].

N

permutation times. If N>0, the permutation step will be implemented. The default value for N is 0.

N.type

a character string indicating permutation type, default is 'pooled'.

q.method

a character string indicating which correction method to be utilized. the default is 'BH'.

Details

DCp starts with a set of gene coexpression value pairs, where each pair is made up with two coexpression values of a gene pair calculated under two different conditions. For a particular gene, a 'condition-specific coexpression profile' is defined as the vector of the coexpression values that relate to it in one condition, and the two condition-specific coexpression profiles of one gene become the two components of the gene's 'differential coexpression profile'. A differential coexpression measure (dC) is calculated from the differential coexpression profile as a length-normalized Euclidean Distance.

Then the samples between the two conditions will be disturbed and the samples will be separated to two conditions. Calculate dC of this condition. Repeat the above process for N times. Pool all the dC together to form a null distribution of dC. The corresponding statistical significance (p-value) is estimated against null statistics (short for pooled). Or calculate p-value of a gene only in this gene's null distribution of dC (short for gene_by_gene).

Value

A table with DCGs will be listed including 'dC' value and profile 'links' or 'dC' value, profile 'links','p.value' and 'q.vaule' value.

Author(s)

Bao-Hong Liu, Hui Yu, Jing Yang

References

Yu H., Liu B-H., et al., Link-specific Quantitative Methods to Identify Differentially Coexpressed Genes and Gene Pairs. Submitted. 2010

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
data(exprs)

## calculate differential coexpressed genes by DCp without permutation
DCp(exprs[1:50,1:16],exprs[1:50,17:63],
	link.method='qth',cutoff=0.25,
	N=0)

## calculate differential coexpressed genes by DCp with 100 times permutation
DCp(exprs[1:50,1:16],exprs[1:50,17:63],
	link.method='qth',cutoff=0.25,
	N=100)

DCGL documentation built on May 1, 2019, 8:38 p.m.