svpls: Function for identfying the optimal ANCOVA model and...

Description Usage Arguments Value Author(s) References See Also Examples

Description

This function calls fitModel repeatedly to fit a series of ANCOVA models along with the standard ANOVA model, to the log transformed gene expression data. The model with the minimum AIC is selected as the optimal one and its corresponding estimated effects are then used to perform a multiple testing of differential expression, over all the genes, using the Benjamini-Hochberg correction.

Usage

1
svpls(k1, k2, Y, pmax = 3, fdr = 0.05)

Arguments

k1

Number of subjects/samples under variety 1.

k2

Number of subjects/samples under variety 2.

Y

The log transformed gene expression data, with genes along the rows and subjects/samples along the columns.

pmax

Maximum number of surrogate variables to be incorporated in the ANCOVA model (means pmax ANCOVA models are fitted to the data). By default, it is taken as 3.

fdr

The specified False Discovery Rate (FDR) for multiple testing of differential expression, using the Benjamini-Hochberg correction. By Default it is taken as 0.05.

Value

opt.model

The optimal model. 1 denotes the standard ANOVA model.

PLS.imp

PlS imputed estimate of the hidden expression heterogeneity, evaluated from the optimal model (applicable only when opt.model>1).

Y.corr

Corrected gene expression matrix after adjusting for the hidden effects (applicable only when opt.model>1).

pvalues

p-values from the tests with the effects estimated from the standard ANOVA model (returned only when opt.model=1.

pvalues.adj

Adjusted p-values after correcting for the hidden effects (applicable only when opt.model>1).

genes

Genes that are deemed to be differentially expressed from the multiple hypotheses testing with effects estimated from the optimal model.

AIC.opt

AIC value for the optimal model.

Author(s)

Sutirtha Chakraborty, Somnath Datta and Susmita Datta.

References

Hirotsugu, A. (1980) Likelihood and the Bayes Procedure. The Institute of Statistical Mathematics, Tokyo., Benjamini, Y and Hochberg, Y (1995) Controlling the false discovery rate : a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society.

See Also

fitModel, hfp

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
## Loading the first dataset
data(hidden_fac.dat)

## Fitting the optimal ANCOVA model to the data gives:
fit <- svpls(10,10,hidden_fac.dat,pmax = 5)

## The optimal ANCOVA model, its AIC value and the positive genes detected from it are given by:
fit$opt.model 

fit$AIC.opt

fit$genes

## The corrected gene expression matrix obtained after removing the effects of
## the hidden variability is given by:

Y.corrected <- fit$Y.corr

Example output

Loading required package: class
Loading required package: pls

Attaching package: 'pls'

The following object is masked from 'package:stats':

    loadings

[1] 5
[1] 51789.12
 [1] "g31"  "g38"  "g42"  "g43"  "g65"  "g33"  "g57"  "g54"  "g30"  "g34" 
[11] "g25"  "g29"  "g41"  "g61"  "g68"  "g51"  "g62"  "g50"  "g55"  "g46" 
[21] "g52"  "g53"  "g63"  "g60"  "g28"  "g69"  "g24"  "g59"  "g40"  "g66" 
[31] "g21"  "g44"  "g27"  "g26"  "g37"  "g45"  "g48"  "g23"  "g39"  "g67" 
[41] "g36"  "g56"  "g49"  "g14"  "g47"  "g64"  "g35"  "g1"   "g70"  "g6"  
[51] "g4"   "g455" "g58"  "g12"  "g8"   "g13"  "g32"  "g7"   "g10"  "g3"  
[61] "g18"  "g22"  "g11"  "g184"

svapls documentation built on May 2, 2019, 9:34 a.m.