Description Usage Arguments Details Value Author(s) References Examples
scca is used to perform sparse canonical covariance analysis (SCCA). scca3 is the extension of scca to address 3 sets of variables on the same set of subjects.
1 2 |
X: | n-by-p data matrix, where n is the number of subjects and p is the number of variables |
Y: | n-by-q data matrix, where q is the number of variables |
Z: | n-by-r data matrix, where r is the number of variables |
penalty: | "HL" is the unbounded penalty proposed by Lee and Oh (2009). "LASSO" (Tibshirani, 1996), "SCAD" (Fan and Li, 2001) and "SOFT" (soft thresholding) are also available as other penalty options. Default is "HL". |
lamx: | A vector specifying grid points of the tuning parameter for X. Default is (1,2,3). |
lamy: | A vector specifying grid points of the tuning parameter for Y. Default is (1,2,3). |
lamz: | A vector specifying grid points of the tuning parameter for Z. Default is (1,2,3). |
nc: | Number of components (canonical vectors). Default is 1. |
Tuning: | How to find optimal tuning parameters for the sparsity. If tuning="CV.full", then the tuning parameters are selected automatically via K-fold cross-validation by using 2-dim'l grid search. If "CV.alt", then a sequential 1-dim'l search method is applied instead of the 2-dim'l grid search. For scca3, only "CV.alt" is available. Default is "CV.alt". |
K: | Perform K-fold cross-valiadation. Default is 5. |
seed: | Seed number for initialization. A random initial point is generated for tuning="CV.alt". |
center: | The columns of the data matrix are centered to have mean zero. Default is TRUE. |
scale: | The columns of the data matrix are scaled to have variance 1. Default is FALSE. |
Sparse CCA uses a random-effect model approach to obtain sparse regression. This model gives unbounded gains for zero loadings at the origin. Various penalty functions can be adapted as well.
A: | p-by-nc matrix, k-th colum of A corresponds to k-th pattern (canonical vector) for X |
B: | q-by-nc matrix, k-th colum of B corresponds to k-th pattern (canonical vector) for Y |
U: | n-by-nc matrix. k-th column of U corresponds to k-th score associated with k-th pattern for X |
V: | n-by-nc matrix. k-th column of V corresponds to k-th score associated with k-th pattern for Y |
lambda: | nc-by-2 matrix. k-th row of lambda corresponds to the optimal tuning parameters for k-th pattern pairs |
CR: | average cross-validated sample covariance |
Woojoo Lee, Donghwan Lee, Youngjo Lee and Yudi Pawitan
Lee, W., Lee, D., Lee, Y. and Pawitan, Y. (2011) Sparse Canonical Covariance Analysis for High-throughput Data
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 | ## Example 1
## A very simple simulation example
n<-10; p<-50; q<-20
X = matrix(rnorm(n*p),ncol=p)
Y = matrix(rnorm(n*q),ncol=q)
scca(X,Y)
## Example 2
## Simulation setting in Lee et al.(2011)
n<-50; p<-150; q<-100; r<-120
pcc<-5
sig.e<-0.1
sig.mu<-2
tx<-rnorm(pcc,1,0.1)
tx.norm<-sqrt(sum(tx^2))
ax<-tx/tx.norm;
ax<-c(ax,rep(0,p-pcc))
ty<-rnorm(pcc,1,0.1)
ty.norm<-sqrt(sum(ty^2))
by<-ty/ty.norm;
by<-c(by,rep(0,q-pcc))
tz<-rnorm(pcc,1,0.1)
tz.norm<-sqrt(sum(tz^2))
cz<-tz/tz.norm;
cz<-c(cz,rep(0,r-pcc))
mu<-rnorm(n,0,sig.mu)
X<-matrix(0,n,p)
Y<-matrix(0,n,q)
Z<-matrix(0,n,r)
for (i in 1:n){
for (j in 1:pcc){
X[i,j]<-ax[j]*mu[i]+rnorm(1,0,sig.e)
Y[i,j]<-by[j]*mu[i]+rnorm(1,0,sig.e)
Z[i,j]<-cz[j]*mu[i]+rnorm(1,0,sig.e)
}
}
for (i in 1:n){
for (j in (pcc+1):p){
X[i,j]<-rnorm(1,0,sig.e)
}
}
for (i in 1:n){
for (j in (pcc+1):q){
Y[i,j]<-rnorm(1,0,sig.e)
}
}
for (i in 1:n){
for (j in (pcc+1):r){
Z[i,j]<-rnorm(1,0,sig.e)
}
}
scca(X,Y)
scca3(X,Y,Z)
scca(X, Y, penalty="SOFT", lamx=c(0.01,0.05),lamy=c(0.01,0.05), center=FALSE)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.