Description Usage Arguments Details Value Author(s) References See Also Examples
Gathers several procedures to determine which explanatory variables have an effect
on a dependent variable.
Works whether there are more explanatory variables than observations or not.
Creates an object of class edrSelec
.
1 2 |
Y |
A numeric vector representing the dependent variable (a response vector). |
X |
A matrix representing the quantitative explanatory variables (bind by column). |
H |
When |
K |
The chosen dimension K. |
method |
This character string specifies the selection method. It should be either "CSS", "RSIR" or "SR-SIR". |
pZero |
When |
NZero |
When |
zeta |
When |
rho |
When |
baseEst |
An initial estimate of the EDR space on which each method relies. |
btspSamp |
When |
lassoParam |
When |
The "CSS" method builds NZero
submodels using only pZero
explanatory variables.
It estimates the indices for each of them.
The squared correlation between these indices and those found with the whole set of explanatory variables is computed.
Only the submodels with the highest squared correlation are kept.
The method then counts how many times each explanatory variable appears in these
'best' submodels.
The "RSIR" procedure uses an asymptotic test on each element of the estimated
EDR directions.
It was translated from a Matlab code made by Peng Zeng.
The "SR-SIR" procedure relies on a lasso penalty. The underlying parameter is chosen using the residual information criterion (RIC).
It was written using a R code made by Lexin Li.
edrSelec
returns an object of class edrSelec
, with some of the
following attributes, depending on the value of method
:
scoreVar |
A numeric vector filled with a score for each explanatory variable. Variables that have a high score should be kept. For the "CSS" method, the score is the presence of the variable in the 'best' submodels. For "RSIR", it is one minus the p-value of the test. For the "SR-SIR" procedure, it is a boolean that indicates if the variable should be kept when using the optimal lasso parameter. |
K |
The chosen dimension. |
H |
The chosen number(s) of slices. |
n |
The sample size. |
method |
The variable selection method used. |
X |
The matrix of the quantitative explanatory variables (bind by column). |
Y |
The numeric vector of the dependent variable (a response vector). |
matModels |
A |
matModTop |
A matrix with |
vectSqCor |
A vector containing the squared correlation between indices for each submodel, for the "CSS" method. |
aic |
A vector made of values of the Akaïke information criterion for every lasso parameter considered by the "SR-SIR" procedure. |
bic |
A vector made of values of the Bayesian information criterion for every lasso parameter considered by the "SR-SIR" procedure. |
ric |
A vector made of values of the residual information criterion for every lasso parameter considered by the "SR-SIR" procedure. |
matEDR |
A list which gives, for each lasso parameter studied with the "SR-SIR" procedure, a matrix spanning the estimated EDR space. |
Raphaël Coudret <rcoudret@gmail.com>, Benoît Liquet <benoit.liquet@univ-pau.fr> and Jérôme Saracco <jerome.saracco@math.u-bordeaux1.fr>
Coudret, R., Liquet, B. and Saracco, J. Comparison of sliced inverse regression approaches for underdetermined cases. Journal de la Société Française de Statistique, in press.
Li, L. and Yin, X. (2008). Sliced inverse regression with regularizations. Biometrics, 64(1):124-131.
Zhong, W., Zeng, P., Ma, P., Liu, J. S., and Zhu, Y. (2005). RSIR: regularized sliced inverse regression for motif discovery. Bioinformatics, 21(22):4169-4175.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | ## Not run:
n <- 100
p <- 110
K <- 1
H <- 5:12
NZero <- 1000
pZero <- 10
zeta <- 0.1
beta <- c(1,1,1,1,rep(0,p-4))
U <- matrix(runif(p^2,-0.05,0.05),ncol=p)
X <- rmvnorm(n,sigma=diag(p) + U %*% t(U))
eps <- rnorm(n,sd=10)
Y <- (X%*%beta)^3+eps
result <- edrSelec(Y,X,H,K,"CSS",NZero=NZero, pZero=pZero, zeta=zeta)
summary(result)
plot(result)
## End(Not run)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.