oneD_bic: Envelope dimension selection based on 1D-BIC

Description Usage Arguments Details Value References See Also Examples

View source: R/oneD_bic.R

Description

This function selects envelope subspace dimension using 1D-BIC proposed by Zhang, X., & Mai, Q. (2018). The constrained optimization in the 1D algorithm is based on the line search algorithm for optimization on manifold. The algorithm is developed by Wen and Yin (2013) and the Matlab version is in the Matlab package OptM.

Usage

1
oneD_bic(M, U, n, C = 1, maxdim = 10, ...)

Arguments

M

The p-by-p positive definite matrix M in the envelope objective function.

U

The p-by-p positive semi-definite matrix U in the envelope objective function.

n

The sample size.

C

The constant defined in 1D-BIC criterion, the default value is 1.

maxdim

The maximum dimension to consider, maxdim is smaller than p, the default value is 10.

...

Additional user-defined arguments for the line search algorithm:

  • maxiter: The maximal number of iterations.

  • xtol: The convergence tolerance for the relative changes of the consecutive iterates w, e.g., ||w^{(k)} - w^{(k-1)}||_F/√{p}

  • gtol: The convergence tolerance for the gradient of Lagrangian, e.g., ||G^{(k)} - w^{(k)} (G^{(t)})^T w^{(t)}||_F

  • ftol: The convergence tolerance for relative changes of the consecutive objective function values F, e.g., |F^{(k)} - F^{(k-1)}|/(1+|F^{(k-1)}|). Usually, max{xtol, gtol} > ftol

The default values are: maxiter=500; xtol=1e-08; gtol=1e-08; ftol=1e-12.

Details

The objective function F(w) and its gradient G(w) in line search algorithm are:

F(w)=\log|w^T M_k w|+\log|w^T(M_k+U_k)^{-1}w|

G(w) = dF/dw = 2 (w^T M_k w)^{-1} M_k w + 2 (w^T (M_k + U_k)^{-1} w)^{-1}(M_k + U_k)^{-1} w

See Cook, R. D., & Zhang, X. (2016) for more details of the 1D algorithm.

The 1D-BIC criterion is defined as

I(k) = ∑_{j=1}^k φ_j(\hat{w}_j) + Ck\log(n)/n, \quad k = 0,1, …, p,

where C > 0 is a constant, \hat{w} is the 1D solver, the function φ_j is the individual objective function solved by 1D algorithm, n is the sample size. Then the selected dimension u is the one yielding the smallest 1D-BIC I(k). See Zhang, X., & Mai, Q. (2018) for more details.

As suggested by Zhang, X., & Mai, Q. (2018), the number C should be set to its default value C = 1 when there is no additional model assumption or prior information. However, if additional model assumption or prior information are known, C should be set such that Ck best matches the degree-of-freedom or total number of free parameters of the model or estimation procedure. For example, in TRR model where the predictor design matrix is of dimension p \times n, C should be set as p. See Zhang, X., & Mai, Q. (2018) for more details.

Value

bicval

The BIC values for different envelope dimensions.

u

The dimension selected which corresponds to the smallest BIC values.

Gamma

The estimation of envelope subspace basis.

References

Zhang, X. and Mai, Q., 2018. Model-free envelope dimension selection. Electronic Journal of Statistics, 12(2), pp.2193-2216.

Wen, Z. and Yin, W., 2013. A feasible method for optimization with orthogonality constraints. Mathematical Programming, 142(1-2), pp.397-434.

See Also

OptM1D, MenvU_sim

Examples

1
2
3
4
5
6
7
8
##simulate two matrices M and U with an envelope structure
data <- MenvU_sim(p = 20, u = 5, wishart = TRUE, n = 200)
M <- data$M
U <- data$U
bic <- oneD_bic(M, U, n = 200)
## visualization
plot(1:10, bic$bicval, type="o", xlab="Envelope Dimension", ylab="BIC values",
main="Envelope Dimension Selection")

TRES documentation built on Oct. 20, 2021, 9:06 a.m.