dfpca_div: Degrees of freedom of PCA Models
In mlesnoff/rnirs: Dimension reduction, Regression and Discrimination for Chemometrics

dfpca_div

R Documentation

Degrees of freedom of PCA Models

Description

Monte Carlo estimation of the model complexity (df = number of degrees of freedom) of PCA models. See in particular Ye 1998 and Efron 2004. Other details and references are given in the help page of function (dfplsr_cov.

This is a pedagogical function since the true df of a PCA model can be computed directly. For a PCA of dimension a, df = p + a * (n - 1) + p * a - a^2 (Faber et al. 1994, Faber 2012, Josse & Husson 2012).

Usage


dfpca_div(
    X, ncomp, algo = NULL, 
    ns = 50, eps = 1e-4, seed = NULL, 
    print = TRUE, 
    ...
    )

Arguments

`X`	A `n x p` matrix or data frame of training observations.
`ncomp`	The maximal number of PLS scores (= components = latent variables) to consider.
`algo`	a PCA algorithm. Default to `NULL` (see `pca`).
`ns`	Number of observations in the data receiving perturbation. The maximum is `min(n, p)`.
`eps`	For `dfplsrdiv`. The `epsilon` quantity used for scaling the perturbation analysis.
`seed`	An integer defining the seed for the random simulation, or `NULL` (default). See `set.seed`.
`print`	Logical. If `TRUE`, fitting information are printed.
`...`	Optionnal arguments to pass in the function defined in `algo`.

Value

A list of outputs (see examples), such as:

`df`	The model complexity for the models with `a = 0, 1, ..., ncomp` components.

References

Efron, B., 2004. The Estimation of Prediction Error. Journal of the American Statistical Association 99, 619â632. https://doi.org/10.1198/016214504000000692

Faber, N.M., Buydens, L.M.C., Kateman, G., 1994. Aspects of pseudorank estimation methods based on the eigenvalues of principal component analysis of random matrices. Chemometrics and Intelligent Laboratory Systems 25, 203â226. https://doi.org/10.1016/0169-7439(94)85043-7

Faber, N. (Klaas) M., 2008. Degrees of freedom for the residuals of a principal component analysis â A clarification. Chemometrics and Intelligent Laboratory Systems 93, 80â86. https://doi.org/10.1016/j.chemolab.2008.04.006

Josse, J., Husson, F., 2012. Selecting the number of components in principal component analysis using cross-validation approximations. Computational Statistics & Data Analysis 56, 1869â1879. https://doi.org/10.1016/j.csda.2011.11.012

Ye, J., 1998. On Measuring and Correcting the Effects of Data Mining and Model Selection. Journal of the American Statistical Association 93, 120â131. https://doi.org/10.1080/01621459.1998.10474094

Examples


data(datoctane)

z <- datoctane$X
u <- which(!is.na(rowSums(z)))    ## Removing rows with NAs
X <- z[u, -4]
dim(X)

ncomp <- 20
fm <- dfpca_div(X, ncomp, ns = 30)
zncomp <- 0:ncomp
plot(zncomp, fm$df, xlab = "Nb. Components", ylab = "Degrees of freedom")
## True df
lines(zncomp, fm$dftrue, col = "grey")
fm

mlesnoff/rnirs documentation built on April 24, 2023, 4:17 a.m.

mlesnoff/rnirs index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

mlesnoff/rnirs
Dimension reduction, Regression and Discrimination for Chemometrics

dfpca_div: Degrees of freedom of PCA Models
In mlesnoff/rnirs: Dimension reduction, Regression and Discrimination for Chemometrics

Degrees of freedom of PCA Models

Description

Usage

Arguments

Value

References

Examples

Related to dfpca_div in mlesnoff/rnirs...

R Package Documentation

Browse R Packages

We want your feedback!

mlesnoff/rnirs Dimension reduction, Regression and Discrimination for Chemometrics

dfpca_div: Degrees of freedom of PCA Models In mlesnoff/rnirs: Dimension reduction, Regression and Discrimination for Chemometrics

Degrees of freedom of PCA Models

Description

Usage

Arguments

Value

References

Examples

Related to dfpca_div in mlesnoff/rnirs...

R Package Documentation

Browse R Packages

We want your feedback!

mlesnoff/rnirs
Dimension reduction, Regression and Discrimination for Chemometrics

dfpca_div: Degrees of freedom of PCA Models
In mlesnoff/rnirs: Dimension reduction, Regression and Discrimination for Chemometrics