CVd: Cross-validation using delete-d method.
In bestglm: Best Subset GLM and Regression Utilities


The delete-d method for cross-validation uses a random sample of d observations as the validation sample. This is repeated many times.

CVd(X, y, d = ceiling(n * (1 - 1/(log(n) - 1))), REP = 100, family = gaussian, ...)

`X`	training inputs
`y`	training output
`d`	size of validation sample
`REP`	number of replications
`family`	glm family
`...`	optional arguments passed to `glm` or `lm`

Shao (1993, 1997) suggested the delete-d algorithm implemented in this function. In this algorithm, a random sample of d observations is taken as the validation sample, and the model is fit to the remaining n - d observations. This random sampling is repeated REP times. Shao (1997, p. 234, eqn. 4.5 and p. 236) suggests d = n(1 - 1/(log n - 1)), which is obtained by taking λ_n = log n on page 236 (Shao, 1997). The table lists, for several values of n, this d (rounded up) alongside the validation-sample sizes n/10 and n/5 used in 10-fold and 5-fold cross-validation. As the table shows, Shao's recommended choice of d corresponds to validation samples that are typically much larger than those used in 10-fold or 5-fold cross-validation. LOOCV corresponds to d = 1 only!

n	d	K=10	K=5
50	33	5	10
100	73	10	20
200	154	20	40
500	405	50	100
1000	831	100	200
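
The d column of the table can be reproduced directly from the formula. The following is a minimal sketch in base R; `shao_d` is a hypothetical helper name introduced here for illustration and is not a function exported by bestglm.

```r
# Hypothetical helper (not part of bestglm): Shao's suggested validation
# size d = n(1 - 1/(log n - 1)), rounded up as in the default for d.
shao_d <- function(n) ceiling(n * (1 - 1 / (log(n) - 1)))

n <- c(50, 100, 200, 500, 1000)

# Compare with the validation-sample sizes of 10-fold and 5-fold CV.
sizes <- data.frame(n = n, d = shao_d(n), K10 = n / 10, K5 = n / 5)
print(sizes)
```

For n = 1000, for example, this gives d = 831, so only 169 observations remain for fitting in each replication.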

Vector of two components comprising the cross-validation MSE and its sd based on the MSE in each validation sample.
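
To make the procedure and the shape of the return value concrete, here is a rough sketch of delete-d cross-validation for a linear model in base R. It is an illustration under stated assumptions, not the bestglm source: the name `delete_d_cv_sketch` is hypothetical, only the gaussian case is handled via `lm`, and the second component is taken to be the plain standard deviation of the per-replication MSEs, which is one reading of "sd" above.

```r
# Hypothetical sketch, not the bestglm implementation of CVd.
delete_d_cv_sketch <- function(X, y, d, REP = 100) {
  dat <- data.frame(y = y, X)
  n <- nrow(dat)
  stopifnot(d >= 1, d < n)
  mse <- numeric(REP)
  for (r in seq_len(REP)) {
    val <- sample.int(n, d)                       # d validation rows
    fit <- lm(y ~ ., data = dat[-val, , drop = FALSE])   # fit on n - d rows
    pred <- predict(fit, newdata = dat[val, , drop = FALSE])
    mse[r] <- mean((dat$y[val] - pred)^2)         # MSE on this validation sample
  }
  # Assumption: "sd" is the standard deviation of the REP validation MSEs.
  c(cv = mean(mse), sd = sd(mse))
}

# Simulated data, for illustration only.
set.seed(1)
X <- data.frame(x1 = rnorm(60), x2 = rnorm(60))
y <- 1 + 2 * X$x1 + rnorm(60)
res <- delete_d_cv_sketch(X, y, d = 6, REP = 50)
print(res)
```

With d close to n, the n - d fitting rows must still outnumber the model's coefficients; the sketch does not check this.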

A.I. McLeod and C. Xu

Shao, Jun (1993). Linear Model Selection by Cross-Validation. Journal of the American Statistical Association 88, 486-494.

Shao, Jun (1997). An Asymptotic Theory for Linear Model Selection. Statistica Sinica 7, 221-264.

bestglm, CVHTF, CVDH, LOOCV

# Example 1. delete-d method
# For the training set, n = 67, so 10-fold CV is approximately
# delete-d with d = 7.
library(bestglm)
data(zprostate)
# Keep the rows flagged by column 10, then drop that column.
train <- (zprostate[zprostate[, 10], ])[, -10]
X <- train[, 1:2]   # first two inputs
y <- train[, 9]     # output
set.seed(123321123)
# In practice REP should be set to 1000; 10 is used here to save time.
CVd(X, y, d = 7, REP = 10)

Loading required package: leaps
[1] 0.5172489 0.2731087

bestglm documentation built on March 26, 2020, 7:25 p.m.
