ordinalNetCV | R Documentation |

The data is divided into K folds. `ordinalNet`

is fit *K* times, each time
leaving out one fold as a test set. For each of the *K* model fits, lambda
can be tuned by AIC or BIC, or cross validation. If cross validation is used,
the user can choose whether to user the best average out-of-sample log-likelihood,
misclassification rate, Brier score, or percentage of deviance explained.
The user can also choose the number of cross validation folds to use for tuning.
Once the model is tuned, the out of sample log-likelihood,
misclassification rate, Brier score, and percentage of deviance explained
are calculated on the held out test set.

ordinalNetCV( x, y, lambdaVals = NULL, folds = NULL, nFolds = 5, nFoldsCV = 5, tuneMethod = c("cvLoglik", "cvMisclass", "cvBrier", "cvDevPct", "aic", "bic"), printProgress = TRUE, warn = TRUE, ... )

`x` |
Covariate matrix. |

`y` |
Response variable. Can be a factor, ordered factor, or a matrix where each row is a multinomial vector of counts. A weighted fit can be obtained using the matrix option, since the row sums are essentially observation weights. Non-integer matrix entries are allowed. |

`lambdaVals` |
An optional user-specified lambda sequence (vector). If |

`folds` |
An optional list, where each element is a vector of row indices
corresponding to a different cross validation fold. Indices correspond to rows
of the |

`nFolds` |
Numer of cross validation folds. Only used if |

`nFoldsCV` |
Number of cross validation folds used to tune lambda for each
training set (i.e. within each training fold). Only used of |

`tuneMethod` |
Method used to tune lambda for each training set (ie. within
each training fold). The "cvLoglik", "cvMisclass", "cvBrier", and "cvDevPct"
methods use cross validation with |

`printProgress` |
Logical. If |

`warn` |
Logical. If |

`...` |
Other arguments (besides |

The fold partition splits can be passed by the user via the

`folds`

argument. By default, the data are randomly divided into equally-sized partitions. Note that if lambda is tuned by cross validation, the fold splits are determined randomly and cannot be specified by the user. The`set.seed`

function should be called prior to`ordinalNetCV`

for reproducibility.A sequence of lambda values can be passed by the user via the

`lambdaVals`

argument. By default, the sequence is generated by first fitting the model to the full data set (this sequence is determined by the`nLambda`

and`lambdaMinRatio`

arguments of`ordinalNet`

).The

`standardize`

argument of`ordinalNet`

can be modified through the additional arguments (...). If`standardize=TRUE`

, then the data are scaled within each cross validation fold. If`standardize=TRUE`

and lambda is tuned by cross validation, then the data are also scaled within each tuning sub-fold. This is done because scaling is part of the statistical procedure and should be repeated each time the procedure is applied.

An S3 object of class "ordinalNetCV", which contains the following:

- loglik
Vector of out-of-sample log-likelihood values. Each value corresponds to a different fold.

- misclass
Vector of out-of-sample misclassificaton rates. Each value corresponds to a different fold.

- brier
Vector of out-of-sample Brier scores. Each value corresponds to a different fold.

- devPct
Vector of out-of-sample percentages of deviance explained. Each value corresponds to a different fold.

- bestLambdaIndex
The index of the value within the lambda sequence selected for each fold by the tuning method.

- lambdaVals
The sequence of lambda values used for all cross validation folds.

- folds
A list containing the index numbers of each fold.

- fit
An object of class "ordinalNet", resulting from fitting

`ordinalNet`

to the entire dataset.

## Not run: # Simulate x as independent standard normal # Simulate y|x from a parallel cumulative logit (proportional odds) model set.seed(1) n <- 50 intercepts <- c(-1, 1) beta <- c(1, 1, 0, 0, 0) ncat <- length(intercepts) + 1 # number of response categories p <- length(beta) # number of covariates x <- matrix(rnorm(n*p), ncol=p) # n x p covariate matrix eta <- c(x %*% beta) + matrix(intercepts, nrow=n, ncol=ncat-1, byrow=TRUE) invlogit <- function(x) 1 / (1+exp(-x)) cumprob <- t(apply(eta, 1, invlogit)) prob <- cbind(cumprob, 1) - cbind(0, cumprob) yint <- apply(prob, 1, function(p) sample(1:ncat, size=1, prob=p)) y <- as.factor(yint) # Evaluate out-of-sample performance of the cumulative logit model # when lambda is tuned by cross validation (best average out-of-sample log-likelihood) cv <- ordinalNetCV(x, y, tuneMethod="cvLoglik") summary(cv) ## End(Not run)

Embedding an R snippet on your website

Add the following code to your website.

For more information on customizing the embed code, read Embedding Snippets.