bsFolds: Bootstrapping folds

Description

Take data produced by input and amend it with (optionally) stratified folds determined through bootstrapping.

Usage

bsFolds(data, nfolds = 10L, stratify = FALSE)

Arguments

data

the data to use. The structure returned by input.

nfolds

the number of folds. Defaults to 10.

stratify

whether to stratify the folds. Really only makes sense for classification models. Defaults to FALSE.

Details

Partitions the data set into folds. Stratification, if requested, is based on the best algorithm, i.e. the one with the best performance, so that the distribution of the best algorithms is approximately the same in each fold. For each fold, the training index set is assembled through .632 bootstrap, and the remaining indices are used for testing. There is no guarantee on the size of either set. The sets of indices are added to the original data set and returned.

If the data set has train and test partitions already, they are overwritten.
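
The bootstrap split described above can be sketched in base R. This is only an illustration under stated assumptions, not the package's implementation: bootstrapSplit is a hypothetical helper, the labels vector stands in for the per-instance best algorithm, and the sketch assumes that the training set keeps the duplicates produced by sampling with replacement.

```r
# Hypothetical helper, not part of llama: one bootstrap train/test split
# over the indices 1..n, optionally stratified by a label vector.
bootstrapSplit <- function(n, strata = NULL) {
  idx <- seq_len(n)
  if (is.null(strata)) {
    # draw n indices with replacement; duplicates are expected
    train <- idx[sample.int(n, n, replace = TRUE)]
  } else {
    # resample within each stratum so that label proportions are preserved
    train <- unlist(lapply(split(idx, strata), function(g) {
      g[sample.int(length(g), length(g), replace = TRUE)]
    }), use.names = FALSE)
  }
  # every index that was never drawn goes into the test set
  list(train = train, test = setdiff(idx, train))
}

set.seed(42)
n <- 200L
# made-up labels standing in for the best algorithm of each instance
labels <- rep(c("a", "b"), times = c(150L, 50L))

# ten independent stratified bootstrap folds
folds <- lapply(seq_len(10L), function(i) bootstrapSplit(n, labels))

# share of distinct observations in each training set
distinct <- vapply(folds, function(f) length(unique(f$train)) / n, numeric(1))
```

Sampling n indices with replacement leaves roughly 63.2% of the observations in the training set on average, which is where the name of the method comes from. The exact share varies from fold to fold, which is why the sizes of the sets of distinct training indices and of the test indices are not fixed.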

Value

train

a list of index sets for training.

test

a list of index sets for testing.

...

the original members of data. See input.
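
As a rough illustration of the returned structure, the following sketch inspects the train and test members. It assumes that the llama package is installed and reuses the satsolvers data from the examples; the choice of 5 folds is arbitrary.

```r
library(llama)
data(satsolvers)

folds <- bsFolds(satsolvers, nfolds = 5L)

# one index set per fold in each list
length(folds$train)
length(folds$test)

# first few training indices of the first fold
head(folds$train[[1]])

# overlap between the training and test indices of the first fold
length(intersect(folds$train[[1]], folds$test[[1]]))
```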

Author(s)

Lars Kotthoff

See Also

cvFolds, trainTest

Examples

data(satsolvers)
folds = bsFolds(satsolvers)

# use 5 folds instead of the default 10
folds5 = bsFolds(satsolvers, 5L)

# stratify
foldsU = bsFolds(satsolvers, stratify=TRUE)

llama documentation built on March 17, 2021, 1:06 a.m.