CreateUserPartition: Create a user-defined S3 object of class partition for the...

Description Usage Arguments Details Value

Description

Creates a list object used by the SetTarget function to specify either Training/Validation/Holdout (validationType = 'TVH') or cross-validation (validationType = 'CV') partitions of the modeling dataset based on the values included in a column from the dataset. In either case, the name of this data column must be specified (as userPartitionCol).

Usage

1
2
CreateUserPartition(validationType, userPartitionCol, cvHoldoutLevel = NULL,
  trainingLevel = NULL, holdoutLevel = NULL, validationLevel = NULL)

Arguments

validationType

Character string specifying the type of partition generated, either 'TVH' or 'CV'.

userPartitionCol

Character string naming the data column from the modeling dataset containing the subset designations.

cvHoldoutLevel

Data value from userPartitionCol that identifies the holdout subset under the 'CV' option.

trainingLevel

Data value from userPartitionCol that identifies the training subset under the 'TVH' option.

holdoutLevel

Data value from userPartitionCol that identifies the holdout subset under both 'TVH' and 'CV' options. To specify that the project should not use a holdout you can omit this parameter or pass NA directly.

validationLevel

Data value from userPartitionCol that identifies the validation subset under the 'TVH' option.

Details

For the 'TVH' option of cvMethod, no cross-validation is used. Users must specify the trainingLevel and validationLevel; use of a holdoutLevel is always recommended but not required. If no holdoutLevel is used, then the column must contain exactly 2 unique values. If a holdoutLevel is used, the column must contain exactly 3 unique values.

For the 'CV' option, each value in the column will be used to separate rows into cross-validation folds. Use of a holdoutLevel is optional; if not specified, then no holdout is used.

This function is one of several convenience functions provided to simplify the task of starting modeling projects with custom partitioning options. The other functions are CreateGroupPartition, CreateRandomPartition, and CreateStratifiedPartition.

Value

An S3 object of class 'partition' including the parameters required by the SetTarget function to generate a user-specified of the modeling dataset.



Search within the datarobot package
Search all R packages, documentation and source code

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.