View source: R/create_data_split.R
create_data_split | R Documentation |
Create a calibration (training) / validation data split for PLSR model fitting and testing
create_data_split(
dataset = NULL,
approach = NULL,
split_seed = 123456789,
prop = 0.8,
group_variables = NULL
)
dataset |
input full PLSR dataset to split into cal/val datasets |
approach |
approach to splitting the dataset. Options: base or dplyr |
split_seed |
random seed to use for splitting data |
prop |
the proportion of data to preserve for calibration (e.g. 0.8) and validation (0.2). This sets the calibration proportion |
group_variables |
Use factor variables to conduct a stratified sampling for cal/val |
output_list A list containing the calibration dataset (cal_data) and validation dataset (val_data)
Julien Lamour, Jeremiah Anderson, Shawn P. Serbin
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.