group_mc_cv: Group Monte Carlo Cross-Validation
In rsample: General Resampling Infrastructure

group_mc_cv

R Documentation

Group Monte Carlo Cross-Validation

Description

Group Monte Carlo cross-validation creates splits of the data based on some grouping variable (which may have more than a single row associated with it). One resample of Monte Carlo cross-validation takes a random sample (without replacement) of groups in the original data set to be used for analysis. All other data points are added to the assessment set. A common use of this kind of resampling is when you have repeated measures of the same subject.

Usage

group_mc_cv(
  data,
  group,
  prop = 3/4,
  times = 25,
  ...,
  strata = NULL,
  pool = 0.1
)

Arguments

`data`	A data frame.
`group`	A variable in `data` (single character or name) used for grouping observations with the same value to either the analysis or assessment set within a fold.
`prop`	The proportion of data to be retained for modeling/analysis.
`times`	The number of times to repeat the sampling.
`...`	These dots are for future extensions and must be empty.
`strata`	A variable in `data` (single character or name) used to conduct stratified sampling. When not `NULL`, each resample is created within the stratification variable. Numeric `strata` are binned into quartiles.
`pool`	A proportion of data used to determine if a particular group is too small and should be pooled into another group. We do not recommend decreasing this argument below its default of 0.1 because of the dangers of stratifying groups that are too small.

Value

A tibble with classes group_mc_cv, rset, tbl_df, tbl, and data.frame. The results include a column for the data split objects and an identification variable.

Examples


data(ames, package = "modeldata")

set.seed(123)
group_mc_cv(ames, group = Neighborhood, times = 5)

rsample documentation built on April 11, 2025, 5:54 p.m.

rsample index

Package overview README.md Common Resampling Patterns Introduction to rsample Working with resampling sets

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

rsample
General Resampling Infrastructure

group_mc_cv: Group Monte Carlo Cross-Validation
In rsample: General Resampling Infrastructure

Group Monte Carlo Cross-Validation

Description

Usage

Arguments

Value

Examples

Related to group_mc_cv in rsample...

R Package Documentation

Browse R Packages

We want your feedback!

rsample General Resampling Infrastructure

group_mc_cv: Group Monte Carlo Cross-Validation In rsample: General Resampling Infrastructure

Group Monte Carlo Cross-Validation

Description

Usage

Arguments

Value

Examples

Related to group_mc_cv in rsample...

R Package Documentation

Browse R Packages

We want your feedback!

rsample
General Resampling Infrastructure

group_mc_cv: Group Monte Carlo Cross-Validation
In rsample: General Resampling Infrastructure