GroupedShuffler: GroupedShuffler class for applying 'shuffling' transform with...

GroupedShuffler R Documentation

GroupedShuffler class for applying 'shuffling' transform with data aggregated

Description

Convert self to a list.

Character representation of the class

Arguments

limit

Minimum number of rows required to shuffle data

data

A data frame to be manipulated

...

Vector of variables in 'data' to transform.

Details

'Shuffling' refers to a random sampling of a variable without replacement, e.g. [A, B, C] becoming [B, A, C] but not [A, A, B]. 'Grouped shuffling' refers to aggregating the data by another feature before applying the shuffling process. Grouped shuffling preserves aggregate-level metrics within each group (e.g. mean, median, mode) but removes ordinal properties, i.e. correlations and auto-correlations.
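The idea can be illustrated with a minimal base R sketch. This is not the deident implementation: `grouped_shuffle` and its `col` and `group` parameters are hypothetical names chosen for illustration, and leaving groups with fewer than `limit` rows unchanged is an assumption about how `limit` might be handled.

```r
# Hypothetical helper (not part of deident): permute `col` within each
# level of `group`, sampling without replacement.
grouped_shuffle <- function(data, col, group, limit = 0) {
  # Row indices belonging to each group
  idx_by_group <- split(seq_len(nrow(data)), data[[group]])
  for (idx in idx_by_group) {
    # Assumption: groups with fewer than `limit` rows are left untouched.
    # Single-row groups are skipped because there is nothing to permute.
    if (length(idx) >= limit && length(idx) > 1) {
      data[[col]][idx] <- data[[col]][sample(idx)]
    }
  }
  data
}

set.seed(1)
df <- data.frame(
  team = c("a", "a", "a", "b", "b", "b"),
  pay  = c(10, 20, 30, 100, 200, 300)
)
shuffled <- grouped_shuffle(df, col = "pay", group = "team")

# Per-group means are identical before and after shuffling
tapply(df$pay, df$team, mean)
tapply(shuffled$pay, shuffled$team, mean)
```

Because values only move between rows of the same group, every per-group summary (mean, median, mode) is preserved, while the link between an individual row and its value is broken.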

Fields

group_on

Symbolic representation of grouping variables

limit

Minimum number of rows required to shuffle data

Create new GroupedShuffler object


deident documentation built on April 3, 2025, 6:14 p.m.