resample_df: resampling
In metrumresearchgroup/PKPDmisc: Pharmacokinetic and Pharmacodynamic Data Management Functions

resample_df

R Documentation

resampling

Description

resampling

Usage

resample_df(
  df,
  key_cols,
  strat_cols = NULL,
  n = NULL,
  key_col_name = "KEY",
  replace = TRUE
)

Arguments

`df`	data frame
`key_cols`	key columns to resample on
`strat_cols`	columns to maintain proportion for stratification
`n`	number of unique sampled keys, defaults to match dataset
`key_col_name`	name of outputted key column. Default to "KEY"
`replace`	whether to stratify with replacement

Details

This function is valuable when generating a large simulated population where you goal is to create resampled sub-populations in addition to being able to maintain certain stratifications of factors like covariate distributions

A new keyed column will be created (defaults to name 'KEY') that contains the uniquely created new samples. This allows one to easily compare against the key'd columns. Eg, if you would like to see how many times a particular individual was resampled you can check the original ID column against the number of key's associated with that ID number.

Examples

library(PKPDmisc)
library(dplyr, quiet = TRUE)

# simple example resampling by ID maintaining Gender distribution, with 10 individuals
resample_df(sd_oral_richpk, key_cols = "ID", strat_cols = "Gender", n = 10)

# for a more complex example lets resample "simulated" data with multiple replicates
subset_data <- sd_oral_richpk %>%
   filter(ID < 20)
   
# make 'simulated' data with 5 replicates and combine to single dataframe
rep_dat <- lapply(1:5, function(x) {
subset_data %>% 
  mutate(REP = x)
  }) %>% bind_rows()

# now when we resample we also want to maintain the ID+REP relationship as resampling
# just the ID would give all rows associated for an ID with all reps, rather than 
# a single "unit" of ID/REP
resample_df(rep_dat, key_cols = c("ID", "REP"))

# check to see that stratification is maintained
rep_dat %>% group_by(Gender) %>% tally
resample_df(rep_dat, key_cols=c("ID", "REP"), strat_cols="Gender") %>%
  group_by(Gender) %>% tally
  
rep_dat %>% group_by(Gender, Race) %>% tally

resample_df(rep_dat, key_cols=c("ID", "REP"), strat_cols=c("Gender", "Race")) %>%
  group_by(Gender, Race) %>% tally

metrumresearchgroup/PKPDmisc documentation built on Oct. 21, 2022, 9:41 a.m.

metrumresearchgroup/PKPDmisc index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

metrumresearchgroup/PKPDmisc
Pharmacokinetic and Pharmacodynamic Data Management Functions

resample_df: resampling
In metrumresearchgroup/PKPDmisc: Pharmacokinetic and Pharmacodynamic Data Management Functions

resampling

Description

Usage

Arguments

Details

Examples

Related to resample_df in metrumresearchgroup/PKPDmisc...

R Package Documentation

Browse R Packages

We want your feedback!

metrumresearchgroup/PKPDmisc Pharmacokinetic and Pharmacodynamic Data Management Functions

resample_df: resampling In metrumresearchgroup/PKPDmisc: Pharmacokinetic and Pharmacodynamic Data Management Functions

resampling

Description

Usage

Arguments

Details

Examples

Related to resample_df in metrumresearchgroup/PKPDmisc...

R Package Documentation

Browse R Packages

We want your feedback!

metrumresearchgroup/PKPDmisc
Pharmacokinetic and Pharmacodynamic Data Management Functions

resample_df: resampling
In metrumresearchgroup/PKPDmisc: Pharmacokinetic and Pharmacodynamic Data Management Functions