cut_tv: cut_tv - Cuts a survival dataset on a time varying variable

Description Usage Arguments Details Examples

View source: R/cohort.R

Description

Survival datasets often have time-varying covariates that need to be dealt with. For example a drug exposure may occur after the entry into the cohort and you are interested in how this might affect your outcome.

Usage

1
2
cut_tv(.data, entry, exit, cut_var, tv_name, cores = 1, id_var,
  on_existing = c("flip", "increment"))

Arguments

.data

a dataframe

entry

name of the column in .data that defines entry time to cohort. Column must be numeric.

exit

name of the column in .data that defines exit time from cohort. Column must be numeric

cut_var

name of the column in .data that defines the time of the time-varying covariate event. Column must be numeric.

tv_name

name for the constructed time-varying covariate

cores

number of mc.cores to use.

id_var

name of the variable identifying individual cases

on_existing

see details for cutting behaviour

Details

This function cuts up a dataset based on times supplied for the time-varying covariate. If there is already a variable for the time-varying covariate, you can chose to flip the existing values or increment them. This means the function can be called multiple times to, e.g. deal with drugs starting and stopping and also to deal with progression of treatment.

The function is faster than other cutting methods, does not require conversion to Lexis format, and can be parallelised for large datasets and chained with dply workflows. Arguments should not be quoted.

This function can deal with the following scenarios (see examples):

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
# A simple example dataset to be cut
tv_test <- data.frame(id = 1:5, start = rep(0, 5), end = c(1000, 689, 1000, 874, 777),
                      event = c(0,1,0,1,1), drug_1 = c(NA, NA, NA, 340, 460),
                      drug_2 = c(NA, 234, 554, 123, NA),
                      drug_3_start = c(110, 110,111, 109, 110),
                      drug_3_stop = c(400, 400, 400, 400, 400),
                      stage_1 = c(300, NA, NA, NA, NA),
                      stage_2 = c(450, NA, NA, NA, NA))

# Binary chronic covariates:
tv_out1 <- cut_tv(tv_test, start, end, drug_1, id_var = id, drug_1_state)
tv_out1 <- cut_tv(tv_out1, start, end, drug_2, id_var = id, drug_2_state)
# Binary covariates:
tv_out3 <- cut_tv(tv_test, start, end, drug_3_start, id_var = id, drug_3_state)
tv_out3 <- cut_tv(tv_out3, start, end, drug_3_stop, id_var = id, drug_3_state)
# incremental covariates:
inc_1 <- cut_tv(tv_test, start, end, stage_1, id_var = id, disease_stage, on_existing = "inc")
inc_1 <- cut_tv(inc_1, start, end, stage_2, id_var = id, disease_stage, on_existing = "inc")
# Chaining combinations of the above
## Not run: 
library(dplyr)
tv_all <- tv_test %>%
          cut_tv(start, end, drug_1, id_var = id, drug_1_state) %>%
          cut_tv(start, end, drug_2, id_var = id, drug_2_state) %>%
          cut_tv(start, end, drug_3_start, id_var = id, drug_3_state) %>%
          cut_tv(start, end, drug_3_stop, id_var = id, drug_3_state) %>%
          cut_tv(start, end, stage_1, id_var = id, disease_stage, on_existing = "inc") %>%
          cut_tv(start, end, stage_2, id_var = id, disease_stage, on_existing = "inc")

## End(Not run)

rOpenHealth/rEHR documentation built on May 26, 2019, 8:51 p.m.