NCC: Simulation and Analysis of Platform Trials with Non-Concurrent Controls

Documented in timemachine_cont

#' Time machine analysis for continuous data
#'
#' @description This function performs analysis of continuous data using the Time Machine approach. It takes into account all data until the investigated arm leaves the trial. It is based on linear regression with treatment as a categorical variable and covariate adjustment for time via a second-order Bayesian normal dynamic linear model (separating the trial into buckets of pre-defined size).
#'
#' @param data Data frame with trial data, e.g. result from the `datasim_cont()` function. Must contain columns named 'treatment', 'response' and 'period'.
#' @param arm Integer. Index of the treatment arm under study to perform inference on (vector of length 1). This arm is compared to the control group.
#' @param alpha Double. Decision boundary (one-sided). Default=0.025.
#' @param prec_theta Double. Precision (\eqn{1/\sigma^2_{\theta}}) of the prior regarding the treatment effect \eqn{\theta}. I.e. \eqn{\theta \sim N(0, \sigma^2_{\theta})}. Default=0.001.
#' @param prec_eta Double. Precision (\eqn{1/\sigma^2_{\eta_0}}) of the prior regarding the control mean \eqn{\eta_0}. I.e. \eqn{\eta_0 \sim N(0, \sigma^2_{\eta_0})}. Default=0.001.
#' @param tau_a Double. Parameter \eqn{a_{\tau}} of the Gamma distribution for the precision parameter \eqn{\tau} in the model for the time trend. I.e., \eqn{\tau \sim Gamma(a_{\tau},b_{\tau})}. Default=0.1.
#' @param tau_b Double. Parameter \eqn{b_{\tau}} of the Gamma distribution for the precision parameter \eqn{\tau} in the model for the time trend. I.e., \eqn{\tau \sim Gamma(a_{\tau},b_{\tau})}. Default=0.01.
#' @param prec_a Double. Parameter \eqn{a_{\sigma^2}} of the Gamma distribution regarding the precision of the responses. I.e., \eqn{1/\sigma^2 \sim Gamma(a_{\sigma^2},b_{\sigma^2})}. Default=0.001.
#' @param prec_b Double. Parameter \eqn{b_{\sigma^2}} of the Gamma distribution regarding the precision of the responses. I.e., \eqn{1/\sigma^2 \sim Gamma(a_{\sigma^2},b_{\sigma^2})}. Default=0.001.
#' @param bucket_size Integer. Number of patients per time bucket. Default=25.
#' @param check Logical. Indicates whether the input parameters should be checked by the function. Default=TRUE, unless the function is called by a simulation function, where the default is FALSE.
#' @param ... Further arguments passed by wrapper functions when running simulations.
#'
#' @importFrom stats aggregate
#' @importFrom stats quantile
#' @importFrom stats var
#' @importFrom rjags jags.model
#' @importFrom rjags coda.samples
#'
#' @export
#'
#' @details
#'
#' The Time Machine divides the trial duration into \eqn{C} calendar time intervals of equal length ("buckets"), which are indexed backwards in time. That is to say, the most recent time interval is denoted by \eqn{c=1} and the time interval corresponding to the beginning of the trial by \eqn{c=C}.
#' The analysis is performed as soon as the analyzed treatment arm finishes in the trial.
#'
#' The model is defined as follows:
#'
#' \deqn{E(y_j) = \eta_0 + \theta_{k_j} + \alpha_{c_j}}
#'
#' where \eqn{y_j} is the continuous response for patient \eqn{j}.
#' The model intercept \eqn{\eta_0} denotes the response of the control group at time of the analysis, \eqn{\theta_{k_j}} is the effect of the treatment arm \eqn{k} that patient \eqn{j} was enrolled in, relative to control.
#' For the parameters \eqn{\eta_0} and \eqn{\theta_{k_j}}, normal prior distributions are assumed, with mean 0 and variances \eqn{\sigma^2_{\eta_0}} and \eqn{\sigma^2_{\theta}}, respectively:
#'
#' \deqn{\eta_0 \sim \mathcal{N}(0, \sigma^2_{\eta_0})}
#'
#' \deqn{\theta_{k_j} \sim \mathcal{N}(0, \sigma^2_{\theta})}
#'
#' In the Time Machine, time effect is represented by \eqn{\alpha_{c_j}}, which is the change in the response in time bucket \eqn{c_j} (which denotes the time bucket in which patient \eqn{j} is enrolled) compared to the most recent time bucket \eqn{c=1} and is modeled using a Bayesian second-order normal dynamic linear model.
#' This creates a smoothing over the control response, such that closer time buckets are modeled with more similar response rates:
#'
#' \deqn{\alpha_1 = 0}
#' \deqn{\alpha_2 \sim \mathcal{N}(0, 1/\tau)}
#' \deqn{\alpha_c \sim \mathcal{N}(2 \alpha_{c-1} - \alpha_{c-2}, 1/\tau), 3 \le c \le C}
#'
#' where \eqn{\tau} denotes the drift parameter that controls the degree of smoothing over the time buckets and is assumed to have a Gamma hyperprior distribution:
#'
#' \deqn{\tau \sim Gamma(a_{\tau}, b_{\tau})}
#'
#' The precision of the individual patient responses (\eqn{1/\sigma^2}) is also assumed to have a Gamma hyperprior distribution:
#'
#' \deqn{1/\sigma^2 \sim Gamma(a_{\sigma^2}, b_{\sigma^2})}
#'
#' @examples
#'
#' trial_data <- datasim_cont(num_arms = 3, n_arm = 100, d = c(0, 100, 250),
#' theta = rep(0.25, 3), lambda = rep(0.15, 4), sigma = 1, trend = "linear")
#'
#' timemachine_cont(data = trial_data, arm = 3)
#'
#'
#' @return List containing the following elements regarding the results of comparing `arm` to control:
#'
#' - `p-val` - posterior probability that the difference in means is less than zero
#' - `treat_effect` - posterior mean of difference in means
#' - `lower_ci` - lower limit of the (1-2*`alpha`)*100% credible interval for difference in means
#' - `upper_ci` - upper limit of the (1-2*`alpha`)*100% credible interval for difference in means
#' - `reject_h0` - indicator of whether the null hypothesis was rejected or not (`p_val` < `alpha`)
#'
#' @author Dominic Magirr, Peter Jacko

timemachine_cont <- function(data,
                             arm,
                             alpha = 0.025,
                             prec_theta = 0.001,
                             prec_eta = 0.001,
                             tau_a = 0.1,
                             tau_b = 0.01,
                             prec_a = 0.001,
                             prec_b = 0.001,
                             bucket_size = 25,
                             check = TRUE,...){

  if (check) {
    if (!is.data.frame(data) | sum(c("treatment", "response", "period") %in% colnames(data))!=3) {
      stop("The data frame with trial data must contain the columns 'treatment', 'response' and 'period'!")
    }

    if(!is.numeric(arm) | length(arm)!=1){
      stop("The evaluated treatment arm (`arm`) must be one number!")
    }

    if(!is.numeric(alpha) | length(alpha)!=1 | alpha>=1 | alpha<=0){
      stop("The significance level (`alpha`) must be one number between 0 and 1!")
    }

    if(!is.numeric(prec_theta) | length(prec_theta)!=1){
      stop("The precision of the prior regarding the treatment effect (`prec_theta`) must be one number!")
    }

    if(!is.numeric(prec_eta) | length(prec_eta)!=1){
      stop("The precision of the prior regarding the control response (`prec_eta`) must be one number!")
    }

    if(!is.numeric(tau_a) | length(tau_a)!=1){
      stop("The parameter `tau_a` must be one number!")
    }

    if(!is.numeric(tau_b) | length(tau_b)!=1){
      stop("The parameter `tau_b` must be one number!")
    }

    if(!is.numeric(prec_a) | length(prec_a)!=1){
      stop("The parameter `prec_a` must be one number!")
    }

    if(!is.numeric(prec_b) | length(prec_b)!=1){
      stop("The parameter `prec_b` must be one number!")
    }

    if(!is.numeric(bucket_size) | length(bucket_size)!=1){
      stop("The number of patients per time bucket (`bucket_size`) must be one number!")
    }
  }

  max_period <- max(data[data$treatment==arm,]$period)
  data <- data[data$period %in% c(1:max_period),]

  model_string_time_machine_cont <- "
  model {


    for (i in 1:n_trt_buckets){


       x_bar[i] ~ dnorm(mu[i], n[i] * prec)

       sigma_2[i] ~ dgamma((n[i] - 1) / 2, n[i] * prec / 2)

            ## alpha[1] is current interval, alpha[10] is most distant interval
            ## time j corresponds to alpha interval Int-(j-1)
            ## e.g. time 1 corresponds to alpha interval 10-(1-1)=10 if interim #10
            ## e.g. time 2 corresponds to alpha interval 7-(2-1)=6 if interim #7

       mu[i] = gamma0 + alpha[n_buckets - (time_bucket[i]-1)] + delta[trt[i]]

    }


    ## Priors for time, counting backwards from current time interval (k=1 for current, k=10 for most distant)
    alpha[1] = 0
    alpha[2] ~ dnorm(0, tau)
    for(k in 3:n_buckets) {
        alpha[k] ~ dnorm(2*alpha[k-1] - alpha[k-2], tau)  # 2*(most recent) - (more distant) puts trend in right direction
    }

    ## Other priors
    delta[1] <- 0
    for(i in 2:n_trts){
      delta[i] ~ dnorm(0, prec_theta)
    }

    tau ~  dgamma(tau_a, tau_b)

    ### Intercept
    gamma0 ~ dnorm(0, prec_eta)

    ### Precision (corresponding to random error term)
    prec ~ dgamma(prec_a, prec_b)


  }
"

  ## repeat this code to take account of awkward
  ## case where n == 1
  model_string_time_machine_cont_1 <- "
  model {


    for (i in 1:n_trt_buckets){


       x_bar[i] ~ dnorm(mu[i], n[i] * prec)

       sigma_2[i] ~ dgamma((n[i] - 1) / 2, n[i] * prec / 2)

            ## alpha[1] is current interval, alpha[10] is most distant interval
            ## time j corresponds to alpha interval Int-(j-1)
            ## e.g. time 1 corresponds to alpha interval 10-(1-1)=10 if interim #10
            ## e.g. time 2 corresponds to alpha interval 7-(2-1)=6 if interim #7

       mu[i] = gamma0 + alpha[n_buckets - (time_bucket[i]-1)] + delta[trt[i]]

    }

    for (i in 1:n_trt_buckets_1){


       x_bar_1[i] ~ dnorm(mu_1[i], prec)

            ## alpha[1] is current interval, alpha[10] is most distant interval
            ## time j corresponds to alpha interval Int-(j-1)
            ## e.g. time 1 corresponds to alpha interval 10-(1-1)=10 if interim #10
            ## e.g. time 2 corresponds to alpha interval 7-(2-1)=6 if interim #7

       mu_1[i] = gamma0 + alpha[n_buckets - (time_bucket_1[i]-1)] + delta[trt_1[i]]

    }

    ## Priors for time, counting backwards from current time interval (k=1 for current, k=10 for most distant)
    alpha[1] = 0
    alpha[2] ~ dnorm(0, tau)
    for(k in 3:n_buckets) {
        alpha[k] ~ dnorm(2*alpha[k-1] - alpha[k-2], tau)  # 2*(most recent) - (more distant) puts trend in right direction
    }

    ## Other priors
    delta[1] <- 0
    for(i in 2:n_trts){
      delta[i] ~ dnorm(0, prec_theta)
    }

    tau ~  dgamma(tau_a, tau_b)

    ### Intercept
    gamma0 ~ dnorm(0, prec_eta)

    ### Precision (corresponding to random error term)
    prec ~ dgamma(prec_a, prec_b)


  }
"

  data$time_bucket <- rep(c(1:ceiling((nrow(data)/bucket_size))), each=bucket_size)[1:nrow(data)]
  #data$time_bucket <- data$period

  trt_bucket_n <- aggregate(data$response,
                            list(treatment = data$treatment,
                                 time_bucket = data$time_bucket),
                            "length")

  trt_bucket_means <- aggregate(data$response,
                                list(treatment = data$treatment,
                                     time_bucket = data$time_bucket),
                                "mean")

  trt_bucket_vars <- aggregate(data$response,
                               list(treatment = data$treatment,
                                    time_bucket = data$time_bucket),
                               function(x) var(x) * (length(x) - 1) / length(x))


  n_trt_buckets <- dim(trt_bucket_means)[1]
  n_trts <- max(trt_bucket_means$treatment) + 1
  n_buckets <- max(trt_bucket_means$time_bucket)

  trt <- trt_bucket_means$treatment + 1 ## need index to start at 1, not 0.
  time_bucket <- trt_bucket_means$time_bucket

  x_bar <- trt_bucket_means$x
  sigma_2 <- trt_bucket_vars$x
  n <- trt_bucket_n$x

  ## check when n == 1
  size_1 <- which(n == 1)
  if (length(size_1) == 0){ ## don't split up data

    ### Arguments to pass to jags_model
    data_list = list(x_bar = x_bar,
                     sigma_2 = sigma_2,
                     n = n,
                     trt = trt,
                     time_bucket = time_bucket,
                     n_trts = n_trts,
                     n_buckets = n_buckets,
                     n_trt_buckets = n_trt_buckets,
                     prec_theta = prec_theta,
                     prec_eta = prec_eta,
                     tau_a = tau_a,
                     tau_b = tau_b,
                     prec_a = prec_a,
                     prec_b = prec_b)

    inits_list = list(gamma0 = 0)


    ### Fit the model
    jags_model <- jags.model(textConnection(model_string_time_machine_cont),
                             data = data_list,
                             inits = inits_list,
                             n.adapt = 1000,
                             n.chains = 3,
                             quiet = T)


  }
  else {

    ## seperate the cases where n == 1
    x_bar_1 <- x_bar[size_1]
    time_bucket_1 <- time_bucket[size_1]
    trt_1 <- trt[size_1]
    n_trt_buckets_1 <- length(x_bar_1)

    ## remove n == 1 cases
    x_bar <- x_bar[-size_1]
    time_bucket <- time_bucket[-size_1]
    trt <- trt[-size_1]
    n <- n[-size_1]
    n_trt_buckets <- length(x_bar)

    ### Arguments to pass to jags_model
    data_list = list(x_bar = x_bar,
                     x_bar_1 = x_bar_1,
                     sigma_2 = sigma_2,
                     n = n,
                     trt = trt,
                     trt_1 = trt_1,
                     time_bucket = time_bucket,
                     time_bucket_1 = time_bucket_1,
                     n_trts = n_trts,
                     n_buckets = n_buckets,
                     n_trt_buckets = n_trt_buckets,
                     n_trt_buckets_1 = n_trt_buckets_1,
                     prec_theta = prec_theta,
                     prec_eta = prec_eta,
                     tau_a = tau_a,
                     tau_b = tau_b,
                     prec_a = prec_a,
                     prec_b = prec_b)

    inits_list = list(gamma0 = 0)


    ### Fit the model
    jags_model <- jags.model(textConnection(model_string_time_machine_cont_1),
                             data = data_list,
                             inits = inits_list,
                             n.adapt = 1000,
                             n.chains = 3,
                             quiet = T)
  }
  ### Extract posterior samples
  mcmc_samples <- coda.samples(jags_model,
                               c("delta"),
                               n.iter = 40000)

  ### Arrange all samples together
  all_samples <- do.call(rbind.data.frame, mcmc_samples)

  ## posterior means
  post_means <- colMeans(all_samples)

  ## posterior credible intervals
  post_cis <- apply(all_samples, 2, quantile, probs = c(alpha, 1 - alpha))

  ## posterior "p-values"
  post_p <- apply(all_samples, 2, function(x) mean(x < 0))

  ## return
  list(p_val = unname(post_p[arm+1]),
       treat_effect = unname(post_means[arm+1]),
       lower_ci = post_cis[1,arm+1],
       upper_ci = post_cis[2,arm+1],
       reject_h0 = unname(post_p[arm+1] < alpha))
}
pavlakrotka/NCC documentation built on April 17, 2025, 3:11 a.m.
rdrr.io home R language documentation Run R code online
CRAN packages Bioconductor packages R-Forge packages GitHub packages
Note that we can't provide technical support on individual packages. You should contact the package authors for that.
pavlakrotka/NCC
Simulation and Analysis of Platform Trials with Non-Concurrent Controls

R/timemachine_cont.R
In pavlakrotka/NCC: Simulation and Analysis of Platform Trials with Non-Concurrent Controls

Defines functions timemachine_cont

Documented in timemachine_cont

R Package Documentation

Browse R Packages

We want your feedback!

pavlakrotka/NCC Simulation and Analysis of Platform Trials with Non-Concurrent Controls

R/timemachine_cont.R In pavlakrotka/NCC: Simulation and Analysis of Platform Trials with Non-Concurrent Controls

Defines functions timemachine_cont

Documented in timemachine_cont

R Package Documentation

Browse R Packages

We want your feedback!

pavlakrotka/NCC
Simulation and Analysis of Platform Trials with Non-Concurrent Controls

R/timemachine_cont.R
In pavlakrotka/NCC: Simulation and Analysis of Platform Trials with Non-Concurrent Controls