survivalControl: Control for Fitting Piecewise Constant Hazard Mixed Models as...

View source: R/control_options.R

survivalControlR Documentation

Control for Fitting Piecewise Constant Hazard Mixed Models as an Approximation to Fitting Cox Proportional Hazards Mixed Models

Description

Constructs the control structure for additional parameters needed to properly fit survival data using a piecewise constant hazard mixed model

Usage

survivalControl(
  cut_num = 8,
  interval_type = c("equal", "manual", "group"),
  cut_points = NULL,
  time_scale = 1
)

Arguments

cut_num

positive integer specifying the number of time intervals to include in the piecewise constant hazard model assumptions for the sampling step. Default is 8. General recommendation: use between 5 and 10 intervals. See the Details section for additional information.

interval_type

character specifying how the time intervals are calculated. Options include 'equal' (default), 'manual', or 'group'. If 'equal' (default), time intervals are calculated such that there are approximately equal numbers of events per time interval. If 'manual', the user needs to input their own cut points (see cut_points for details). If 'group', time intervals are calculated such that there are approximately equal numbers of events per time interval AND there are at least 4 events observed by each group within each time interval. The input number of time intervals cut_num may be modified to a lower number in order to accomplish this goal. This option is generally difficult to perform when there are a large number of groups in the data.

cut_points

numeric vector specifying the value of the cut points to use in the calculation of the time intervals for the piecewise constant hazard model. If interval_type set to 'equal' or 'group', then this argument is not utilized. If interval_type set to 'manual', then this argument is required. First value must be 0, and all values must be ordered smallest to largest.

time_scale

positive numeric value (greater than 1) used to scale the time variable in the survival data. In order to calculate the piecewise constant hazard mixed model, the log of the time a subject survived within a particular time interval is used as an offset in the model fit. Sometimes multiplying the time scale by a factor greater than 1 improves the stability of the model fit algorithm.

Details

In the piecewise constant hazard model, there is an assumption that the time line of the data can be cut into cut_num time intervals and the baseline hazard is constant within each of these time intervals. In the fit algorithm, we estimate these baseline hazard values (specifically, we estimate the log of the baseline hazard values). By default, we determine cut points by specifying the total number of cuts to make (cut_num) and then specifying time values for cut points such that each time interval has an approximately equal number of events (interval_type = equal). The authors of this package have found simulations to work well using this default interval_type = equal, but if desired, users can further specify that each group has at least some (4) events observed within each time interval. Regardless of the interval_type choice, users should be aware that having too many cut points could result in having too few events for each time interval needed for a stable estimation of the baseline hazard estimates. Additionally, data with few events could result in too few events per time interval even for a small number of cut points. Alternatively, having too few cut points could result in a sub-par approximation of the baseline hazard, which can lead to biased estimation for the coefficients corresponding to the input variables of interest. We generally recommend having 8 total time intervals (more broadly, between 5 and 10 total time intervals). Warnings or errors will occur for cases when there are 1 or 0 events for a time interval. If this happens, either adjust the cut_num value appropriately, or in the case when the data simply has a very small number of events, consider not using this software for your estimation purposes.

Value

Function returns a list inheriting from class survivalControl containing fit and optimization criteria values used in optimization routine.


hheiling/glmmPen documentation built on Jan. 15, 2024, 11:47 p.m.