cureem | R Documentation |
Fits penalized parametric and semi-parametric mixture cure models (MCM) using the E-M algorithm with user-specified penalty parameters. The lasso (L1), MCP, and SCAD penalty are supported for the Cox MCM while only lasso is currently supported for parametric MCMs.
cureem(
formula,
data,
subset,
x_latency = NULL,
model = c("cox", "weibull", "exponential"),
penalty = c("lasso", "MCP", "SCAD"),
penalty_factor_inc = NULL,
penalty_factor_lat = NULL,
thresh = 0.001,
scale = TRUE,
maxit = NULL,
inits = NULL,
lambda_inc = 0.1,
lambda_lat = 0.1,
gamma_inc = 3,
gamma_lat = 3,
na.action = na.omit,
...
)
formula |
an object of class " |
data |
a data.frame in which to interpret the variables named in the
|
subset |
an optional expression indicating which subset of observations to be used in the fitting process, either a numeric or factor variable should be used in subset, not a character variable. All observations are included by default. |
x_latency |
specifies the variables to be included in the latency
portion of the model and can be either a matrix of predictors, a model
formula with the right hand side specifying the latency variables, or the
same data.frame passed to the |
model |
type of regression model to use for the latency portion of mixture cure model. Can be "cox", "weibull", or "exponential" (default is "cox"). |
penalty |
type of penalty function. Can be "lasso", "MCP", or "SCAD" (default is "lasso"). |
penalty_factor_inc |
vector of binary indicators representing the penalty to apply to each incidence coefficient: 0 implies no shrinkage and 1 implies shrinkage. If not supplied, 1 is applied to all incidence variables. |
penalty_factor_lat |
vector of binary indicators representing the penalty to apply to each latency coefficient: 0 implies no shrinkage and 1 implies shrinkage. If not supplied, 1 is applied to all latency variables. |
thresh |
small numeric value. The iterative process stops when the differences between successive expected penalized complete-data log-likelihoods for both incidence and latency components are less than this specified level of tolerance (default is 10^-3). |
scale |
logical, if TRUE the predictors are centered and scaled. |
maxit |
integer specifying the maximum number of passes over the data
for each lambda. If not specified, 100 is applied when
|
inits |
an optional list specifiying the initial values. This includes:
Penalized coefficients are initialized to zero. If |
lambda_inc |
numeric value for the penalization parameter |
lambda_lat |
numeric value for the penalization parameter |
gamma_inc |
numeric value for the penalization parameter |
gamma_lat |
numeric value for the penalization parameter |
na.action |
this function requires complete data so |
... |
additional arguments. |
b_path |
Matrix representing the solution path of the coefficients in the incidence portion of the model. Row is step and column is variable. |
beta_path |
Matrix representing the solution path of the coefficients in the latency portion of the model. Row is step and column is variable. |
b0_path |
Vector representing the solution path of the intercept in the incidence portion of the model. |
logLik_inc |
Vector representing the expected penalized complete-data log-likelihood for the incidence portion of the model for each step in the solution path. |
logLik_lat |
Vector representing the expected penalized complete-data log-likelihood for the latency portion of the model for each step in the solution path. |
x_incidence |
Matrix representing the design matrix of the incidence predictors. |
x_latency |
Matrix representing the design matrix of the latency predictors. |
y |
Vector representing the survival object response as
returned by the |
model |
Character string indicating the type of regression model used for the latency portion of mixture cure model ("weibull" or "exponential"). |
scale |
Logical value indicating whether the predictors were centered and scaled. |
method |
Character string indicating the EM algorithm was used in fitting the mixture cure model. |
rate_path |
Vector representing the solution path of the rate parameter for the Weibull or exponential density in the latency portion of the model. |
alpha_path |
Vector representing the solution path of the shape parameter for the Weibull density in the latency portion of the model. |
call |
the matched call. |
Archer, K. J., Fu, H., Mrozek, K., Nicolet, D., Mims, A. S., Uy, G. L., Stock, W., Byrd, J. C., Hiddemann, W., Braess, J., Spiekermann, K., Metzeler, K. H., Herold, T., Eisfeld, A.-K. (2024) Identifying long-term survivors and those at higher or lower risk of relapse among patients with cytogenetically normal acute myeloid leukemia using a high-dimensional mixture cure model. Journal of Hematology & Oncology, 17:28.
cv_cureem
library(survival)
withr::local_seed(1234)
temp <- generate_cure_data(n = 80, j = 100, n_true = 10, a = 1.8)
training <- temp$training
fit <- cureem(Surv(Time, Censor) ~ .,
data = training, x_latency = training,
model = "cox", penalty = "lasso", lambda_inc = 0.1,
lambda_lat = 0.1, gamma_inc = 6, gamma_lat = 10
)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.