Description Usage Arguments Details Value Author(s) References See Also Examples
Derives a meaningful and reliable individualized treatment regime based on the observed dataset from different subgroups with heterogeneity in optimal individualized treatment decision making. When patients are coming from the same group, it implements the classical Q learning and A learning algorithm.
1 2 3 4 5 6 | MPL(formula, data, subset, na.action, method = c("Q", "A"), bootstrap = FALSE,
control = MPL.control(...), model = TRUE, y = TRUE, a = TRUE,
g = TRUE, x.tau = TRUE, x.h = TRUE, x.pi = TRUE, random = FALSE, ...)
MPL.fit(y, x.tau, a, g=NULL, x.h=NULL, x.pi=NULL, method=c("Q", "A"),
bootstrap=FALSE, random=FALSE, control=MPL.control())
|
formula |
A symbolic description of the model to be fitted(of type y ~ x.tau | a, or y ~ x.tau | a | g, or y ~ x.tau | a | g | x.h, or y ~ x.tau | a | g | x.h | x.pi, or y ~ x.tau | a | g | | x.pi. Details are given in 'Details'). |
data |
An optional list or environment containing variables in |
subset, na.action |
Arguments controlling formula processing via |
method |
Method used for estimating the parameter in the groupwise contrast function. See 'Details'. |
bootstrap |
A logical value indicating whether bootstrap will be used. Default is |
control |
A list of control argument via |
model |
A logical value indicating whether model frame should be included as a component of the return value. |
y, a, g, x.tau, x.h, x.pi |
For For |
random |
A logical value indicating whether using a constant to fit the propensity score function
or not. In randomized studies, the propensity score is usually a constant function independent of baseline
covariates. When |
... |
Argument passed to |
A saline feature of data from clinical trials and medical studies is inhomogeneity. Patients not only differ in baseline characteristics, but also the way they respond to the treatment. Individualized treatment regimes are developed to select effective treatments based on patient's heterogeneity. Formally speaking, an individualized treatment regime (ITR) is a function that maps patients' baseline covariates to the space of available treatment options. The goal in precision medicine is to identify the optimal ITR to reach the best clinical outcomes.
However, the optimal ITR might also vary for patients across different subgroups. This function implements the maximin projection learning method that derives a meanful and reliable ITR for future patients based on the observed data from different populations with heterogeneity in optimal individualized decision making.
The means and covariance matrices of patients baseline covariates are allowed to vary across different subgroups.
MPL
will first standardize the groupwise baseline covariates to have zero mean and indentity covariance
matrix (based on Gram-Schmidt Orthonormalization) and then recommends an ITR for future groups of patients. Notice
that the resulting ITR cannot be directly applied to future patients. We need to standardize future patients
baseline covariates (based on the same procedure) first and apply the transformed covariates to the ITR. This is
implemented by the TR
function.
When the group indicator g
is omitted (or it is a constant vector) in the formula, MPL
assumes
all the patients are coming from the same group and implements the classical Q-learning and A-learning algorithm.
Otherwise, g
should be a numeric vector that has the same length of y
, indicating which group
each patient belongs to.
When x.h
is omitted and the baseline h.est
in MPL.control
is not specified, MPL
sets x.h=x.tau
. When x.pi
is omitted, the propensity score pi.est
in MPL.control
is
not specified, and random=FALSE
, MPL
sets x.pi=x.tau
.
Q-learning fits the entire Q function (the conditional mean of response given baseline covariates and treatment)
to derive the optimal ITR. A-learning is a more robust method that focuses directly on the contrast function (the
difference between two Q functions). It requires to specify both the baseline and the propensity score function
and the resulting estimator for the contrast function is consistent when either of the function is correctly
specified. This is referred to as the doubly robustness property of A-learning. MPL
uses Q-learning or
A-learning to estimate the groupwise contrast function that shares the same marginal treatment effects across
different subgroups. These estimators are further used to derive a ITR for future groups of patients. By default,
method="A"
and A-learning is implemented.
Inference for the maximin effects and the parameters in the groupwise contrast functions are conducted based on
bootstrap. By default, bootstrap=FALSE
and Bootstrap will not be conducted.
Theta.tau.est |
An (p_1+1)\times G matrix containing estimated parameters in the groupwise contrast function. Here
p_1 is the dimension of |
Theta.h.est |
An (p_2+1)\times G matrix containing estimated parameters in the groupwise baseline function. Here
p_2 is the dimension of |
Theta.pi.est |
An (p_3+1)\times G matrix containing estimated parameters in the groupwise propensity score function.
Here p_3 is the dimension of |
h.est |
Estimated baseline function. |
pi.est |
Estimated propensity score function. |
B |
An p_1\times G matrix containing estimated parameters in the groupwise contast function. Here p_1 is the dimension
of |
c0 |
The common marginal treatment effect shared by all subgroups. It can be used as the input of the function
|
beta.est |
The estimated maximin effects used to contruct ITR for future patients. |
Theta.tau.boot |
An (p_1+1)\times G\times B_0 array containing bootstrap samples for the estimated parameters in the groupwise
contrast function. Here p_1 is the dimension of |
Theta.h.boot |
An (p_2+1)\times G\times B_0 array containing bootstrap samples for the estimated parameters in the groupwise
baseline function. Here p_2 is the dimension of |
Theta.pi.boot |
An (p_3+1)\times G\times B_0 array containing bootstrap samples for the estimated parameters in the groupwise
propensity score function. Here p_3 is the dimension of |
beta.boot |
An p_1\times B_0 matrix containing bootstrap sample for the estimated maximin effects. Here p_1 is the dimension
of |
standardize |
A logical value indcating whether future patients covariates should be standardized first to be applied
to the ITR constructed by the maximin effects. |
model |
The full model frame (if |
y |
Response vector (if |
x.tau |
Covariates used to model the contrast function (if |
a |
Treatment vector (if |
g |
Group Indicator (if |
x.h |
Covariates used to model the baseline function (if |
x.pi |
Covariates used to model the propensity score function (if |
Chengchun Shi
Shi, C., Song, R., Lu, W., and Fu, B. (2018). Maximin Projection Learning for Optimal Treatment Decision with Heterogeneous Individualized Treatment Effects. Journal of the Royal Statistical Society, Series B, 80: 681-702.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 | set.seed(12345)
X <- matrix(rnorm(1600), 800, 2)
A <- rbinom(800, 1, 0.5)
h <- 1+sin(0.5*pi*X[,1]+0.5*pi*X[,2])
tau <- rep(0, 800)
B <- matrix(0, 2, 4)
B[,1] <- c(2,0)
B[,2] <- 2*c(cos(15*pi/180), sin(15*pi/180))
B[,3] <- 2*c(cos(70*pi/180), sin(70*pi/180))
B[,4] <- c(0,2)
for (g in 1:4){
tau[((g-1)*200+1):(g*200)] <- X[((g-1)*200+1):(g*200),]%*%B[,g]
}
## mean and scale of the subgroup covariates are allowed to be different
X[1:200,1] <- X[1:200,1]+1
X[201:400,2] <- 2*X[201:400,2]-1
X[601:800,] <- X[601:800,]/2
Y <- h+A*tau+0.5*rnorm(800)
G <- c(rep(1,200), rep(2,200), rep(3,200), rep(4,200))
## Q-learning
result <- MPL(Y~X|A|G, method="Q")
## A-learning
result <- MPL(Y~X|A|G)
## treating as homogeneous
result <- MPL(Y~X|A)
result <- MPL(Y~X|A|G, bootstrap=TRUE)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.