Description Usage Arguments Value Author(s) References See Also Examples
This is the Plearning to estimate optimal multistage DTR. It implements improved Olearning to estimate optimal treatment rules for each stage backwardly. And it also borrows idea from Q-learning to utilize the estimated optimal outcomes for later stages.
1 2 |
X |
is either a matrix shared among all stages; or list of feature matrices, where feature matrices from different stages can have different dimensions. |
AA |
a list of K, each element |
RR |
a list of K, each element |
n |
sample size |
K |
number of stages |
pi |
a list of K, the i'th element is the randomization probability at stage i |
pentype |
the type of regression used to take residual, 'lasso' is the default, using lasso regression; 'LSE' is the ordianry least square regression. as in the function |
clinear |
is grid of tuning parameter for |
m |
number of folds in cross validation for |
e |
The rounding error for to compute bias in |
kernel |
The choice of kernel for Improved O-learning, default is |
sigma |
if |
models |
a list of models of class 'linearcl' |
Ying Liu yl2802@cumc.columbia.edu http://www.columbia.edu/~yl2802/
Liu, Y., Zeng, D., Wang, Y. (2014). Use of personalized Dynamic Treatment Regimes (DTRs) and Sequential Multiple Assignment Randomized Trials (SMARTs) in mental health studies. Shanghai archives of psychiatry, 26(6), 376.
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4311115/
Liu et al. (2015). Under double-blinded review.
Olearning_Single
, Qlearning_Single
1 2 3 4 5 6 7 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.