Description Usage Arguments Value References See Also Examples
This function implements multiple stage O-learning (with improved single stage O-learing) to find optimal DTR by backward induction.
1 2 |
X |
is either a matrix shared among all stages; or list of feature matrices, where feature matrices from different stages can have different dimensions. |
AA |
a list of K, each element |
RR |
a list of K, each element |
n |
sample size |
K |
number of stages |
pi |
a list of K, the i'th element is the randomization probability at stage i |
pentype |
the type of regression used to take residual, 'lasso' is the default, using lasso regression; 'LSE' is the ordianry least square regression. as in the function |
kernel |
The choice of kernel for Improved O-learning, default is |
sigma |
if kernel='rbf', sigma is the grid of tuning parameter for 'rbf' kernal to run cross validation to choose from, the default is (0.03, 0.05, 0.07) |
clinear |
is grid of tuning parameter for |
m |
number of folds in cross validation for |
e |
The rounding error for computing bias in |
models |
a list of models of class 'linearcl' |
Liu et al. (2015). Under double-blinded review.
Zhao, Y., Zeng, D., Rush, A. J., & Kosorok, M. R. (2012). Estimating individualized treatment rules using outcome weighted learning. Journal of the American Statistical Association, 107(499), 1106-1118.
Zhao, Y. Q., Zeng, D., Laber, E. B., & Kosorok, M. R. (2014). New statistical learning methods for estimating optimal dynamic treatment regimes. Journal of the American Statistical Association, (just-accepted), 00-00.
1 2 3 4 5 6 7 8 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.