Description Usage Arguments Value Author(s) References See Also Examples
It implements single stage Q-learning. Q-learning estimates optimal treatment option by fitting a regression model with treatment, feature variable and their interactions. The optimal treatment option is the the sign of the interaction term which maximize the predicted value from the regression model.
1 | Qlearning_Single(H, A, R, pentype = "lasso",m=4)
|
H |
a n by p matrix, n is the sample size, p is the number of feature variables. |
A |
a vector of treatment assignments coded 1 and -1. |
R |
a vector of outcomes, larger is more desirable. |
pentype |
The type of regression in Q-learning, 'lasso' is the default lasso regression; 'LSE' is the ordinary least square. |
m |
needed when pentype='lasso', the number of folds in cross validation for picking tuning parameter for lasso in |
It returns a class of 'qlearn'
, that consists of two components:
co |
the coefficient of the regression model, it is a 2p+2 vector. The design matrix is X=(Intercept, H, A, diag(A)*H) |
Q |
The predicted optimal outcome from the regression model |
Ying Liu yl2802@cumc.columbia.edu http://www.columbia.edu/~yl2802/
Watkins, C. J. C. H. (1989). Learning from delayed rewards (Doctoral dissertation, University of Cambridge).
Murphy, S. A., Oslin, D. W., Rush, A. J., & Zhu, J. (2007). Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders. Neuropsychopharmacology, 32(2), 257-262.
Zhao, Y., Kosorok, M. R., & Zeng, D. (2009). Reinforcement learning design for cancer clinical trials. Statistics in medicine, 28(26), 3294.
1 2 3 4 5 6 7 8 |
Loading required package: kernlab
Loading required package: MASS
Loading required package: glmnet
Loading required package: Matrix
Loading required package: foreach
Loaded glmnet 2.0-16
Loading required package: ggplot2
Attaching package: 'ggplot2'
The following object is masked from 'package:kernlab':
alpha
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.