DoubleMLPLR | R Documentation |
Double machine learning for partially linear regression models.
R6::R6Class object inheriting from DoubleML.
Partially linear regression (PLR) models take the form
Y = D\theta_0 + g_0(X) + \zeta,
D = m_0(X) + V,
with E[\zeta|D,X]=0
and E[V|X] = 0
. Y
is the outcome
variable variable and D
is the policy variable of interest.
The high-dimensional vector X = (X_1, \ldots, X_p)
consists of other
confounding covariates, and \zeta
and V
are stochastic errors.
DoubleML::DoubleML
-> DoubleMLPLR
DoubleML::DoubleML$bootstrap()
DoubleML::DoubleML$confint()
DoubleML::DoubleML$fit()
DoubleML::DoubleML$get_params()
DoubleML::DoubleML$learner_names()
DoubleML::DoubleML$p_adjust()
DoubleML::DoubleML$params_names()
DoubleML::DoubleML$print()
DoubleML::DoubleML$set_sample_splitting()
DoubleML::DoubleML$split_samples()
DoubleML::DoubleML$summary()
new()
Creates a new instance of this R6 class.
DoubleMLPLR$new( data, ml_l, ml_m, ml_g = NULL, n_folds = 5, n_rep = 1, score = "partialling out", dml_procedure = "dml2", draw_sample_splitting = TRUE, apply_cross_fitting = TRUE )
data
(DoubleMLData
)
The DoubleMLData
object providing the data and specifying the
variables of the causal model.
ml_l
(LearnerRegr
,
Learner
, character(1)
)
A learner of the class LearnerRegr
, which is
available from mlr3 or its
extension packages mlr3learners or
mlr3extralearners.
Alternatively, a Learner
object with public field
task_type = "regr"
can be passed, for example of class
GraphLearner
. The learner can possibly
be passed with specified parameters, for example
lrn("regr.cv_glmnet", s = "lambda.min")
.
ml_l
refers to the nuisance function l_0(X) = E[Y|X]
.
ml_m
(LearnerRegr
,
LearnerClassif
, Learner
,
character(1)
)
A learner of the class LearnerRegr
, which is
available from mlr3 or its
extension packages mlr3learners or
mlr3extralearners.
For binary treatment variables, an object of the class
LearnerClassif
can be passed, for example
lrn("classif.cv_glmnet", s = "lambda.min")
.
Alternatively, a Learner
object with public field
task_type = "regr"
or task_type = "classif"
can be passed,
respectively, for example of class
GraphLearner
.
ml_m
refers to the nuisance function m_0(X) = E[D|X]
.
ml_g
(LearnerRegr
,
Learner
, character(1)
)
A learner of the class LearnerRegr
, which is
available from mlr3 or its
extension packages mlr3learners or
mlr3extralearners.
Alternatively, a Learner
object with public field
task_type = "regr"
can be passed, for example of class
GraphLearner
. The learner can possibly
be passed with specified parameters, for example
lrn("regr.cv_glmnet", s = "lambda.min")
.
ml_g
refers to the nuisance function g_0(X) = E[Y - D\theta_0|X]
.
Note: The learner ml_g
is only required for the score 'IV-type'
.
Optionally, it can be specified and estimated for callable scores.
n_folds
(integer(1)
)
Number of folds. Default is 5
.
n_rep
(integer(1)
)
Number of repetitions for the sample splitting. Default is 1
.
score
(character(1)
, function()
)
A character(1)
("partialling out"
or "IV-type"
) or a function()
specifying the score function.
If a function()
is provided, it must be of the form
function(y, d, l_hat, m_hat, g_hat, smpls)
and
the returned output must be a named list()
with elements psi_a
and
psi_b
. Default is "partialling out"
.
dml_procedure
(character(1)
)
A character(1)
("dml1"
or "dml2"
) specifying the double machine
learning algorithm. Default is "dml2"
.
draw_sample_splitting
(logical(1)
)
Indicates whether the sample splitting should be drawn during
initialization of the object. Default is TRUE
.
apply_cross_fitting
(logical(1)
)
Indicates whether cross-fitting should be applied. Default is TRUE
.
set_ml_nuisance_params()
Set hyperparameters for the nuisance models of DoubleML models.
Note that in the current implementation, either all parameters have to be set globally or all parameters have to be provided fold-specific.
DoubleMLPLR$set_ml_nuisance_params( learner = NULL, treat_var = NULL, params, set_fold_specific = FALSE )
learner
(character(1)
)
The nuisance model/learner (see method params_names
).
treat_var
(character(1)
)
The treatment varaible (hyperparameters can be set treatment-variable
specific).
params
(named list()
)
A named list()
with estimator parameters. Parameters are used for all
folds by default. Alternatively, parameters can be passed in a
fold-specific way if option fold_specific
is TRUE
. In this case, the
outer list needs to be of length n_rep
and the inner list of length
n_folds
.
set_fold_specific
(logical(1)
)
Indicates if the parameters passed in params
should be passed in
fold-specific way. Default is FALSE
. If TRUE
, the outer list needs
to be of length n_rep
and the inner list of length n_folds
.
Note that in the current implementation, either all parameters have to
be set globally or all parameters have to be provided fold-specific.
self
tune()
Hyperparameter-tuning for DoubleML models.
The hyperparameter-tuning is performed using the tuning methods provided in the mlr3tuning package. For more information on tuning in mlr3, we refer to the section on parameter tuning in the mlr3 book.
DoubleMLPLR$tune( param_set, tune_settings = list(n_folds_tune = 5, rsmp_tune = mlr3::rsmp("cv", folds = 5), measure = NULL, terminator = mlr3tuning::trm("evals", n_evals = 20), algorithm = mlr3tuning::tnr("grid_search"), resolution = 5), tune_on_folds = FALSE )
param_set
(named list()
)
A named list
with a parameter grid for each nuisance model/learner
(see method learner_names()
). The parameter grid must be an object of
class ParamSet.
tune_settings
(named list()
)
A named list()
with arguments passed to the hyperparameter-tuning with
mlr3tuning to set up
TuningInstance objects.
tune_settings
has entries
terminator
(Terminator)
A Terminator object. Specification of terminator
is required to perform tuning.
algorithm
(Tuner or character(1)
)
A Tuner object (recommended) or key passed to the
respective dictionary to specify the tuning algorithm used in
tnr(). algorithm
is passed as an argument to
tnr(). If algorithm
is not specified by the users,
default is set to "grid_search"
. If set to "grid_search"
, then
additional argument "resolution"
is required.
rsmp_tune
(Resampling or character(1)
)
A Resampling object (recommended) or option passed
to rsmp() to initialize a
Resampling for parameter tuning in mlr3
.
If not specified by the user, default is set to "cv"
(cross-validation).
n_folds_tune
(integer(1)
, optional)
If rsmp_tune = "cv"
, number of folds used for cross-validation.
If not specified by the user, default is set to 5
.
measure
(NULL
, named list()
, optional)
Named list containing the measures used for parameter tuning. Entries in
list must either be Measure objects or keys to be
passed to passed to msr(). The names of the entries must
match the learner names (see method learner_names()
). If set to NULL
,
default measures are used, i.e., "regr.mse"
for continuous outcome
variables and "classif.ce"
for binary outcomes.
resolution
(character(1)
)
The key passed to the respective
dictionary to specify the tuning algorithm used in
tnr(). resolution
is passed as an argument to
tnr().
tune_on_folds
(logical(1)
)
Indicates whether the tuning should be done fold-specific or globally.
Default is FALSE
.
self
clone()
The objects of this class are cloneable with this method.
DoubleMLPLR$clone(deep = FALSE)
deep
Whether to make a deep clone.
Other DoubleML:
DoubleML
,
DoubleMLIIVM
,
DoubleMLIRM
,
DoubleMLPLIV
library(DoubleML)
library(mlr3)
library(mlr3learners)
library(data.table)
set.seed(2)
ml_g = lrn("regr.ranger", num.trees = 10, max.depth = 2)
ml_m = ml_g$clone()
obj_dml_data = make_plr_CCDDHNR2018(alpha = 0.5)
dml_plr_obj = DoubleMLPLR$new(obj_dml_data, ml_g, ml_m)
dml_plr_obj$fit()
dml_plr_obj$summary()
## Not run:
library(DoubleML)
library(mlr3)
library(mlr3learners)
library(mlr3tuning)
library(data.table)
set.seed(2)
ml_l = lrn("regr.rpart")
ml_m = ml_l$clone()
obj_dml_data = make_plr_CCDDHNR2018(alpha = 0.5)
dml_plr_obj = DoubleMLPLR$new(obj_dml_data, ml_l, ml_m)
param_grid = list(
"ml_l" = paradox::ps(
cp = paradox::p_dbl(lower = 0.01, upper = 0.02),
minsplit = paradox::p_int(lower = 1, upper = 2)),
"ml_m" = paradox::ps(
cp = paradox::p_dbl(lower = 0.01, upper = 0.02),
minsplit = paradox::p_int(lower = 1, upper = 2)))
# minimum requirements for tune_settings
tune_settings = list(
terminator = mlr3tuning::trm("evals", n_evals = 5),
algorithm = mlr3tuning::tnr("grid_search", resolution = 5))
dml_plr_obj$tune(param_set = param_grid, tune_settings = tune_settings)
dml_plr_obj$fit()
dml_plr_obj$summary()
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.