Home

/

GitHub

/

In BarkleyBG/causalsandwich: Estimating Causal Effects with M-Estimation Sandwich Variance Estimator

knitr::opts_chunk$set(echo = TRUE)

`causalsandwich` for estimating causal effects with empirical sandwich variance estimator

Currently implemented as of version 0.0.0.9000: - Estimators of the average treament effect - IPTW - G-formula - Doubly Robust estimator

Why use `causalsandwich`? Easy implementation with valid inference!

This package is inteded to be a "one-stop-shop" with emphasis on ease of use.
The estimators here are consistent and asymptotically normal following usual regularity assumptions.

What's `geex`?

geex is a package designed for easy implementation of estimating equations. The causalsandwich package is powered by geex

What are estimating equations?

See Stefanski and Boos (2002)

How to install

The package can be installed from Github:

devtools::install_github("BarkleyBG/causalsandwich")

Examples

Whenver possible, the causal effect estimate is listed last.

Estimating the Average Treatment Effect

First, some data:

library(causalsandwich)

n <- 200
data <- data.frame(
  Covar1 = rnorm(n),
  Covar2 = rnorm(n)
)
trtprobs <- plogis(0.2  + 0.5*data$Covar1 + 0.1*data$Covar2 + 0.2*data$Covar1 * data$Covar2)
data$BinaryTrt <- rbinom(n, 1, trtprobs)

outprobs <- plogis(0.5  + 0.2*data$Covar1 - 0.2*data$Covar2 + 0.2*data$Covar1 * data$Covar2 -0.2*data$BinaryTrt)
data$BinaryOutcome <- rbinom(n, 1, outprobs)

Defensive Programming

Until rigorous tests and defensive programming are added, users are recommended to use the following coding schemes:

Binary treatments should be indicator variables, with 1 for "treated" and 0 for "untreated". Use integers, not factors.
Binary outcomes should be indicator variables, with for "outcome" and 0 for "no outcome". Use integers, not factors.

Inverse Probability (of treament) Weighting

Estimating the Average Treatment Effect only takes specifying a few arguments. The below code will estimate propensity scores with the logistic GLM model and estimate the ATE with the Horwitz-Thompson IPTW estimator.

The formula specified is a mixture of two formulas. That is, BinaryTrt ~ Covar1*Covar2 will be passed into the logistic model to fit a model for treatment with main effects and interaction terms with the two covariates. The first section of the multi-part formula indicates that the column named BinaryOutcome in data corresponds to the outcome variable of interest.

formula <- BinaryOutcome | BinaryTrt ~ Covar1 * Covar2

IPTW <- estimateIPTW(
  data = data, 
  formula = formula 
)

Output includes point estimates and the sandwich variance estimator:

(ests <- IPTW@estimates)
(vcov <- IPTW@vcov)

The first four parameters correspond to the treatment model parameters. It is the fifth parameter that corresponds to the ATE. Thus, a point estimate and 95\% confidence interval for this parameter are:

param_num <- 5
ests[param_num]
ests[param_num] + stats::qnorm(c(0.025,0.975))*vcov[param_num,param_num]

Since the treatment model is correctly specified, the estimator is consistent (as is the variance estimator).

G-formula

Here we model the outcome on treatment and covariates, so we supply outcome_regression_formula to the formula argument.
We also supply the name of the column in the dataset corresponding to treatment, treatment_var_name = "BinaryTrt".

outcome_regression_formula <- BinaryOutcome ~ BinaryTrt + Covar1 * Covar2

GF <- estimateGF(
  data = data,
  treatment_var_name = "BinaryTrt",
  formula = outcome_regression_formula, 
  model_method = "logistic" 
) 

(ests <- GF@estimates)
(vcov <- GF@vcov)

An estimate and 95\% confidence interval is then

param_num <- 6
ests[param_num]
ests[param_num] + stats::qnorm(c(0.025,0.975))*vcov[param_num,param_num]

Doubly-Robust Estimation

Here we specify two formulas, one for each model.
- We also specify modeling methods for each model.
- It is not necessary to specify other variable names, as the other arguments are sufficient.
For illustration we also pass in deriv_control = geex::setup_deriv_control(method="simple"). This results in quicker (but probably less accurate) derivatives in the variance computations. This is passed directly to the geex guts.

outcome_regression_formula <- BinaryOutcome ~ BinaryTrt + Covar1 * Covar2
treatment_model_formula <- BinaryTrt ~ Covar1 * Covar2

DRIPTW <- estimateDRIPTW(
  data = data,
  outcome_formula = outcome_regression_formula,
  treatment_formula = treatment_model_formula,
  outcome_model_method = "logistic",
  treatment_model_method = "logistic",
  deriv_control = geex::setup_deriv_control(method="simple")
)

(ests <- DRIPTW@estimates)
(vcov <- DRIPTW@vcov)

An estimate and 95\% confidence interval is then

param_num <- 10
ests[param_num]
ests[param_num] + stats::qnorm(c(0.025,0.975))*vcov[param_num,param_num]

BarkleyBG/causalsandwich documentation built on May 28, 2019, 11:36 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

BarkleyBG/causalsandwich
Estimating Causal Effects with M-Estimation Sandwich Variance Estimator

In BarkleyBG/causalsandwich: Estimating Causal Effects with M-Estimation Sandwich Variance Estimator

`causalsandwich` for estimating causal effects with empirical sandwich variance estimator

Why use `causalsandwich`? Easy implementation with valid inference!

What's `geex`?

What are estimating equations?

How to install

Examples

Estimating the Average Treatment Effect

Defensive Programming

Inverse Probability (of treament) Weighting

G-formula

Doubly-Robust Estimation

R Package Documentation

Browse R Packages

We want your feedback!

BarkleyBG/causalsandwich Estimating Causal Effects with M-Estimation Sandwich Variance Estimator

In BarkleyBG/causalsandwich: Estimating Causal Effects with M-Estimation Sandwich Variance Estimator

causalsandwich for estimating causal effects with empirical sandwich variance estimator

Why use causalsandwich? Easy implementation with valid inference!

What's geex?

What are estimating equations?

How to install

Examples

Estimating the Average Treatment Effect

Defensive Programming

Inverse Probability (of treament) Weighting

G-formula

Doubly-Robust Estimation

R Package Documentation

Browse R Packages

We want your feedback!

BarkleyBG/causalsandwich
Estimating Causal Effects with M-Estimation Sandwich Variance Estimator

`causalsandwich` for estimating causal effects with empirical sandwich variance estimator

Why use `causalsandwich`? Easy implementation with valid inference!

What's `geex`?