PanelMatch Overview

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)

Data Preparation

Users should begin by preparing their data with the PanelData() function. PanelData() conducts a number of error checks on the data, balances the panel, and creates a PanelData object which stores the time identifier, unit identifier, treatment, and outcome variables. Storing this metadata simplifies the interface at later stages, so users do not need to repeatedly specify these important variables.

library(PanelMatch)
dem.panel <- PanelData(panel.data = dem, 
                       unit.id = "wbcode2", 
                       time.id = "year", 
                       treatment = "dem", 
                       outcome = "y")

Treatment Variation Plot

Users can visualize the variation of treatment across space and time. This will help users build an intuition about how comparison of treated and control observations can be made.

dem.panel <- PanelData(panel.data = dem,
               unit.id = "wbcode2",
               time.id = "year",
               treatment = "dem",
               outcome = "y")
DisplayTreatment(panel.data = dem.panel, legend.position = "none",
                 xlab = "year", ylab = "Country Code", 
                 hide.x.tick.label = TRUE, hide.y.tick.label = TRUE)

While one can create simple plots easily, some additional customization may be desirable. For instance, user-specified labels can help clarify the substantive interpretation of the figures and visual adjustments might be necessary to accommodate larger data sets, as automatically generated labels will become illegible. To this end, the DisplayTreatment() function offers a large number of options for adjusting common features of the plot. Additionally, the DisplayTreatment() function returns a ggplot2 object (created using geom_tile()), meaning that standard ggplot2 syntax can be used to further customize any aspect of the figure.

Creating and Refining Matched Sets

Users can then create and refine matched sets using PanelMatch(). There are a large number of parameters that control this process. Please see the function documentation for descriptions.

PM.maha <- PanelMatch(panel.data = dem.panel, 
                      lag = 4, 
                      refinement.method = "mahalanobis",
                      match.missing = FALSE, 
                      covs.formula = ~ I(lag(tradewb, 0:4)) + 
                                       I(lag(y, 1:4)),
                      size.match = 5, 
                      qoi = "att", 
                      lead = 0:2,
                      use.diagonal.variance.matrix = TRUE,
                      forbid.treatment.reversal = FALSE)



PM.ps.weight <- PanelMatch(lag = 4, 
                      refinement.method = "ps.weight",
                      panel.data = dem.panel, 
                      match.missing = FALSE, 
                      covs.formula = ~ I(lag(tradewb, 0:4)) + 
                                       I(lag(y, 1:4)),
                      qoi = "att", 
                      lead = 0:2,
                      use.diagonal.variance.matrix = TRUE,
                      forbid.treatment.reversal = FALSE)

One can examine the distribution of the sizes of matched sets with the plot.PanelMatch() method:

plot(PM.maha)

Using the get_covariate_balance() function, we can examine the covariate balance measure. Note that the covariate balance measure should be much lower for matched sets after refinement if the configuration used is effective.

covbal <- get_covariate_balance(PM.maha, PM.ps.weight, 
                                panel.data = dem.panel,
                                covariates = c("tradewb", "y"),
                                include.unrefined = TRUE)

We can examine and plot these results. We can see that the Mahalanobis distance based matching refinement method generally performs better than the propensity score weighting method. We can also visualize the pre-refinement balance measures to see how much refinement improved covariate balance.

summary(covbal)
plot(covbal, type = "panel", 
     include.unrefined.panel = FALSE, ylim = c(-.5, .5))
# Since specifications are identical except 
# for refinement method, just look at the first result.
plot(get_unrefined_balance(covbal)[1], 
     include.unrefined.panel = FALSE, ylim = c(-.5, .5))

Getting Estimates and Standard Errors

Once proper matched sets are attained by PanelMatch(), users can estimate the causal quantity of interest such as the average treatment effect using PanelEstimate(). Users can estimate the contemporaneous effect as well as long-term effects. In this example, we illustrate the use of PanelEstimate() to estimate the average treatment effect on treated units (att) at time t on the outcomes from time t+0 to t+4.

PE.results <- PanelEstimate(sets = PM.maha, 
                            panel.data = dem.panel, 
                            se.method = "bootstrap")

plot(PE.results)


Try the PanelMatch package in your browser

Any scripts or data that you put into this service are public.

PanelMatch documentation built on April 3, 2025, 6:34 p.m.