knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)
library(dplyr)
library(ggplot2)
library(magrittr)
library(rlang)
library(tidyr)
library(MacroGrowth)

Introduction

The R package MacroGrowth provides sophisticated tools for fitting macroeconomic growth models to data. MacroGrowth has several desirable features for macroeconomic modelers:

The functions in this package were developed for and used first in Heun et al. [-@Heun:2017].

Macroeconomic growth models

MacroGrowth supports the following macroeconomic growth models:

In the above equations:

In the CES model, the elasticity of substitution between the ($x_1$, $x_2$) pair and $x_3$ is given by $\sigma = \frac{1}{1 + \rho}$. The elasticity of substitution between $x_1$ and $x_2$ is given by $\sigma_1 = \frac{1}{1 + \rho_1}$.

The Linex model is usually presented with $a_0$ (as above) and $c_t$ coefficients. $c_t$ is the ratio of Linex fitted coefficients, given by $c_t = \frac{a_1}{a_0}$.

Data

The starting point for any fitting process is macroeconomic data. For this vignette, we'll use a subset of the UK macroeconomic data from Heun et al. [-@Heun:2017]. The data frame EconUK is included in the MacroGrowth package.

library(dplyr)
library(ggplot2)
library(magrittr)
library(rlang)
library(tidyr)
library(MacroGrowth)
head(EconUK)

The EconUK data frame contains indexed GDP, capital, labor, and two types of energy data (primary exergy and useful exergy). All economic and energy data series are indexed by ratio to an initial year (1960), such that all quantities are 1 in 1960. Time is indexed by difference to the initial year, such that 1960 is year 0. The prefix i indicates an indexed variable. iK is indexed capital stock, iL is indexed labor, iXp is indexed primary exergy, and iXu is indexed useful exergy.

The functions in MacroGrowth can fit indexed or non-indexed data. When data are indexed as shown in the EconUK data frame, the fitted value of the $\theta$ parameter is expected to be close to 1. When data are not indexed, the $\theta$ parameter will likely be far from 1, because $\theta$ assumes the task of unit conversion from factors of production in various functional forms (on the right side of the economic model) to economic output (on the left side of the economic model).

Fitting examples

To fit a model to macroeconomic data, use one of the *Model functions in the MacroGrowth pacakge.

Single-factor model

To fit a single-factor model, use the sfModel function.

sffit <- sfModel(formula = iGDP ~ iK + iYear, data = EconUK)
sffit

The sfModel function produces an SFmodel object.

In all *Model functions, a formula of the form y ~ x1 + x2 + x3 + t communicates the roles of the factors of production to the fitting function. The components of the formula are:

For models with fewer than 3 factors of production, some of x1, x2, and x3 can be omitted.

In the above example, formula = iGDP ~ iK + iYear indicates that the model is to be fitted with indexed GDP (iGDP) as the response variable, indexed capital stock (iK) as the only factor of production, and indexed time (iYear) as the time variable.

The coefficients of the fitted model can be recovered with the naturalCoef function.

naturalCoef(sffit)

Cobb-Douglas model

Similar functions can be used for the other model types. To fit the Cobb-Douglas model to the UK's capital stock and labor data, use the cdModel function.

cdfit <- cdModel(formula = iGDP ~ iK + iL + iYear, data = EconUK)
cdfit

The cdModel function produces a CDEmodel object.

In the above example, iGDP ~ iK + iL + iYear indicates that the model is to be fitted with indexed GDP (iGDP) as the response variable, indexed capital stock (iK) and indexed labor (iL) as the factors of production, and indexed year (iYear) as the time variable.

The coefficients of the model can be recovered with the naturalCoef function.

naturalCoef(cdfit)

CES model

For the CES production function, use the cesModel function with two or three factors of production.

cesfit <- cesModel(formula = iGDP ~ iK + iL + iYear, data = EconUK)
naturalCoef(cesfit)

The cesModel function produces a cesModel object.

Note that output elasticities ($\alpha_1$, $\alpha_2$, and $\alpha_3$) are calculated by the naturalCoef function. Equations for the output elasticities are given in Appendix B of Heun et al. [-@Heun:2017].

Also note that the cesModel function repeatedly calls the excellent cesEst function from the micEconCES pacakge. For more details on the cesEst function, see Henningsen and Henningsen [-@Henningsen:2011td]. For more details on the fitting algorithm employed by cesModel, see Heun et al. [-@Heun:2017].

Linex model

For the Linex model, there must be three factors of production. The first factor of production ($x_1$) must be capital stock. The second factor of production ($x_2$) must be labor. The third factor of production ($x_3$) must be an energy variable.

linexfit <- linexModel(formula = iGDP ~ iK + iL + iXp + iYear, data = EconUK)
naturalCoef(linexfit)

The linexModel function produces a LINEXmodel object.

In the Linex model, output elasticities are a function of time. To recover the output elasticities from the Linex model, use the fortify function. $\alpha$, $\beta$, and $\gamma$ are output elasticities for capital, labor, and energy, respectively.

head(fortify(linexfit))

Linex model output elasticities are given by Equation 9 in Warr and Ayres [-@Warr:2012cg].

Metadata

By default the original economic data are saved as an attribute of the model object. Data can be recovered with the getData function.

head(getData(cdfit))

To not save original economic data with the model object, set the save.data argument of the various *Model functions to FALSE.

Fitted values

The fitted values of the response variable ($y$) can be recovered with the yhat function. The yhat function gives values of the response variable on the natural (not logarithmic) scale.

yhat(cdfit)

Residuals

Residuals for any model can be recovered by the resid function. The resid function gives residuals in log space (where the model was fitted).

resid(cdfit)

Sum of squared errors (SSE)

The sum of squared errors (SSE) for any model can be calculated by the following code.

sum(resid(cdfit)^2)

Accounting for boundaries and constraints

Most *Model fitting functions have the capability to account for boundaries and constraints. Constrained fitting behavior is controlled by the constrained argument to the *Model functions.

Single-factor model

The simplest fitting function (sfModel) provides an option to constrain the fit with $m = 1$, thereby assuming a Cobb-Douglas-type model where the output elasticity is constrained to the economically-meaningful value. By default, the single-factor model sets constrained = FALSE.

sfModel(formula = iGDP ~ iK + iYear, data = EconUK) %>% naturalCoef()
sfModel(formula = iGDP ~ iK + iYear, data = EconUK, constrained = TRUE) %>% naturalCoef()

Cobb-Douglas model

The cdModel function always assumes constant returns to scale and fits with $\displaystyle\sum_i \alpha_i = 1$. By default, constrained = TRUE, thereby assuring that $0 \le \alpha_i \le 1$.

To demonstrate fitting with constraints, we first use the cdModel function to fit the first 10 years (1960--1969) of EconUK and find $\alpha_2 < 0$.

cdModel(formula = iGDP ~ iK + iL + iXu + iYear, 
        data = EconUK %>% filter(Year < 1970), 
        constrained = FALSE) %>% 
  naturalCoef()

If, on the other hand, we use the default setting for the constrained argument, the best fit is found with $\alpha_2 = 0$ and different values for \alpha_1 and \alpha_3.

cdModel(formula = iGDP ~ iK + iL + iXu + iYear, data = EconUK %>% filter(Year < 1970)) %>% naturalCoef()

CES model

Constraints on the CES model are considerably more complicated than constraints on single-factor or Cobb-Douglas models. When constrained = TRUE, the cesModel function restricts fitted parameters to the following economically-meaningful regions:

The default value of constrained for the cesModel function is TRUE.

To demonstrate, we fit the CES model unconstrained with capital (iK) and useful exergy (iXu) as the factors of production for the years 1980--1989 to find that $\rho_1$ is beyond the economically meaningful boundary.

cesModel(formula = iGDP ~ iK + iXu + iYear, 
         data = EconUK %>% filter(Year >= 1980 & Year < 1990), 
         constrained = FALSE) %>% 
  naturalCoef()

If, instead, we use the default value for constrained (TRUE), we obtain fitted parameters that fall within the economically-meaningful region or on the boundary of the economically-meaningful region, even if the quality of the fit is worse.

cesModel(formula = iGDP ~ iK + iXu + iYear, 
         data = EconUK %>% filter(Year >= 1980 & Year < 1990)) %>% 
  naturalCoef()

The process for constrained fitting by the cesModel function is described in detail in Heun et al. [-@Heun:2017]. In short, the full CES model and 20 boundary models are compared. The model with the smallest sum of squared errors (SSE) is returned by the cesModel function. Models with larger SSE are stored as a list in the model.attempts attribute of the cesModel object returned by the cesModel function. To retrieve the rejected boundary models, use the attr function.

attr(cesfit, "model.attempts")[[1]] %>% naturalCoef()

Linex model

The economically-meaningful region for Linex model parameters $a_0$ and $a_1$ is all real numbers. Thus, fitted parameter constraints have no meaning for the linexModel function, and constrained is not an argument to the linexModel function.

Bootstrap resampling

Bootstrapping is a statistical technique for estimating the precision of parameter estimates by exploring the distribution of estimates in many resampled data sets. Each resampled data set is a randomized version of the original sample data to which the desired analysis method can be applied.

The coefficients from the fit to a resampled time series (the "resample coefficients") will be different from the coefficients obtained from the fit to historical data (the "base coefficients") and form a "resample distribution". When these resample coefficients are highly variable, it is an indication that the data do not determine the parameter estimates very precisely. Even when the residuals are small and the model produces fitted values that track the observed data closely, it may still be difficult to estimate some or all of the model parameters precisely. Lack of precision can stem from a number of factors, including a poor model fit, low model sensitivity to one or more parameters, correlation among parameter estimates, variability unexplained by the predictors in the model, etc.

To perform bootstrap resampling in the EconGrowth package, a model object must first be obtained by calling one of the *Model functions. For this example, we'll use the cdfit object obtained above. The resampledFits function performs the resampling.

cdfits_rs <- resampledFits(model = cdfit, method = "wild", n = 5, seed = 123)

The resampledFits function returns a list with two named members: coeffs and models. coeffs is a data frame with n + 1 rows. The first row contains fitted coefficients for the original model and has a method of orig. The remaining rows contain fitted coefficients for the resample fits, each with a different value for the index variable, and each with the method specified in the function call.

cdfits_rs$coeffs

models is a list containing the model object for each resample fit.

# Original fit
cdfits_rs$models[[1]] %>% naturalCoef()
# First resampled model
cdfits_rs$models[[2]] %>% naturalCoef()
# Last resampled model
cdfits_rs$models[[6]] %>% naturalCoef()

The resampledFits function knows about models fitted with any of the *Model functions.

resampledFits(model = sffit, method = "wild", n = 5)[["coeffs"]]
resampledFits(model = cesfit, method = "wild", n = 5)[["coeffs"]]
resampledFits(model = linexfit, method = "wild", n = 5)[["coeffs"]]

Plotting results

Graphs comparing fitted and historical values are often helpful for visualizing model performance. The following code provides an example using the Cobb-Douglas model fitted above.

bind_cols(EconUK, yhat(cdfit) %>% as.data.frame() %>% set_names("yhat")) %>% 
  ggplot() + 
  # Add historical data as points
  geom_point(mapping = aes(x = Year, y = iGDP), shape = 1) +
  # Add the fitted model as a line
  geom_line(mapping = aes(x = Year, y = yhat))

For the purposes of displaying resampled results from the three-factor-of-production Cobb-Douglas model, fitted output elasticies ($\alpha_i$) can be shown in a ternary plot.

triData <- cdModel(formula = iGDP ~ iK + iL + iXu + iYear, data = EconUK) %>%
  resampledFits(method = "wild", n = 100, seed = 123) %>%
  extract2("coeffs")

triData %>% filter(method == "wild") %>% 
  triPlot(mapping = aes(x = alpha_1, y = alpha_2, z = alpha_3), 
          alpha = 0.3) +
  geom_point(data = triData %>% filter(method == "orig"),
             mapping = aes(x = alpha_1, y = alpha_2, z = alpha_3),
             color = "red", alpha = 1, size = 3, stat = "triangle")

Conclusion

The MacroGrowth package provides functions that streamline both fitting processes (with *Model functions) and bootstrap resampling (with the resampledFits function) for several macroeconomic growth models.

References



EconModels/MacroGrowth documentation built on Dec. 17, 2019, 10:41 p.m.