knitr::opts_chunk$set( collapse = TRUE, comment = "#>", echo = FALSE, error = FALSE, message = FALSE, warning = FALSE ) # Install locally # devtools::install_local( R'(C:\Users\James.Thorson\Desktop\Git\tinyVAST)', force=TRUE ) # Build and PDF # setwd(R'(C:\Users\James.Thorson\Desktop\Git\tinyVAST)'); devtools::build_rmd("vignettes/model-description.Rmd"); rmarkdown::render( "vignettes/model-description.Rmd", rmarkdown::pdf_document()) # # Recommended workflow: # * Open in Rstudio and knit using button there
\newcommand{\B}{\mathbf}
\newcommand{\PL}{\mathrm}
\newcommand{\s}{\boldsymbol{s}}
tinyVAST is a bivariate extension of a generalized linear mixed model (see Tables 1 and 2 for notation), which includes two linear predictors that each include four additive components:
Spatial interactions among variables: The user can specify interactions among variables at a given site (or for spatially correlated latent variables) using arrow notation derived from path analysis, based on the interface from R package sem;
Temporal interaction among variables: The user can specify simultaneous and lagged interactions among variables over time using an expanded arrow-and-lag notation that is derived from R package dsem, where these interactions the annual intercept for a given variable therefore apply uniformly for all locations.
Spatio-temporal interactions among variables: The user can specify simultaneous and lagged interactions among variables over time, where these interactions occur on a site-by-site basis.
Generalized additive model: The user specifies a formula for a generalized additive model (GAM) that is interpreted by R package mgcv. If other model components are missing, tinyVAST estimates parameters that are similar to mgcv, with small differences resulting from different methods for parameter estimation;
These four components are assembled into two linear predictors:
$$ \begin{aligned} p_{\PL 1,i} &= \underbrace{\B X_{\PL 1,i} \B \alpha_{\PL 1} + \B Z_{\PL 1,i} \B \gamma_{\PL 1}}\text{GAM} + \underbrace{\B A{i} \B \Omega_{\PL 1,c[i]}}\text{space_term} + \underbrace{\B D{\PL 1,c[i],t[i]}}\text{time_term} + \underbrace{\B A_i \B E{\PL 1,c[i],t[i]}}\text{spacetime_term} \ p{\PL 2,i} &= \underbrace{\B X_{\PL 2,i} \B \alpha_{\PL 2} + \B Z_{\PL 2,i} \B \gamma_{\PL 2}}\text{GAM} + \underbrace{\B A{i} \B \Omega_{\PL 2,c[i]}}\text{space_term} + \underbrace{\B D{\PL 2,c[i],t[i]}}\text{time_term} + \underbrace{\B A_i \B E{\PL 2,c[i],t[i]}}_\text{spacetime_term} \end{aligned} $$
where
and terms are defined similarly for the second linear predictor $p_{\PL 2,i}$ except using subscript-2. The linear predictors are then passed through a bivariate inverse-link function to specify the distribution for errors:
$$ y_i \sim f_{e[i]}( g_{e[i]}^{-1} (p_{\PL 1,i}, p_{\PL 2,i}), \theta_{e[i]} ) $$
where
In the simple case, the distribution only requires a single linear predictor such that $\B p_{\PL 2} = \B 0$ by construction and drops out of the model. In this case, the model collapses to a generalized linear mixed model. For example we might have a log-link and Poisson distribution, such that it collapses to a log-linked Poisson GLMM, $y_i \sim \mathrm{Poisson}(e^{p_{\PL 1,i}})$.
However, tinyVAST can also handle a delta-model using either logit-log or Poisson-linked link functions:
$$ g_{e[i]}^{-1} (p_{\PL 1,i}, p_{\PL 2,i}) = ( \mu_{\PL 1,i}, \mu_{\PL 2,i} ) $$
where $\mu_{\PL 1,i}$ and $\mu_{\PL 2,i}$ are the two linear predictors, such that $\mathbb{E}[y_i] = \mu_{\PL 1,i} \mu_{\PL 2,i}$;
For the conventional logit-log bivariate link function we use a logit-link for encounter probabilities and a log-link for positive catch rates:
$$ \begin{aligned} \mu_{\PL 1,i} &= \frac{e^{p_{\PL 1,i}}}{1+e^{p_{\PL 1,i}}} \ \mu_{\PL 2,i} &= e^{p_{\PL 2,i}} \end{aligned} $$
while for the Poisson-linked link function we specify a complemetary log-log link for encounter probabilities and define the second link function such that $\mu_{\PL 1,i} \mu_{\PL 2,i} = e^{p_{\PL 1,i}} e^{p_{\PL 2,i}}$:
$$ \begin{aligned} \mu_{\PL 1,i} &= 1 - e^{-e^{p_{\PL 1,i}}} \ \mu_{\PL 2,i} &= \frac{e^{p_{\PL 1,i}}}{\mu_{\PL 1,i}} e^{p_{\PL 2,i}} \end{aligned} $$ where $e^{p_{\PL 1,i}}$ is the density for an underlying point process, and $e^{p_{\PL 2,i}}$ is biomass per point (i.e., animal or group).
In either the conventional or Poisson-linked delta model, $\mu_{\PL 1,i} = \mathrm{Pr}(Y>0)$ is the encounter probability (and is fitted with a Bernoulli distribution), while $\mu_{\PL 2,i}$ is the central tendancy for positive values, i.e., $\mu_{\PL 2,i} = \mathbb{E}(Y | Y>0)$.
Linear predictors include spatially autocorrelated latent variables. These variables are treated as Gaussian Markov random fields (GMRFs), and evaluating the probability density of GMRFs involves calculating the precision matrix $\B Q_{\PL{domain}} = \B \Sigma^{-1}$ as the inverse of the spatial covariance matrix. tinyVAST involves three options for specifying this spatial precision:
$$ \B Q_{\PL{domain}} = \tau^2 ( \kappa^4 \B M_{\PL 0} + 2\kappa^2 \B M_{\PL 1} + \B M_{\PL 2} ) $$ where every row $\B A_i$ of the interpolation matrix $\B A$ is nonzero for only the three vertices of the triangle that contains sample $i$
$$ \B Q_{\PL{domain}} = \tau^2 (\B I - \kappa \B A^)^2 $$ where $\B A^$ is the adjacency matrix of the graph specified by the analyst, and each row $\B A_i$ of interpolation matrix $\B A$ is nonzero only for the single spatial stratum containing sample $i$ (and noting that adjacency matrix $\B A^*$ is different from interpolation matrix $\B A$, but we use the same notation for both due to a collision in notational standards).
For each spatial domain, the term $\tau$ is defined such that precision $\B Q_{\PL{domain}}$ has unit variance. This is done because the spatial domain always arises in combination with other parameters, which are used to define the variance of the associated spatial variable.
tinyVAST also involves specifying a structural equation model (SEM). This SEM be viewed either:
To specify a SEM, the user uses arrow notation derived from package sem
. For example, to specify a linear model this involves:
w1 -> w2, b_12 w1 <-> w1, sd_1 w2 <-> w2, sd_2
This then estimates a single slope parameter (represented with a one-headed arrow), as well as the variance of $W_1$ and $W_2$ (specified with two-headed arrows). In a more complicated case, $W_1$ might cause $W_2$, which in turn causes $W_3$. This is then represented as:
# Path, parameter_name, start value w1 -> w2, b_12, 0 w2 -> w3, b_23, 0 w1 <-> w1, s_1, 1 w2 <-> w2, s_2, 1 w3 <-> w3, s_3, 1
SEM interactions can be as complicated or simple as desired, and can include:
parameter_name
is provided as NA
and the starting value that follows is the fixed value;parameter_name
is provided for multiple rows of the text file.In this preceding example, path coefficients for one-headed arrows then define path matrix $\B P$:
$$ \B P = \begin{pmatrix} 0 & 0 & 0 \ b_{12} & 0 & 0 \ 0 & b_{23} & 0 \end{pmatrix} $$
and coefficents for two-headed arrows define the Cholesky $\B G$ of the exnogenous covariance matrix $\B G^T \B G$:
$$ \B G = \begin{pmatrix} s_{1} & 0 & 0 \ 0 & s_{2} & 0 \ 0 & 0 & s_{3} \end{pmatrix} $$
These matrices are define a simultaneous equation model:
$$ \mathbf{ w = P w + \epsilon} \ \B \epsilon \sim \mathrm{MVN}( \B 0, \B G^T \B G ) $$ where the variance $\mathrm{Var}(\B w) = (\mathbf{I - P})^{-1} \B G^2 (\mathbf{I - P}^T)^{-1}$. This then results in a sparse precision matrix:
$$ \B Q = (\mathbf{I - P}^T) \B G^{-1} \B G^{-T} (\mathbf{I - P}) $$
where this precision matrix $\B Q$ is then used as a modular component in the larger tinyVAST model.
Similarly, tinyVAST involves specifying dynamic structural equation models (DSEM). To specify a DSEM, the user uses arrow-and-lag notation. For example, to specify a univariate first-order autoregressive process:
w1 -> w1, 1, rho, 0.8
If there were four time-intervals ($T=4$) this would then result in the path matrix:
$$ \B P = \begin{pmatrix} 0 & 0 & 0 & 0 \ \rho & 0 & 0 & 0 \ 0 & \rho & 0 & 0 \ 0 & 0 & \rho & 0 \end{pmatrix} $$
and when the DSEM involves multiple times and variables, the sparse precision is formed by summing across the Kronecker product of time-lag and interaction matrices. This DSEM defines a GMRF over a nonseparable interaction of time and variables, represented by a matrix $\B Q_{\PL{time_term}}$ with dimension $CT \times CT$. The user can specify a separate arrow-and-lag notation to define the precision matrix for the time-variable interaction $\B Q_{\PL{time_term}}$ and for the space-time-variable interaction $\B Q_{\PL{spacetime_term}}$.
The precision matrix $\B Q_{\PL{time_term}}$ for the time term $\B D$ is used to define the time-varying intercept for each variable:
$$ \mathrm{vec}(\B D) \sim \mathrm{MVN}(\B 0, \B Q_{\PL{time_term}}) $$ Meanwhile, the space-time term $\B Q_{\PL{spacetime_term}}$ is combined with the spatial precision $\B Q_{\PL{space_term}}$ as we explain next.
tinyVAST uses the SEM and DSEM notation to construct the joint precision for the space-variable interaction $\B \Omega$ with dimension $S \times C$, and the space-time-variable interaction $\B E$ with dimension $S \times C \times T$. To do so, it constructs the separable precision for each process:
$$ \mathrm{vec}(\B E) \sim \mathrm{MVN}(\B 0, \B Q_{\PL{spacetime_term}} \otimes \B Q_{\PL{domain}}) $$
where the precision matrix $\B Q_{\PL{spacetime_term}} \otimes \B Q_{\PL{domain}}$ has dimension $STC \times STC$ to match the length of $\mathrm{vec}(\B E)$, and $$ \mathrm{vec}(\B \Omega) \sim \mathrm{MVN}(\B 0, \B Q_{\PL{space_term}} \otimes \B Q_{\PL{domain}}) $$
where the precision matrix $\B Q_{\PL{space_term}} \otimes \B Q_{\PL{domain}}$ has dimension $SC \times SC$, to match the length of $\mathrm{vec}(\B \Omega)$.
Finally, the analyst can specify a generalized additive model using syntax from package mgcv. For example this might involve:
count ~ year + offset(log_area) + s(depth) + s(species, bs="re")
If year
and species
are factors and depth
and log_area
are continuous, then this would specify a fixed effect for each level year
, a spline smoother for depth
, using log_area
as an offset, and estimating a random intercept for each level of species
. This formula is parsed internally to assemble fixed effects in design matrix $\B X$ and the basis functions for spline smoothers and random effects in design matrix $\B Z$. The coefficients $\B \gamma$ associated with smoothers and random effects are then specified to follow a GMRF:
$$ \B \gamma \sim \mathrm{GMRF}( \B 0, \B Q_{\PL{gam}}) $$ where $\B Q_{\PL{gam}}$ is a blockwise diagonal matrix, assembled from estimated variance parameters and matrices constructed by mgcv.
subscripts <- tibble::tribble( ~Symbol, ~Description, "$i$", "Index for each sample, $i$ in $(1,2,...,I)$", "$s[i]$", "spatial coordinate for sample $i$, $s$ in $(1,2,...,S)$", "$t[i]$", "time-interval for sample $i$, $t$ in $(1,2,...,T)$", "$c[i]$", "category for sample $i$, $c$ in $(1,2,...,C)$", "$e[i]$", "error distribution and link function for sample $i$" ) knitr::kable(subscripts, caption = "Table 1: Subscript notation", escape = FALSE, booktabs = TRUE)
symbols <- tibble::tribble( ~Symbol, ~Code, ~Description, "$y$", "`y_i`", "Observed response data", "$p_1$", "`p_i`", "first linear predictor", "$p_2$", "`p2_i`", "second linear predictor", ) knitr::kable(symbols, caption = "Table 2: Symbol notation, code representation (in model output or in model template code), and descriptions.", escape = FALSE, booktabs = TRUE, linesep = c( rep('', 7), # y, mean etc. '\\addlinespace', rep('', 7), # fields '\\addlinespace', rep('', 5), # covariance stuff '\\addlinespace', rep('', 5), # more covariance stuff '\\addlinespace', rep('', 99) # end ))
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.