plbpsm_models: Specifying generalized geospatial additive models
In funstatpackages/GgAM: Generalized geoAdditive Models

Description Value Models with simple smooth terms

This page is intended to provide some more information on how to specify GGAMs. A GGAM is a GLM in which the linear predictor depends, in part, on a sum of smooth functions of predictors and (possibly) linear functionals of smooth functions of (possibly dummy) predictors. Specifically let y_i denote an independent random variable with mean mu_i and an exponential family distribution, or failing that a known mean variance relationship suitable for use of quasi-likelihood methods. Then the the linear predictor of a GGAM has a structure something like

g(mu_i)=X_i b + f_1(x_1i,x_2i) + f_2(x_3i) + L_i f_3(x_4) + ...

where g is a known smooth monotonic ‘link’ function, X_i b is the parametric part of the linear predictor, the x_j are predictor variables, the u_j and f_j are smooth functions. There may of course be multiple linear functional terms, or none.

The key idea here is that the dependence of the response on the predictors can be represented as a parametric sub-model plus the sum of some (functionals of) smooth functions of one or more of the predictor variables. Thus the model is quite flexible relative to strictly parametric linear or generalized linear models, but still has much more structure than the completely general model that says that the response is just some smooth function of all the covariates.

The following sections discuss specifying model structures for gam. Specification of the distribution and link function is done using the family argument to plbpsm and works in the same way as for glm. This page therefore concentrates on the model formula for plbpsm.

A plbpsm object has the following elements:

`aic`	AIC of the fitted model: bear in mind that the degrees of freedom used to calculate this are the effective degrees of freedom of the model, and the likelihood is evaluated at the maximum of the penalized likelihood in most cases, not at the MLE.
`assign`	Array whose elements indicate which model term (listed in `pterms`) each parameter relates to: applies only to non-smooth terms.
`call`	the matched call (allows `update` to be used with `plbpsm` objects, for example).
`cmX`	column means of the model matrix (with elements corresponding to smooths set to zero ) — useful for componentwise CI calculation.
`coefficients_linear`	the linear (parametric) coefficients of the fitted model.
`coefficients_nonlinear`	the nonlinear (bivariate spline) coefficients of the fitted model.
`data`	the original supplied data argument (for class `"gam"` compatibility).
`deviance`	model deviance (not penalized deviance).
`df.null`	null degrees of freedom.
`df.residual`	effective residual degrees of freedom of the model.
`edf`	estimated degrees of freedom.
`family`	family object specifying distribution and link used.
`fitted.values`	fitted model predictions of expected value for each datum.
`formula`	the model formula.
`gcv_opt`	The minimized smoothing penalty parameter selection score: GCV.
`cv_opt`	The minimized smoothing penalty parameter selection score: CV.
`linear.predictors`	fitted model prediction of link function of expected value for each datum.
`criterion`	One of `"GCV"` or `CV`, depending on the fitting criterion used.
`min.edf`	Minimum possible degrees of freedom for whole model.
`model`	model frame containing all variables needed in original model fit.
`na.action`	The `na.action` used in fitting.
`nsdf`	number of parametric, non-smooth, model terms including the intercept.
`null.deviance`	deviance for single parameter model.
`method`	One of `ALASSO`, `SCAD`, depending on the variable selection method.
`pred.formula`	one sided formula containing variables needed for prediction, used by `predict.plbpsm`
`pterms`	`terms` object for strictly parametric part of model.
`rank`	apparent rank of fitted model.
`residuals`	the working residuals for the fitted model.
`scale`	defualt=1.
`se_beta`	estimated standard error for parametric covariates.
`sse`	Sum of squared error of the estimation.
`basis_info`	list of smooth objects, containing the basis information for each term in the model formula in the order in which they appear. These smooth objects are what gets returned by the `BasisCon` objects.
`terms`	`terms` object of `model` model frame.
`var.summary`	A named list of summary information on the predictor variables
`Ve`	estimated covariance matrix for the parameter estimators. Particularly useful for testing whether terms are zero. Not so useful for CI's as smooths are usually biased.
`weights`	final weights used in grplsfit.
`y`	response data.

Consider the example model.

g(mu_i) = b_0 + b_1 x_1i + b_2 x_2i + u(x_3i) + f(x_4i,x_5i)

where the response variables y_i has expectation mu_i and g is a link function.

The plbpsm formula for this would be
y ~ x1 + x2 + u(x3) + f(x4,x5).
This would use the default basis for the smooths (a thin plate regression spline basis for each), with default choices of degree of polynomials for both smooths. Full details of how to control smooths are given in s and u.

The above assumes that x_4 and x_5 are naturally on similar scales (e.g. they might be co-ordinates), so that isotropic smoothing is appropriate.

funstatpackages/GgAM documentation built on Nov. 4, 2019, 12:59 p.m.