fusionSimulate: Simulate spatial data
In spatialfusion: Multivariate Analysis of Spatial Data Using a Unifying Spatial Fusion Framework

fusionSimulate

R Documentation

Simulate spatial data

Description

Simulate spatial response variables with different data types, including geostatistical (point), lattice (areal), and point pattern data. They share common latent Gaussian processes. The geostatistical and lattice response variables are allowed to have fixed effects.

Usage

fusionSimulate(n.point, n.area, n.grid, n.pred, dimension = 10,
               psill = 5, phi = 1, nugget = 0, tau.sq = 1,
               domain = NULL, point.beta = NULL, area.beta = NULL,
               nvar.pp = 1, distributions,
               design.mat = matrix(c(1, 1.5, 2)),
               pp.offset, seed)

Arguments

`n.point`	positive integer. Sample size for geostatistical (point) data.
`n.area`	positive integer. Sample size for lattice (areal) data.
`n.grid`	positive integer. Number of grid to be divided in each direction of the spatial domain.
`n.pred`	positive integer. Number of prediction locations to sample regularly.
`dimension`	positive integer. Dimension of the square spatial domain.
`domain`	an object of class `sf`. The spatial domain considered for the simulation. If NULL, a square domain with length `dimension` is used.
`psill`	positive numeric or numeric vector. Partial sill parameter(s) of the latent Gaussian process(es).
`phi`	positive numeric or numeric vector. Range parameter(s) of the latent Gaussian process(es).
`nugget`	positive numeric vector. Nugget parameter(s) of the latent Gaussian process(es).
`tau.sq`	positive numeric vector. Variance component(s) for normally distributed responses.
`point.beta`	a list of column matrices. Regression coefficient for geostatistical data, e.g. `list(rbind(1,2,3),rbind(2,4,6))` for two geostatistical response variables with an intercept plus two covariates each.
`area.beta`	a list of column matrices. Regression coefficient for lattice data, e.g. see above.
`nvar.pp`	numeric. Number of point pattern response variables to simulate.
`distributions`	character vector. Names of distributions for each dependent variables with geostatistical and lattice type, currently “Gaussian” or “normal”, “Poisson” (count) and “Bernoulli” (binary) are supported. Note: no distribution for point pattern data is required to be specified.
`design.mat`	matrix. Design matrix for the latent Gaussian process(es), with the number of columns equal to the number of latent processes.
`pp.offset`	numeric. A single offset term for the intensity of point pattern data.
`seed`	integer. Random seed.

Details

The exponential covariance model is used,

C(d) = \sigma^2 \exp{-d/\phi}

where d is the Euclidean distance, \sigma^2 is the partial sill and \phi is the spatial range.

If the purpose is to validate a fitted latent spatial components of a spatial fusion model, one can check the fitted latent values against mrf[sample.ind, -1:2]. If the purpose is to investigate prediction performance of latent spatial components, one can predict at locations pred.loc and check against mrf[pred.ind, -1:2].

Value

The returned value is a list that consists of:

`data`	a named list providing data variables.
`mrf`	a data.frame of locations and the latent Gaussian process.
`domain`	an object of class `sf` describing the whole domain.
`pred.loc`	a data.frame of locations for prediction.
`pred.ind`	a vector of indices for prediction locations.
`sample.ind`	a vector providing the indices of sampled locations in the Gaussian process. (only if there is geostatistical data)
`mean.w`	a list of aggregated latent process for each area. (only if there is lattice data)
`poly`	an object of class `sf` for the lattice data. (only if there is lattice data)
`lgcp.grid`	a data.frame containing the centroids of gridded cells for point pattern data and the corresponding event counts. (only if there is point pattern data)

Author(s)

Craig Wang

Examples

# three responses with a single latent Gaussian process
dat1 <- fusionSimulate(n.point = 50, n.area = 5, n.grid = 2,
            psill = 1, phi = 1, nugget = 0, tau.sq = 0.5,
            point.beta = list(rbind(1,5)), area.beta = list(rbind(-1, 0.5)),
            distributions = c("normal","poisson"), pp.offset = 1,
            design.mat = matrix(c(1,1,1)))


# three responses with two latent Gaussian processes
dat2 <- fusionSimulate(n.point = 50, n.area = 5, n.grid = 2,
               psill = c(1,2), phi = c(2,1), nugget = c(0,0), tau.sq = 1,
               point.beta = list(rbind(1,5)), area.beta = list(rbind(-1, 0.5)),
               distributions = c("normal","poisson"), pp.offset = 0.1,
               design.mat = matrix(c(1,1,1,2,3,4), ncol = 2))

spatialfusion documentation built on June 22, 2025, 5:07 p.m.