etas: Fit the space-time ETAS model to data

View source: R/etas.R

etasR Documentation

Fit the space-time ETAS model to data

Description

A function to fit the space-time version of the Epidemic Type Aftershock Sequence (ETAS) model to a catalog of earthquakes (a spatio-temporal point pattern) and perform a stochastic declustering method.

Usage

 etas(object, param0 = NULL, bwd = NULL, nnp = 5, bwm = 0.05,
      verbose = TRUE, plot.it = FALSE, ndiv = 1000, no.itr = 11,
      rel.tol=1e-03, eps = 1e-06, cxxcode = TRUE, nthreads = 1,
      mver = 1)

Arguments

object

An object of class "catalog" containing an earthquake catalog dataset.

param0

Initial guess for model parameters. A numeric vector of appropriate length (currently 8). See details.

bwd

Optional. Bandwidths for smoothness and integration on the geographical region win. A numeric vector which has the length of the number of events. If not supplied, the following arguments nnp and bwm determine bandwidths.

nnp

Number of nearest neighbors for bandwidth calculations. An integer.

bwm

Minimum bandwidth. A positive numeric value.

verbose

Logical flag indicating whether to print progress reports.

plot.it

Logical flag indicating whether plot probabilities of each event being a background event on a map.

ndiv

An integer indicating the number of knots on each side of the geographical region for integral approximation.

no.itr

An integer indicating the number of iterations for convergence of the iterative approach of simultaneous estimation and declustering algorithm. See details.

rel.tol

Relative iteration convergence tolerance of the iterative estimation approach.

eps

Optimization convergence tolerance in the Davidon-Fletch-Powell algorithm

cxxcode

Logical flag indicating whether to use the C++ code. The C++ code is slightly faster and allows parallel computing.

nthreads

An integer indicating number of threads in the parallel region of the C++ code

mver

An integer indicating which spatial probability density function for locations of triggered events should be use. The default mver=1 corresponds to the inverse power density and mver=2 corresponds to the Gaussian density.

Details

Ogata (1988) introduced the epidemic type aftershock sequence (ETAS) model based on Gutenberg-Richter law and modified Omori law. In its space-time representation (Ogata, 1998), the ETAS model is a temporal marked point process model, and a special case of marked Hawkes process, with conditional intensity function

\lambda(t, x, y | H_t) = \mu(x,y) + \sum_{t_i < t} k(m_i)g(t - t_i)f(x - x_i, y - y_i|m_i)

where

H_t:

is the observational history up to time t, but not including t; that is

H_t=\{(t_i, x_i, y_i, m_i): t_i < t\}

\mu(x,y):

is the background intensity. Currently it is assumed to take the semi-parametric form

\mu(x,y)=\mu u(x,y)

where \mu is an unknown constant and u(x,y) is an unknown function.

k(m):

is the expected number of events triggered from an event of magnitude m given by

k(m) = A\exp(\alpha(m - m_0))

g(t):

is the p.d.f of the occurrence times of the triggered events, taking the form

g(t) = \frac{p-1}{c}(1 + \frac{t}{c})^{-p}

f(x,y|m):

is the p.d.f of the locations of the triggered events, considered to be either the long tail inverse power density (mver = 1)

f(x, y|m) = \frac{q-1}{\pi \sigma(m))} (1 + \frac{x^2 + y^2}{\sigma(m)})^{-q}

or the light tail Gaussian density (mver = 2, only can be used if cxxcode = TRUE)

f(x,y|m)= \frac{1}{2\pi \sigma(m)}\exp(-\frac{x^2 + y^2}{2\sigma(m)})

with

\sigma(m) = D\exp(\gamma(m - m_0))

The ETAS models classify seismicity into two components, background seismicity \mu(x, y) and clustering seismicity \lambda(t, x, y|H_t) - \mu(x, y), where each earthquake event, whether it is a background event or generated by another event, produces its own offspring according to the branching rules controlled by k(m), g(m) and f(x, y|m).

Background seismicity rate u(x, y) and the model parameters

\theta=(\mu, A, c, \alpha, p, D, q, \gamma)

are estimated simultaneously using an iterative approach proposed in Zhuang et al. (2002). First, for an initial u_0(x, y), the parameter vector \theta is estimated by maximizing the log-likelihood function

l(\theta)=\sum_{i} \lambda(t_i, x_i, y_i|H_{t_i}) - \int \lambda(t, x, y|H_t) dx dy dt.

Then the procedure calculates the probability of being a background event for each event in the catalog by

\phi_i = \frac{\mu(x_i, y_i)}{\lambda(t_i, x_i, y_i|H_{t_i})}.

Using these probabilities and kernel smoothing method with Gaussian kernel and appropriate choice of bandwidth (determined by bwd or nnp and bwm arguments), the background rate u_0(x, y) is updated. These steps are repeated until the estimates converge (stabilize).

The no.itr argument specifies the maximum number of iterations in the iterative simultaneous estimation and declustering algorithm. The estimates often converge in less than ten iterations. The relative iteration convergence tolerance and the optimization convergence tolerance are, respectively, determined by rel.tol and eps arguments. The progress of the computations can be traced by setting the verbose and plot.it arguments to be TRUE.

If cxxcode = TRUE, then the internal function etasfit uses the C++ code implemented using the Rcpp package, which allows multi-thread parallel computing on multi-core processors with OpenMP. The argument nthreads in this case determines the number of threads in the parallel region of the code. If nthreads = 1 (the default case), then a serial version of the C++ code carries out the computations.

This version of the ETAS model assumes that the earthquake catalog is complete and the data are stationary in time. If the catalog is incomplete or there is non-stationarity (e.g. increasing or cyclic trend) in the time of events, then the results of this function are not reliable.

Value

A list with components

param:

The ML estimates of model parameters.

bk:

An estimate of the u(x, y).

pb:

The probabilities of being background event.

opt:

The results of optimization: the value of the log-likelihood function at the optimum point, its gradient at the optimum point and AIC of the model.

rates:

Pixel images of the estimated total intensity, background intensity, clustering intensity and conditional intensity.

Note

This function is based on a C port of the original Fortran code by Jiancang Zhuang, Yosihiko Ogata and their colleagues. The etas function is intended to be used for small and medium-size earthquake catalogs. For large earthquake catalogs, due to time-consuming computations, it is highly recommended to use the parallel Fortran code on a server machine. The Fortran code (implemented for parallel/non-parallel computing) can be obtained from http://bemlar.ism.ac.jp/zhuang/software.html.

Author(s)

Abdollah Jalilian jalilian@razi.ac.ir

References

Ogata Y (1988). Statistical Models for Earthquake Occurrences and Residual Analysis for Point Processes. Journal of the American Statistical Association, 83(401), 9–27. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.2307/2288914")}.

Ogata Y (1998). Space-time Point-process Models for Earthquake Occurrences. Annals of the Institute of Statistical Mathematics, 50(2), 379–402. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1023/a:1003403601725")}.

Zhuang J, Ogata Y, Vere-Jones D (2002). Stochastic Declustering of Space-Time Earthquake Occurrences. Journal of the American Statistical Association, 97(458), 369–380. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1198/016214502760046925")}.

Zhuang J, Ogata Y, Vere-Jones D (2006). Diagnostic Analysis of Space-Time Branching Processes for Earthquakes. In Case Studies in Spatial Point Process Modeling, pp. 275–292. Springer Nature. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1007/0-387-31144-0_15")}.

Zhuang J (2011). Next-day Earthquake Forecasts for the Japan Region Generated by the ETAS Model. Earth, Planets and Space, 63(3), 207–216. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.5047/eps.2010.12.010")}.

See Also

catalog for constructing data. probs for estimated declustering probabilities. resid.etas for diagnostic plots.

Examples

  # fitting the ETAS model to an Iranian catalog
  # preparing the catalog
  iran.cat <- catalog(iran.quakes, time.begin="1973/01/01",
     study.start="1986/01/01", study.end="2016/01/01",
     lat.range=c(26, 40), long.range=c(44, 63), mag.threshold=5)
  print(iran.cat)
  ## Not run: 
  plot(iran.cat)
## End(Not run)

  # setting initial parameter values
  param0 <- c(0.46, 0.23, 0.022, 2.8, 1.12, 0.012, 2.4, 0.35)

  # fitting the model
  ## Not run: 
  iran.fit <- etas(iran.cat, param0=param0)
## End(Not run)


  # fitting the ETAS model to an Italian catalog
  # preparing the catalog
  italy.cat <- catalog(italy.quakes, dist.unit="km")
  ## Not run: 
  plot(italy.cat)
## End(Not run)

  # setting initial parameter values
  mu <- 1
  k0 <- 0.005
  c <- 0.005
  alpha <- 1.05
  p <- 1.01
  D <- 1.1
  q <- 1.52
  gamma <- 0.6
  # reparametrization: transform k0 to A
  A <- pi * k0 / ((p - 1) * c^(p - 1) * (q - 1) * D^(q - 1))
  param0 <- c(mu, A, c, alpha, p, D, q, gamma)

  # fitting the model
  ## Not run: 
  nthreads <- parallel::detectCores()
  italy.fit <- etas(italy.cat, param0, nthreads=nthreads)
## End(Not run)


  # fitting the ETAS model to a Japanese catalog
  # setting the target polygonal study region
  jpoly <- list(long=c(134.0, 137.9, 143.1, 144.9, 147.8,
      137.8, 137.4, 135.1, 130.6), lat=c(31.9, 33.0, 33.2,
      35.2, 41.3, 44.2, 40.2, 38.0, 35.4))
  # preparing the catalog
  japan.cat <- catalog(japan.quakes, study.start="1953-05-26",
      study.end="1990-01-08", region.poly=jpoly, mag.threshold=4.5)
  ## Not run: 
  plot(japan.cat)
## End(Not run)

  # setting initial parameter values
  param0 <- c(0.592844590, 0.204288231, 0.022692883, 1.495169224,
  1.109752319, 0.001175925, 1.860044210, 1.041549634)

  # fitting the model
  ## Not run: 
  nthreads <- parallel::detectCores()
  japan.fit <- etas(japan.cat, param0, nthreads=nthreads)
## End(Not run)

ETAS documentation built on Sept. 30, 2024, 9:42 a.m.