plot.uvevd: Plot Diagnostics for a Univariate EVD Object
In evd: Functions for Extreme Value Distributions

plot.uvevd

R Documentation

Plot Diagnostics for a Univariate EVD Object

Description

Four plots (selectable by which) are currently provided: a P-P plot, a Q-Q plot, a density plot and a return level plot.

Usage

## S3 method for class 'uvevd'
plot(x, which = 1:4, main, ask = nb.fig <
    length(which) && dev.interactive(), ci = TRUE, cilwd = 1,
    a = 0, adjust = 1, jitter = FALSE, nplty = 2, ...)
## S3 method for class 'gumbelx'
plot(x, interval, which = 1:4, main, ask = nb.fig <
    length(which) && dev.interactive(), ci = TRUE, cilwd = 1,
    a = 0, adjust = 1, jitter = FALSE, nplty = 2, ...)

Arguments

`x`	An object that inherits from class `"uvevd"`.
`which`	If a subset of the plots is required, specify a subset of the numbers `1:4`.
`main`	Title of each plot. If given, must be a character vector with the same length as `which`.
`ask`	Logical; if `TRUE`, the user is asked before each plot.
`ci`	Logical; if `TRUE` (the default), plot simulated 95% confidence intervals for the P-P, Q-Q and return level plots.
`cilwd`	Line width for confidence interval lines.
`a`	Passed through to `ppoints` for empirical estimation. Larger values give less probability for extreme events.
`adjust`, `jitter`, `nplty`	Arguments to the density plot. The density of the fitted model is plotted with a rug plot and (optionally) a non-parameteric estimate. The argument `adjust` controls the smoothing bandwidth for the non-parametric estimate (see `density`). `jitter` is logical; if `TRUE`, the (possibly transformed) data are jittered to produce the rug plot. This need only be used if the data contains repeated values. `nplty` is the line type of the non-parametric estimate. To omit the non-parametric estimate set `nplty` to zero.
`interval`	A vector of length two, for the gumbelx (maximum of two Gumbels) model. This is passed to the uniroot function to calculate quantiles for the Q-Q and return level plots. The interval should be large enough to contain all plotted quantiles or an error from uniroot will occur.
`...`	Other parameters to be passed through to plotting functions.

Details

The following discussion assumes that the fitted model is stationary. For non-stationary generalized extreme value models the data are transformed to stationarity. The plot then corresponds to the distribution obtained when all covariates are zero.

The P-P plot consists of the points

\{(G_n(z_i), G(z_i)), i = 1,\ldots,m\}

where G_n is the empirical distribution function (defined using ppoints), G is the model based estimate of the distribution (generalized extreme value or generalized Pareto), and z_1,\ldots,z_m are the data used in the fitted model, sorted into ascending order.

The Q-Q plot consists of the points

\{(G^{-1}(p_i), z_i), i = 1,\ldots,m\}

where G^{-1} is the model based estimate of the quantile function (generalized extreme value or generalized Pareto), p_1,\ldots,p_m are plotting points defined by ppoints, and z_1,\ldots,z_m are the data used in the fitted model, sorted into ascending order.

The return level plot for generalized extreme value models is defined as follows.

Let G be the generalized extreme value distribution function, with location, scale and shape parameters a, b and s respectively. Let z_t be defined by G(z_t) = 1 - 1/t. In common terminology, z_t is the return level associated with the return period t.

Let y_t = -1/\log(1 - 1/t). It follows that

z_t = a + b(y_t^s - 1)/s.

When s = 0, z_t is defined by continuity, so that

z_t = a + b\log(y_t).

The curve within the return level plot is z_t plotted against y_t on a logarithmic scale, using maximum likelihood estimates of (a,b,s). If the estimate of s is zero, the curve will be linear. For large values of t, y_t is approximately equal to the return period t. It is usual practice to label the x-axis as the return period.

The points on the plot are

\{(-1/\log(p_i), z_i), i = 1,\ldots,m\}

where p_1,\ldots,p_m are plotting points defined by ppoints, and z_1,\ldots,z_m are the data used in the fitted model, sorted into ascending order. For a good fit the points should lie “close” to the curve.

The return level plot for peaks over threshold models is defined as follows.

Let G be the generalized Pareto distribution function, with location, scale and shape parameters u, b and s respectively, where u is the model threshold. Let z_m denote the m period return level (see fpot and the notation therein). It follows that

z_m = u + b((pmN)^s - 1)/s.

When s = 0, z_m is defined by continuity, so that

z_m = u + b\log(pmN).

The curve within the return level plot is z_m plotted against m on a logarithmic scale, using maximum likelihood estimates of (b,s,p). If the estimate of s is zero, the curve will be linear.

The points on the plot are

\{(1/(pN(1-p_i)), z_i), i = 1,\ldots,m\}

Examples

uvdata <- rgev(100, loc = 0.13, scale = 1.1, shape = 0.2)
M1 <- fgev(uvdata)
## Not run: par(mfrow = c(2,2))
## Not run: plot(M1)

uvdata <- rgpd(100, loc = 0, scale = 1.1, shape = 0.2)
M1 <- fpot(uvdata, 1)
## Not run: par(mfrow = c(2,2))
## Not run: plot(M1)

evd documentation built on Sept. 21, 2024, 9:06 a.m.