sample.variogram: Computing (Robust) Sample Variograms of Spatial Data

The function sample.variogram computes the sample (empirical) variogram of a spatial variable by the method-of-moments estimator and three robust estimators. Both omnidirectional and direction-dependent variograms can be computed, the latter for observation locations in a domain of at most three dimensions. There are summary and plot methods for summarizing and displaying sample variograms.


sample.variogram(object, ...)

## Default S3 method:
sample.variogram(object, locations, lag.dist.def,
    xy.angle.def = c(0, 180), xz.angle.def = c(0, 180), max.lag = Inf,
    estimator = c("qn", "mad", "matheron", "ch"), mean.angle = TRUE, ...)

## S3 method for class 'formula'
sample.variogram(object, data, subset, na.action,
    locations, lag.dist.def, xy.angle.def = c(0, 180),
    xz.angle.def = c(0, 180), max.lag = Inf,
    estimator = c("qn", "mad", "matheron", "ch"), mean.angle = TRUE, ...)

## S3 method for class 'georob'
sample.variogram(object, lag.dist.def,
    xy.angle.def = c(0, 180), xz.angle.def = c(0, 180), max.lag = Inf,
    estimator = c("qn", "mad", "matheron", "ch"), mean.angle = TRUE, ...)

## S3 method for class 'sample.variogram'
summary(object, ...)

## S3 method for class 'sample.variogram'
plot(x, type = "p", add = FALSE,
    xlim = c(0, max(x[["lag.dist"]])),
    ylim = c(0, 1.1 * max(x[["gamma"]])), col, pch, lty, cex = 0.8,
    xlab = "lag distance", ylab = "semivariance",
    annotate.npairs = FALSE, npairs.pos = 3, npairs.cex = 0.7,
    legend = nlevels(x[["xy.angle"]]) > 1 || nlevels(x[["xz.angle"]]) > 1,
    legend.pos = "topleft", ...)



object

a numeric vector with the values of the response for which the sample variogram should be computed (sample.variogram.default), a formula specifying in its left part the response variable (the right part of the formula is ignored, sample.variogram.formula), an object of class georob (sample.variogram.georob) or an object of class sample.variogram (summary.sample.variogram).


locations

a numeric matrix with the coordinates of the locations where the response was observed (sample.variogram.default) or a one-sided formula specifying the coordinates (sample.variogram.formula). The matrix may have an arbitrary number of columns for an omnidirectional variogram, but at most 3 columns if a directional variogram is computed.


data

an optional data frame, list or environment (or another object coercible by as.data.frame to a data frame) containing the response variable and the coordinates where the data was recorded. If not found in data, the variables are taken from environment(formula), typically the environment from which sample.variogram is called.


subset

an optional vector specifying a subset of observations to be used for estimating the variogram.


na.action

a function which indicates what should happen when the data contain NAs. The default is set by the na.action argument of options, and is na.fail if that is unset. The “factory-fresh” default is na.omit. Another possible value is NULL, no action. Value na.exclude can be useful.


lag.dist.def

a numeric scalar defining a constant bin width for grouping the lag distances or a numeric vector with the bounds of a set of contiguous bins (upper bounds of bins except for the first element of lag.dist.def which is the lower bound of the first bin). This argument is mandatory.


xy.angle.def

a numeric vector defining angular classes in the horizontal plane for computing directional variograms. xy.angle.def must contain an ascending sequence of azimuth angles in degrees from north (positive clockwise to south), see Details. Omnidirectional variograms are computed with the default c(0, 180).


xz.angle.def

a numeric vector defining angular classes in the x-z-plane for computing directional variograms. xz.angle.def must contain an ascending sequence of angles in degrees from zenith (positive clockwise to nadir), see Details. Omnidirectional variograms are computed with the default c(0, 180).


max.lag

a positive numeric scalar defining the largest lag distance for which semivariances should be computed (default: no restriction).


estimator

a character keyword defining the estimator for computing the sample variogram. Possible values are:

  • "qn": Genton's robust Qn-estimator (default, Genton, 1998),

  • "mad": Dowd's robust MAD-estimator (Dowd, 1984),

  • "matheron": non-robust method-of-moments estimator,

  • "ch": robust Cressie-Hawkins estimator (Cressie and Hawkins, 1980).


mean.angle

a logical scalar controlling whether the mean lag vector (per combination of lag distance and angular class) is computed from the mean angles of all the lag vectors falling into a given class (TRUE, default) or from the mid-angles of the respective angular classes (FALSE).


x

an object of class sample.variogram.

type, xlim, ylim, xlab, ylab

see respective arguments of plot.default.


add

a logical scalar controlling whether a new plot should be generated (FALSE, default) or whether the information should be added to the current plot (TRUE).


col

a vector with the colours of plotting symbols for distinguishing semivariances for angular classes in the x-y-plane.


pch

a vector with the types of plotting symbols for distinguishing semivariances for angular classes in the x-z-plane.


lty

the line type.


cex

a numeric scalar with the character expansion factor for plotting symbols.


annotate.npairs

a logical scalar controlling whether the plotting symbols should be annotated by the number of data pairs per lag class.


npairs.pos

an integer defining the position where the text annotation with the number of pairs should be plotted, see text.


npairs.cex

a numeric scalar defining the character expansion for the text annotation with the number of pairs.


legend

a logical scalar controlling whether a legend should be plotted.


legend.pos

a character keyword defining where to place the legend, see legend for possible values.


...

additional arguments passed to plot.formula.
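The roles of lag.dist.def and estimator can be illustrated with a minimal base-R sketch. It groups pairwise lag distances into bins of constant width and applies textbook forms of the Matheron, Dowd (MAD) and Cressie-Hawkins estimators to the response differences in each bin. The simulated data and all names (xy, z, bounds, gamma.matheron, gamma.mad, gamma.ch) are hypothetical, the closure of the bin intervals is an assumption, and the code is not what sample.variogram runs internally. The Qn-estimator is left out because base R provides no Qn scale function.

```r
# Sketch only: textbook estimators on simulated data, not the package internals.
set.seed(1)
n  <- 60
xy <- cbind(x = runif(n, 0, 100), y = runif(n, 0, 100))  # hypothetical coordinates
z  <- rnorm(n)                                            # hypothetical response

# all pairs i < j: lag distance h and difference d of the response
idx <- t(combn(n, 2))
h <- sqrt(rowSums((xy[idx[, 1], , drop = FALSE] - xy[idx[, 2], , drop = FALSE])^2))
d <- z[idx[, 1]] - z[idx[, 2]]

# a constant bin width expanded to a vector of bin bounds;
# the upper limit 60 plays the role of max.lag
bin.width <- 15
bounds <- seq(0, 60, by = bin.width)
lag.class <- cut(h, breaks = bounds)   # right-closed bins (assumption); NA beyond 60

# semivariance estimators for the differences falling into one lag class
gamma.matheron <- function(d) mean(d^2) / 2               # method of moments
gamma.mad <- function(d) mad(d, center = 0)^2 / 2         # Dowd: 2.198 * median(|d|)^2 / 2
gamma.ch <- function(d) {                                 # Cressie-Hawkins
  mean(sqrt(abs(d)))^4 / (2 * (0.457 + 0.494 / length(d)))
}

# split() drops the pairs whose lag class is NA
h.by <- split(h, lag.class)
d.by <- split(d, lag.class)
sv <- data.frame(
  lag.dist = sapply(h.by, mean),      # mean lag distance per class
  npairs   = lengths(h.by),           # number of pairs per class
  matheron = sapply(d.by, gamma.matheron),
  mad      = sapply(d.by, gamma.mad),
  ch       = sapply(d.by, gamma.ch)
)
print(sv)
```

Because the simulated response is spatially uncorrelated with unit variance, the three columns of estimates should all scatter around 1, which makes the sketch a quick sanity check of the formulas.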


The angular classes in the x-y- and x-z-plane are defined by vectors of ascending angles on the half circle. The ith angular class is defined by the vector elements, say l and u, with indices i and i+1. A lag vector belongs to the ith angular class if its azimuth (or angle from zenith), say φ, satisfies l < φ ≤ u. If the first and the last element of xy.angle.def or xz.angle.def are equal to 0 and 180 degrees, respectively, then the first and the last angular class are “joined”, i.e., if there are K angles, there will be only K-2 angular classes and the first class is defined by the interval (xy.angle.def[K-1] - 180, xy.angle.def[2]] and the last class by (xy.angle.def[K-2], xy.angle.def[K-1]].
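The class definition above, including the joining of the first and last class, can be sketched in base R as follows. The helper names azimuth and angle.class are hypothetical, the convention that x points east and y points north is an assumption, and so is the folding of an azimuth of exactly 0 onto 180 degrees; the code only mirrors the rule stated in the text and is not taken from the package.

```r
# Sketch only: mirrors the interval rule in the text, not the package internals.

# azimuth of a lag vector in degrees from north, positive clockwise
# (assumes x = easting, y = northing)
azimuth <- function(dx, dy) atan2(dx, dy) * 180 / pi

angle.class <- function(phi, angle.def) {
  K <- length(angle.def)
  phi <- phi %% 180                 # fold onto the half circle
  phi[phi == 0] <- 180              # assumption: treat 0 as 180 so that l < phi <= u can hold
  cls <- cut(phi, breaks = angle.def, labels = FALSE)   # class i is (angle.def[i], angle.def[i + 1]]
  if (angle.def[1] == 0 && angle.def[K] == 180) {
    cls[which(cls == K - 1)] <- 1L  # join the last interval with the first class
  }
  cls
}

angle.def <- c(0, 22.5, 67.5, 112.5, 157.5, 180)
angle.class(c(10, 170, 45, 90, 135), angle.def)
# 1 1 2 3 4
```

With K = 6 angles the sketch yields K-2 = 4 classes: azimuths of 10 and 170 degrees both land in the joined first class, which covers (-22.5, 22.5] once angles are taken modulo 180.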


All methods of the generic function sample.variogram return an object of class sample.variogram, which is a data frame with the following components:

lag.dist the mean lag distance of the lag class,
xy.angle the angular class in the x-y-plane,
xz.angle the angular class in the x-z-plane,
gamma the estimated semi-variance of the lag class,
npairs the number of data pairs in the lag class,
lag.x the x-component of the mean lag vector of the lag class,
lag.y the y-component of the mean lag vector of the lag class,
lag.z the z-component of the mean lag vector of the lag class.

The method summary.sample.variogram returns an object of class summary.sample.variogram, which is a list with the components lag.dist, npairs, xy.angle and xz.angle; see the description of objects of class sample.variogram above. There is a print method for objects of class summary.sample.variogram which invisibly returns the object unchanged.

The method plot.sample.variogram is called for its side effects and invisibly returns the sample.variogram object unchanged.


Andreas Papritz


data(wolfcamp)

## omnidirectional sample variogram
r.sv.iso <- sample.variogram(pressure~1, data = wolfcamp,
    locations = ~x + y, lag.dist.def = seq(0, 200, by = 15))

plot(r.sv.iso, type = "l")

## direction-dependent sample variogram
r.sv.aniso <- sample.variogram(pressure~1, data = wolfcamp,
    locations = ~x + y, lag.dist.def = seq(0, 200, by = 15),
    xy.angle.def = c(0., 22.5, 67.5, 112.5, 157.5, 180.))
plot(r.sv.aniso, type = "l", add = TRUE, col = 2:5)

