Advanced Methods

This section presents an overview of advanced geospatial methods, which are used to estimate values at unsampled locations and model the spatial correlation of the data. These methods include varieties of kriging and conditional simulation. Kriging is a spatial interpolation method that allows estimation of values at unsampled locations and provides an estimate of the uncertainty in the interpolated values. Selection of a particular kriging method depends on the characteristics of the data set, such as trends present in the data or the degree of spatial correlation, which can be determined using variograms and other spatial correlation models. Information about using spatial correlation models, different kriging methods, and conditional simulation is also presented in this section.

Spatial Correlation Models for Advanced Methods

Kriging and simulation methods require a model of spatial correlation. Spatial autocorrelation can be modeled using the variogram or covariance function. Typically, the empirical variogram is plotted based on the data, and a variogram model is fit to the empirical variogram. These activities may be referred to as variography. In general, variography encompasses directional spatial autocorrelation, bivariate autocorrelation, and multivariate spatial autocorrelation.

Most advanced geospatial methods rely on a search neighborhood to generate spatial predictions. The search neighborhood is selected based on the underlying spatial autocorrelation in the sampled population, and is simply the radius within which known values are used to predict unknown variables. Recognizing that correlation decreases with distance, the optimal search neighborhood is one which includes known values with a large influence and excludes the rest.

Construction of the Empirical Variogram

As part of EDA, the empirical variogram is constructed by plotting one-half the squared difference in values (semivariance) for each pair of sampling points as a function of distance separating the points (variogram cloud; see Figure 78). The variogram expresses the variability of the data set as a function of space: if data are spatially correlated, then on average, close sample points are more alike and have a smaller semivariance than samples farther apart.

The choice of the variogram parameters (for example, lag, direction) is a fundamental step for using advanced geospatial methods and should be done so as to be as representative as possible of the spatial characteristics of the data set. For example, if anisotropy is observed, which is common in environmental data, then an anisotropic variogram must be built that takes into account different spatial directions. Experimental variogram parameters are intrinsically linked to each particular data set; nevertheless, some recommendations can be made in order to create a suitable experimental curve.

Lag

When sampling is performed by following a regular grid, the regular distance between samples is taken as the value of the variogram lag. Distance between samples, however, is often irregular and the lag of the variogram then may be chosen by taking into account the different distances between the pairs of sampling locations. In this case, the value of the lag may be calculated by taking the average of distances between the sampling locations. As the distance between samples becomes larger, the reliability of the estimates of semivariance goes down. Consequently, a rough rule of thumb is that the maximum lag distance should not exceed half of the maximum distance between samples (see Figure 76).