Evaluate Geospatial Method Accuracy
Before using the predictions or prediction uncertainty estimates, the overall accuracy of the interpolation method or model must be assessed. The interpolated values should be checked to make sure they are consistent with the CSM and other sources of information. In addition, the method should be evaluated using formal statistical methods for assessing model fit. The primary model assessment methods are cross-validation and validation.
Cross-validation involves removing one observed value at a time from the data set, using the model to calculate a predicted value at that point, and then comparing the predicted value with the observed value. With N data points, this produces N tests of model validity and thus an overall evaluation of model accuracy. Similarly, a model can be validated by dividing the observed data set randomly into two data sets and using each set to calculate predicted values for the other; this is called two-fold cross-validation. Measures of model accuracy can also be compared in order to select among alternative models. In general, the simplest model with adequate accuracy should be selected for the application.
All of the interpolation methods (simple, more complex, and advanced) provide a set of predictions at unsampled locations. More complex and advanced geospatial interpolation methods also provide an assessment of prediction uncertainty at unsampled locations through the prediction standard error. Conditional simulation provides a more complete characterization of uncertainty because it generates an estimate of the possible distribution of values that might be found at each unsampled location. The predictions and prediction uncertainty measures provided by the geospatial interpolation methods are only accurate if the interpolation method is based on a model that adequately fits the data.
The most common way to evaluate model accuracy, or quality of fit, is through cross-validation. Cross-validation assesses the accuracy of an interpolation model by computing and examining the prediction errors, also referred to as residual errors (or “residuals”). A residual error is calculated by removing one observation from the data set and using the remaining data and the specified model to predict a value at that location. The residual is the difference between the interpolated value and the observed value:
Residual error = interpolated value − observed value
This process is repeated for each individual data value and summary statistics are then calculated for the residual data set (Golden Software 2002). This is sometimes referred to as “leave-one-out” cross-validation. Other less common methods are the holdout method or k-fold cross-validation, which are not covered in this document.
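As a sketch, leave-one-out cross-validation for a simple interpolator such as IDW can be written in a few lines. The function names and the IDW power parameter below are illustrative choices, not taken from any specific software package:

```python
import numpy as np

def idw_predict(x, y, z, x0, y0, power=2.0):
    """Inverse-distance-weighted prediction at location (x0, y0)."""
    d = np.hypot(x - x0, y - y0)
    w = 1.0 / d**power
    return np.sum(w * z) / np.sum(w)

def loo_residuals(x, y, z, power=2.0):
    """Leave-one-out residuals: interpolated value minus observed value."""
    n = len(z)
    res = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i          # withhold observation i
        pred = idw_predict(x[keep], y[keep], z[keep], x[i], y[i], power)
        res[i] = pred - z[i]
    return res
```

Each residual is computed with the withheld point excluded, so the prediction at that location is made only from the remaining data.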
Validation entails calculating residuals for measurement locations that were not used in the original interpolation. The data are randomly divided into two groups: the test data set and the training data set. The training data set is used to develop the model and make predictions at the locations of the test data set. Residuals are calculated at the test data locations. Validation is generally a better way to assess prediction accuracy than cross-validation because it does not use any of the data from the original interpolation model, reducing the associated bias that makes the model appear to perform better than it does in reality.
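The random partition used for validation can be sketched as follows; the function name, seed, and 50/50 split are illustrative assumptions:

```python
import numpy as np

def validation_split(n, test_fraction=0.5, seed=0):
    """Randomly partition n sample indices into test and training sets."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(n)
    n_test = int(round(n * test_fraction))
    return idx[:n_test], idx[n_test:]  # (test indices, training indices)
```

The model is then fit using only the training indices, and residuals are calculated at the test locations.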
Different sets of cross-validation and validation statistics are available for performing model diagnostics, depending on the complexity of the model used. These statistics are further described in the sections below.
Determine Errors in Simple Methods
For simple geospatial models (for example, IDW), after computing the cross-validation or validation errors, the mean error (same units as the data; a measure of prediction bias) and the root-mean-square error (RMSE; a measure of prediction accuracy) can be calculated. Simple methods do not estimate prediction uncertainty at unsampled locations; however, cross-validation may be used to estimate prediction variation at individual sample locations.
Other statistics, such as the absolute residual mean (mean of the absolute values of the residuals), the residual standard deviation, or the scaled RMSE (RMSE divided by the range of the data), may be helpful in the analysis. It is standard practice to plot the predicted values versus the measured values, and the residual values versus the measured values, to help interpret the calculated statistics. It is also useful to plot the residuals on a map and interpolate them to produce a continuous surface of interpolation error. This approach helps to assess whether the interpolation model performs better or worse in certain portions of the site.
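The summary statistics above can be computed directly from the residuals. This sketch assumes residuals are defined as interpolated minus observed, consistent with the residual equation earlier in this section; the function and dictionary key names are illustrative:

```python
import numpy as np

def residual_summary(residuals, observed):
    """Summary statistics for cross-validation or validation residuals
    (residual = interpolated value minus observed value)."""
    r = np.asarray(residuals, dtype=float)
    rmse = np.sqrt(np.mean(r**2))
    return {
        "mean_error": r.mean(),               # prediction bias
        "rmse": rmse,                         # prediction accuracy
        "abs_residual_mean": np.abs(r).mean(),
        "scaled_rmse": rmse / np.ptp(observed),  # RMSE / range of the data
    }
```

Acceptable values for these statistics are project specific; they are most useful for comparing candidate interpolation models against each other.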
When interpreting the cross-validation/validation statistics for simple geospatial models, consider the following factors (Johnston et al. 2003):
- Predictions should be as close to the measured values as possible; that is, the scatter about the 1:1 line on the predicted-versus-measured plot should be minimal. The smaller the root-mean-square prediction error, the better.
- Predictions should be unbiased (centered on the measurement values). If the prediction errors are unbiased, then the mean prediction error should be near zero without a significant positive or negative bias.
Acceptable cross-validation/validation error thresholds are project specific and should be set based on the interpolation objectives and the scale of the data. These statistics are most useful for comparing the performance of different interpolation models to help select the most accurate one.
Determine Errors in More Complex and Advanced Methods
In addition to predictions at unsampled locations, more complex (regression) and advanced (geostatistical) methods also provide prediction standard errors, which estimate the uncertainty at each prediction location. Three additional metrics of method accuracy can be calculated: the mean standardized error (dimensionless), the average standard error (analogous to the root-mean-square error), and the root-mean-square standardized error (a measure of how well the model quantifies prediction variability). Beyond assessing the overall ability of the model to make good predictions, this additional set of statistics allows assessment of how accurately the model reflects the variability of the data. The mean error and RMSE are still useful metrics for advanced methods and should be calculated and presented as with simple methods.
When interpreting the cross-validation/validation statistics for regression and geostatistical models, consider the following factors (Johnston et al. 2003):
- Prediction errors depend on the scale and units of the data, so it is better to assess standardized prediction errors, which are given as prediction errors divided by their prediction standard errors. The mean of these should also be near zero.
- If the average standard error is close to the root-mean-square prediction error, then the model generally reflects the variability of the data, and the root-mean-square standardized error should be close to one. Variability is likely overestimated if the average standard error is greater than the root-mean-square prediction error, or if the root-mean-square standardized error is less than one. Variability is likely underestimated if the average standard error is less than the root-mean-square prediction error, or if the root-mean-square standardized error is greater than one.
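These standardized diagnostics can be sketched as follows, assuming residuals (interpolated minus observed) and prediction standard errors are available at the same cross-validation locations; the function and key names are illustrative:

```python
import numpy as np

def standardized_error_stats(residuals, std_errors):
    """Diagnostics based on prediction standard errors
    (regression and geostatistical methods)."""
    r = np.asarray(residuals, dtype=float)
    s = np.asarray(std_errors, dtype=float)
    z = r / s  # standardized prediction errors
    return {
        "mean_standardized_error": z.mean(),               # should be near zero
        "average_standard_error": np.sqrt(np.mean(s**2)),  # compare to RMSE
        "rms_standardized_error": np.sqrt(np.mean(z**2)),  # should be near one
    }
```

An RMS standardized error well below one suggests the model overestimates prediction variability; well above one suggests it underestimates it.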
When comparing the performance of regression and geostatistical models through cross-validation, the better model will have the standardized mean nearest to zero, the smallest root mean square prediction error, the average standard error nearest the root mean square prediction error, and the standardized root mean square prediction error nearest to one. For regression methods, other goodness-of-fit criteria, such as the Akaike information criterion (AIC) and the Bayesian information criterion (BIC), can also be used to support the choice of model and goodness-of-fit assessment (Akaike 1974; Sakamoto, Ishiguro, and Kitagawa 1986).
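For a least-squares regression fit with Gaussian errors, AIC and BIC can be computed from the residual sum of squares (up to an additive constant that cancels when comparing models on the same data). This is a sketch using the standard formulas, with illustrative function names:

```python
import numpy as np

def aic_bic(residuals, n_params):
    """AIC and BIC for a least-squares (Gaussian-error) fit, up to an
    additive constant; lower values indicate a better trade-off between
    goodness of fit and model complexity."""
    r = np.asarray(residuals, dtype=float)
    n = len(r)
    rss = np.sum(r**2)                       # residual sum of squares
    aic = n * np.log(rss / n) + 2 * n_params
    bic = n * np.log(rss / n) + n_params * np.log(n)
    return aic, bic
```

Because BIC penalizes additional parameters more heavily than AIC for moderate to large n, the two criteria can favor different trend models; both should be reported when used to support model selection.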
Cross-validation was used to evaluate the four models displayed in Figure 46. One way to graphically evaluate the results is to plot the predicted value versus the observed value, as shown in Figure 47. If the predictions match perfectly, the points fall on the 45-degree (1:1) line. The best-fit line to the points is shown on the figure in blue. The fit lines for all four of the models have a flatter slope than the 45-degree line shown in black. All of the methods tend to smooth the data, leading to underprediction of the higher values and overprediction of the lower values. This is a general characteristic of all interpolation methods except conditional simulation.
The cross-validation statistics for the four models are shown in Table 4. Cross-validation finds the prediction error at each of the data points by withholding that point and comparing the model's prediction at its location with the observed value. The prediction error is the difference between the prediction and the observed value. These errors are summarized in two ways: the mean and the root mean square (RMS). The mean error indicates whether the predictions are biased by being, on average, too high or too low. The RMS error is a measure of the total error in either direction.
Kriging methods also produce a prediction standard error at each location. For these methods, a standardized cross-validation error can be calculated by dividing the cross-validation prediction error by the prediction standard error. If the RMS standardized prediction error is less than one, it means that the method is producing prediction standard errors that overestimate the actual prediction error. If the RMS standardized error is more than one, the method is underestimating the actual prediction errors.
As shown in Table 4, the three kriging methods produce a much lower RMS error than IDW. There is little difference between the performance of the ordinary kriging methods with different variogram models. Ordinary kriging assumes no trend, only a constant mean. Kriging with external drift (trend) models the trend with a regression on distance to the river, resulting in a significant improvement in RMS error. In addition, kriging with external drift has a RMS standardized error closer to one than the ordinary kriging methods. Based on the cross-validation statistics, in this example kriging with external drift yields the best results.
Figure 47. Predicted values versus observed values.
Table 4. Cross-validation statistics
| Method | Variogram | Mean Error | Root-Mean-Square Error | Mean Standardized Error | Root-Mean-Square Standardized Error |
| --- | --- | --- | --- | --- | --- |
| Kriging with External Drift | Spherical | -0.00285 | 0.375 | -0.00377 | 1.04 |