Skip to main content
Log in

Using Mahalanobis Distance to Detect and Remove Outliers in Experimental Covariograms

  • Original Paper
  • Published:
Natural Resources Research Aims and scope Submit manuscript

Abstract

Experimental variograms are crucial for most geostatistical studies. In kriging, for example, the variography has a direct influence on the interpolation weights. Despite the great importance of variogram estimators in predicting geostatistical features, they are commonly influenced by outliers in the dataset. The effect of some randomly spatially distributed outliers can mask the pattern of the experimental variogram and produce a destructuration effect, implying that the true data spatial continuity cannot be reproduced. In this paper, an algorithm to detect and remove the effect of outliers in experimental variograms using the Mahalanobis distance is proposed. An example of the algorithm’s application is presented, showing that the developed technique is able to satisfactorily detect and remove outliers from a variogram.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13

Similar content being viewed by others

References

  • Ben-Gal, I. (2005). Outlier detection. In Data mining and knowledge discovery handbook (pp. 131–146).

  • Costa, J. F. (2003). Reducing the impact of outliers in ore reserves estimation. Mathematical Geology, 35(3), 323–345.

    Article  Google Scholar 

  • Cressie, N., & Hawkins, D. M. (1980). Robust estimation of the variogram: I. Journal of the International Association for Mathematical Geology, 12(2), 115–125.

    Article  Google Scholar 

  • Dutter, R. (1996). On robust estimation of variograms in geostatistics. Robust statistics, data analysis, and computer intensive methods (pp. 153–171). Berlin: Springer.

    Chapter  Google Scholar 

  • Filzmoser, P. (2004). A multivariate outlier detection method. na.

  • Genton, M. G. (1998). Highly robust variogram estimation. Mathematical Geology, 30(2), 213–221.

    Article  Google Scholar 

  • Hazewinkel, M. (2001). Chebyshev inequality in probability theory. Encyclopedia of mathematics. Berlin: Springer.

    Google Scholar 

  • Krige, D. G., & Magri, E. J. (1982). Studies of the effects of outliers and data transformation on variogram estimates for a base metal and a gold ore body. Journal of the International Association for Mathematical Geology, 14(6), 557–564.

    Article  Google Scholar 

  • Lebrenz, H., & Bárdossy, A. (2017). Estimation of the variogram using Kendall’s tau for a robust geostatistical interpolation. Journal of Hydrologic Engineering, 22(9), 04017038.

    Article  Google Scholar 

  • Mahalanobis, P. C. (1936). On the generalised distance in statistics. Proceedings of the National Institute of Sciences of India, 1936, 49–55.

    Google Scholar 

  • O’Leary, B., Reiners, J. J., Xu, X., & Lemke, L. D. (2016). Identification and influence of spatio-temporal outliers in urban air quality measurements. Science of the Total Environment, 573, 55–65.

    Article  Google Scholar 

  • Rousseeuw, P. J., & Van Zomeren, B. C. (1990). Unmasking multivariate outliers and leverage points. Journal of the American Statistical association, 85(411), 633–639.

    Article  Google Scholar 

  • Saw, J. G., Yang, M. C., & Mo, T. C. (1984). Chebyshev inequality with estimated mean and variance. The American Statistician, 38(2), 130–132.

    Google Scholar 

  • Srivastava, R. M. (2001). Outliers: A guide for data analysts and interpreters on how to evaluate unexpected high values. Contaminated sites statistical applications guidance document no. 12-8, BC, Canada, 4 pp. https://www2.gov.bc.ca/assets/gov/environment/air-land-water/site-remediation/docs/guidance-documents/gd08.pdf. Accessed 4 Aug 2018.

  • Werner, M. (2003). Identification of multivariate outliers in large data sets. Ph.D. thesis, Citeseer.

  • Ziegel, E. R. (1995). Gslib: Geostatistical software library and user’s guide.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Alvarenga Drumond.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Drumond, D.A., Rolo, R.M. & Costa, J.F.C.L. Using Mahalanobis Distance to Detect and Remove Outliers in Experimental Covariograms. Nat Resour Res 28, 145–152 (2019). https://doi.org/10.1007/s11053-018-9399-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11053-018-9399-y

Keywords

Navigation