Correlation is a fundamental tool for multivariate data analysis. Most multivariate statistical methods are built on correlation, and machine learning methods are also affected by correlations in the data. With today’s big data, the role of correlation is becoming increasingly important. Although the basic concept of correlation is simple, it has many complexities in practice. Many know the common saying “correlation is not causation”, but the converse statement, “causation does not necessarily lead to correlation”, is much less well known or even debatable. This chapter presents uses and pitfalls of correlation analysis for geoscience applications.



Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Y. Z. Ma, Schlumberger, Denver, USA
