Advertisement

Statistical Methods & Applications

, Volume 19, Issue 4, pp 497–515 | Cite as

Influence functions of the Spearman and Kendall correlation measures

  • Christophe CrouxEmail author
  • Catherine Dehon
Open Access
Article

Abstract

Nonparametric correlation estimators as the Kendall and Spearman correlation are widely used in the applied sciences. They are often said to be robust, in the sense of being resistant to outlying observations. In this paper we formally study their robustness by means of their influence functions and gross-error sensitivities. Since robustness of an estimator often comes at the price of an increased variance, we also compute statistical efficiencies at the normal model. We conclude that both the Spearman and Kendall correlation estimators combine a bounded and smooth influence function with a high efficiency. In a simulation experiment we compare these nonparametric estimators with correlations based on a robust covariance matrix estimator.

Keywords

Asymptotic variance Correlation Gross-error sensitivity Influence function Kendall correlation Robustness Spearman correlation 

Mathematics Subject Classification (2000)

65G35 62F99 

Notes

Acknowledgments

We would like to thank the two reviewers for their careful reading of our manuscript and their useful comments.

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

References

  1. Alqallaf FA, Konis KP, Martin RD, Zamar RH (2002) Scalable robust covariance and correlation estimates for data mining. In proceedings of the seventh ACM SIGKDD international conference on knowledge discovery and data mining, EdmontonGoogle Scholar
  2. Atkinson AC, Riani M, Cerioli A (2004) Exploring multivariate data with the forward search. Springer, New YorkzbMATHGoogle Scholar
  3. Blomqvist N (1950) On a measure of dependance between two random variables. Ann Math Stat 21: 593–600zbMATHCrossRefMathSciNetGoogle Scholar
  4. Bonett DG, Wright TA (2000) Sample size requirements for estimating Pearson, Kendall and Spearman correlation. Psychometrika 65: 23–28CrossRefGoogle Scholar
  5. Borkowf C (2002) Computing the nonnull asymptotic variance and the asymptotic relative efficiency of Spearman’s rank correlation. Comput Stat Data Anal 39: 271–286zbMATHCrossRefMathSciNetGoogle Scholar
  6. Caperaa P, Garralda Guillem AI (1997) Taux de resistance des tests de rang d’independance. Can J Stat 25: 113–124zbMATHCrossRefMathSciNetGoogle Scholar
  7. Cerioli A (2010) Multivariate outlier detection with high-breakdown estimators. J Am Stat Assoc (to appear)Google Scholar
  8. Croux C, Haesbroeck G (1999) Influence function and efficiency of the minimum covariance determinant scatter matrix estimator. J Multivar Anal 71: 161–190zbMATHCrossRefMathSciNetGoogle Scholar
  9. David FN, Mallows CL (1961) The variance of Spearman’s rho in normal samples. Biometrika 48: 19–28zbMATHMathSciNetGoogle Scholar
  10. Davies PL, Gather U (2005) Breakdown and groups (with discussion). Ann Stat 33: 977–1035zbMATHCrossRefMathSciNetGoogle Scholar
  11. Devlin SJ, Gnanadesikan R, Kettering JR (1975) Robust estimation and outlier detection with correlation coefficients. Biometrika 62: 531–545zbMATHCrossRefGoogle Scholar
  12. Falk M (1998) A note on the comedian for elliptical distributions. J Multivar Anal 67: 306–317zbMATHCrossRefMathSciNetGoogle Scholar
  13. Genton MG, Ma Y (1999) Robustness properties of dispersion estimators. Stat Probab Lett 44: 343–350zbMATHCrossRefMathSciNetGoogle Scholar
  14. Gnanadesikan R, Kettering JR (1972) Robust estimates, residuals, and outlier detection wit multiresponse data. Biometrics 28: 81–124CrossRefGoogle Scholar
  15. Grize YL (1978) Robustheitseigenschaften von Korrelations-schätzungen, Unpublished Diplomarbeit, ETH ZürichGoogle Scholar
  16. Hampel FR, Ronchetti EM, Rousseeuw PJ, Stahel WA (1986) Robust statistics: the approach based on influence functions. Wiley, New YorkzbMATHGoogle Scholar
  17. Kendall MG (1938) A new measure of rank correlation. Biometrika 30: 81–93zbMATHMathSciNetGoogle Scholar
  18. Khan JA, Van Aelst S, Zamar RH (2007) Robust linear model selection based on least angle regression. J Am Stat Assoc 480: 1289–1299CrossRefMathSciNetGoogle Scholar
  19. Maronna R, Martin D, Yohai V (2006) Robust Statistics. Wiley, New YorkzbMATHCrossRefGoogle Scholar
  20. Maronna RA, Zamar RH (2002) Robust estimates of location and dispersion of high-dimensional datasets. Technometrics 44: 307–317CrossRefMathSciNetGoogle Scholar
  21. Moran PAP (1948) Rank correlation and permutation distributions. Biometrika 44: 142–144zbMATHGoogle Scholar
  22. Morgenthaler S (2007) A survey of robust statistics. Stat Methods Appl 15: 271–293CrossRefMathSciNetGoogle Scholar
  23. Mosteller F (1946) On some useful inefficient statistics. Ann Math Stat 17: 377zbMATHCrossRefMathSciNetGoogle Scholar
  24. Olkin I, Raveh A (2009) Bounds for how much influence an observation can have. Stat Methods Appl 18: 1–11CrossRefMathSciNetGoogle Scholar
  25. Rousseeuw PJ, Van Driessen K (1999) A fast algorithm for the minimum covariance determinant estimator. Technometrics 41: 212–223CrossRefGoogle Scholar
  26. Shevlyakov GL, Vilchevski NO (2002) Robustness in data analysis: criteria and methods. Modern probability and statistics. VSP, UtrechtGoogle Scholar
  27. Spearman C (1904) General intelligence objectively determined and measured. Am J Psychol 15: 201–293CrossRefGoogle Scholar
  28. Wilcox RR (2003) Inferences based on multiple skipped correlations. Comput Stat Data Anal 44: 223–236zbMATHCrossRefMathSciNetGoogle Scholar

Copyright information

© The Author(s) 2010

Open AccessThis is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Authors and Affiliations

  1. 1.Faculty of Business and Economics, and K.U. LeuvenLeuvenBelgium
  2. 2.Tilburg UniversityTilburgThe Netherlands
  3. 3.Université libre de Bruxelles, ECARES, and Institut de Recherche en StatistiqueBrusselsBelgium

Personalised recommendations