Skip to main content

Nonparametric Testing for Heterogeneous Correlation

  • Chapter
  • First Online:
Big and Complex Data Analysis

Part of the book series: Contributions to Statistics ((CONTRIB.STAT.))

Abstract

In the presence of weak overall correlation, it may be useful to investigate if the correlation is significantly and substantially more pronounced over a subpopulation. Two different testing procedures are compared. Both are based on the rankings of the values of two variables from a data set with a large number n of observations. The first maintains its level against Gaussian copulas; the second adapts to general alternatives in the sense that the number of parameters used in the test grows with n. An analysis of wine quality illustrates how the methods detect heterogeneity of association between chemical properties of the wine, which are attributable to a mix of different cultivars.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aeberhard, S., Coomans, D., de Vel, O.: Improvements to the classification performance of RDA. J. Chemometr. 7 (2), 99–115 (1993)

    Article  Google Scholar 

  2. Diaconis, P., Graham, R.L.: Spearman’s footrule as a measure of disarray. J. R. Stat. Soc. Ser. B - Stat. Methodol. 39, 262–268 (1977)

    MathSciNet  MATH  Google Scholar 

  3. Diaconis, P., Ram, A.: Analysis of systematic scan metropolis algorithms using Iwahori-Hecke algebra techniques. Mich. Math. J. 48, 157–190 (2000)

    Article  MathSciNet  MATH  Google Scholar 

  4. Fermanian, J.D., Radulovic, D., Wegkamp, M.: Weak convergence of empirical copula processes. Bernoulli 10, 847–860 (2004)

    Article  MathSciNet  MATH  Google Scholar 

  5. Fligner, M.A., Verducci, J.S.: Distance based ranking models. Journal of the Royal Statistical Society B, 48, 359–369 (1986)

    MathSciNet  MATH  Google Scholar 

  6. Frank, M.J.: On the simultaneous associativity of F(x, y) and x + y − F(x, y). Aequationes Math. 19, 194–226 (1979)

    Article  MathSciNet  MATH  Google Scholar 

  7. Genest, C.: Frank’s family of bivariate distributions. Biometrika 74 (3), 549–555 (1987)

    Article  MathSciNet  MATH  Google Scholar 

  8. Genest, C., MacKay, J.: The joy of copulas: bivariate distributions with uniform marginals. Am. Stat. 40, 280–283 (1986)

    MathSciNet  Google Scholar 

  9. Genest, C., Nešlehová, J.: On tests of radial symmetry for bivariate copulas. Stat. Pap. 55, 1107–1119 (2014)

    Article  MathSciNet  MATH  Google Scholar 

  10. Katz, G.: How much do we know about HDL cholesterol? Clin. Correlat. (2014) (http://www.clinicalcorrelations.org/?p=7298)

    Google Scholar 

  11. Lehmann, E.L., Romano, J.P.: Testing Statistical Hypotheses, 3rd edn. Springer, New York (2006).

    MATH  Google Scholar 

  12. Mallows, C.: Non-Null Ranking Models. Biometrika, 44 (1), 114–130 (1957)

    Article  MathSciNet  MATH  Google Scholar 

  13. Nelsen, R.B.: An Introduction to Copulas, 2nd edn. Springer, New York (2006)

    MATH  Google Scholar 

  14. Sampath, S., Verducci, J.: Detecting the end of agreement between two long ranked lists. Stat. Anal. Data Min. 6 (6), 458–471 (2013)

    Article  MathSciNet  MATH  Google Scholar 

  15. Sampath, S., Caloiaro, A., Johnson, W., Verducci, J.: The top-K tau-path screen for monotone association in subpopulations. WIREs Comput. Stat. (2016). doi:10.1002/wics.1382

    Google Scholar 

  16. Sen, P.K., Salama, I.A., Quade, D.: Spearman’s footrule: asymptotics in applications. Chil. J. Stat. 2, 3–20 (2011)

    MathSciNet  MATH  Google Scholar 

  17. Starr, S.: Thermodynamic limit for the Mallows model on S n . J. Math. Phys. 50, 195–208 (2009)

    Article  MathSciNet  Google Scholar 

  18. Voight, B.F., et al.: Plasma HDL cholesterol and risk of myocardial infarction: a Mendelian randomisation study. Lancet 380 (9841), 572–580 (2012)

    Article  Google Scholar 

  19. Yu, L., Verducci, J., Blower, P.: The tau-path test for monotone association in an unspecified subpopulation: applications to chemogenomic data mining. Stat. Methodol. 8, 97–111 (2011)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgement

We thank the referee for their thoughtful and helpful review.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Bamattre, S., Hu, R., Verducci, J.S. (2017). Nonparametric Testing for Heterogeneous Correlation. In: Ahmed, S. (eds) Big and Complex Data Analysis. Contributions to Statistics. Springer, Cham. https://doi.org/10.1007/978-3-319-41573-4_12

Download citation

Publish with us

Policies and ethics