Skip to main content

Part of the book series: Association for Women in Mathematics Series ((AWMS,volume 18))

  • 368 Accesses

Abstract

The problem of testing whether two samples come from the same or different populations is a classical one in statistics. A new distribution-free test based on standardized ranks for the univariate two-sample problem is studied. Providing a distribution-free (nonparametric) method offers a valuable technique for analyzing data that consists of ranks or relative preferences of data, and of data that are small samples from unknown distributions. The proposed test statistic examines the difference between the average of between-group rank distances and the average of within-group rank distances. This test statistic is closely related to the classical two-sample Cramér–von Mises criterion; however, they are different empirical versions of the same quantity for testing the equality of two population distributions. The advantage of the proposed rank-based test over the classical one is its ease to generalize to the multivariate case. In addition, the motivation of the proposed test is its application to microarray analyses in identifying sets of genes that are differentially expressed in various biological states, such as diseased versus non-diseased. A numerical study is conducted to compare the power performance of the rank formulation test with other commonly used nonparametric tests.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Anderson, T.W. (1962). On the distribution of the two-sample Cramér-von Mises criterion. Ann. Math. Statist., 33(3), 1148–1159.

    Article  MathSciNet  MATH  Google Scholar 

  2. Baringhaus, L. and Franz, C. (2004). On a new multivariate two-sample test. J. Multivariate Anal., 88, 190–206.

    Article  MathSciNet  MATH  Google Scholar 

  3. Chiu, S. and Liu, K. (2009). Generalized Cramér-von Mises goodness-of-fit tests for multivariate distributions. Comput. Stat. Data An., 53, 3817–3834.

    Article  MATH  Google Scholar 

  4. Cotterill, D. and Csörgő, M. (1982). On the limiting distribution of and critical values for the multivariate Cramér-von Mises Statistic. Ann. Stat., 10(1), 233–244.

    Article  MathSciNet  MATH  Google Scholar 

  5. Darling, D.A. (1957). The Kolmogorov-Smirnov, Cramér-von Mises tests. Ann. Math. Stat., 28(4), 823–838.

    Article  MATH  Google Scholar 

  6. Einmahl, J. and McKeague, I. (2003). Empirical likelihood based hypothesis testing. Bernoulli, 9(2), 267–290.

    Article  MathSciNet  MATH  Google Scholar 

  7. Fernández, V., Jimènez Gamerro, M. and Muñoz Garcìa, J. (2008). A test for the two-sample problem based on empirical characteristic functions. Comput. Stat. Data An., 52, 3730–3748.

    Article  MathSciNet  MATH  Google Scholar 

  8. Fisz, M. (1960). On a result by M. Rosenblatt concerning the von Mises - Smirnov Test. Ann. Math. Stat., 31(2), 427–429.

    Article  MathSciNet  MATH  Google Scholar 

  9. Genest, C., Quessy, J.F. and Rémillard, B. (2007). Asymptotic local efficiency of Cramér-von Mises tests for multivariate independence. Ann. Stat., 35(1), 166–191.

    Article  MATH  Google Scholar 

  10. Gretton, A., Borgwardt, K.M., Rasch, M.J., Schölkopf, B. and Smola, A. (2008). A kernel method for the two-sample problem, J. Mach. Learn. Res., 1, 1–10.

    MATH  Google Scholar 

  11. Gurevich, G. and Vexler, A. (2011). A two-sample empirical likelihood ratio test based on samples entropy. Stat. Comput., 21, 657–670.

    Article  MathSciNet  MATH  Google Scholar 

  12. Hsu, C.L. and Lee, W.C. (2010). Detecting differentially expressed genes in heterogeneous diseases using half Student’s t-test. International Journal of Epidemiology., 39,1597–1604.

    Article  Google Scholar 

  13. Lehmann, E.L. (1951). Consistency and unbiasedness of certain nonparametric tests. Ann. Math. Stat., 22, 165–179.

    Article  MathSciNet  MATH  Google Scholar 

  14. Morgenstern, D. (2001). Proof of a conjecture by Walter Deuber concerning the distances between points of two types in \(\mathbb {R}^d\), Discrete Math., 226, 347–349.

    MathSciNet  MATH  Google Scholar 

  15. Möttönen J., Oja, H. and Tienari J. (1997). On the efficiency of multivariate spatial sign and rank tests. Ann. Stat., 25, 542–552.

    Article  MathSciNet  MATH  Google Scholar 

  16. Oja, H. (2010). Multivariate Nonparametric Methods with R: An Approach Based on Spatial Signs and Ranks. Springer, New York.

    Book  MATH  Google Scholar 

  17. Rosenblatt, M. (1952). Limit theorems associated with variants of the von Mises statistic. Ann. Math. Stat., 23, 617–623.

    Article  MathSciNet  MATH  Google Scholar 

  18. Rubinstein, R. Y. (1981). Simulation and the Monte Carlo Method. John Wiley & Sons, New York.

    Book  MATH  Google Scholar 

  19. Székely, G.J. and Rizzo, M.L. (2013). Energy statistics: A class of statistics based on distances. J. Stat. Plan. Infer., 143, 1249–1272.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jamye Curry .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 The Author(s) and the Association for Women in Mathematics

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Curry, J., Dang, X., Sang, H. (2019). A Multivariate Rank-Based Two-Sample Test Statistic. In: D'Agostino, S., Bryant, S., Buchmann, A., Guinn, M., Harris, L. (eds) A Celebration of the EDGE Program’s Impact on the Mathematics Community and Beyond . Association for Women in Mathematics Series, vol 18. Springer, Cham. https://doi.org/10.1007/978-3-030-19486-4_16

Download citation

Publish with us

Policies and ethics