A Multivariate Rank-Based Two-Sample Test Statistic

Curry, Jamye; Dang, Xin; Sang, Hailin

doi:10.1007/978-3-030-19486-4_16

Jamye Curry⁷,
Xin Dang⁸ &
Hailin Sang⁸

Part of the book series: Association for Women in Mathematics Series ((AWMS,volume 18))

368 Accesses

Abstract

The problem of testing whether two samples come from the same or different populations is a classical one in statistics. A new distribution-free test based on standardized ranks for the univariate two-sample problem is studied. Providing a distribution-free (nonparametric) method offers a valuable technique for analyzing data that consists of ranks or relative preferences of data, and of data that are small samples from unknown distributions. The proposed test statistic examines the difference between the average of between-group rank distances and the average of within-group rank distances. This test statistic is closely related to the classical two-sample Cramér–von Mises criterion; however, they are different empirical versions of the same quantity for testing the equality of two population distributions. The advantage of the proposed rank-based test over the classical one is its ease to generalize to the multivariate case. In addition, the motivation of the proposed test is its application to microarray analyses in identifying sets of genes that are differentially expressed in various biological states, such as diseased versus non-diseased. A numerical study is conducted to compare the power performance of the rank formulation test with other commonly used nonparametric tests.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 16.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Anderson, T.W. (1962). On the distribution of the two-sample Cramér-von Mises criterion. Ann. Math. Statist., 33(3), 1148–1159.
Article MathSciNet MATH Google Scholar
Baringhaus, L. and Franz, C. (2004). On a new multivariate two-sample test. J. Multivariate Anal., 88, 190–206.
Article MathSciNet MATH Google Scholar
Chiu, S. and Liu, K. (2009). Generalized Cramér-von Mises goodness-of-fit tests for multivariate distributions. Comput. Stat. Data An., 53, 3817–3834.
Article MATH Google Scholar
Cotterill, D. and Csörgő, M. (1982). On the limiting distribution of and critical values for the multivariate Cramér-von Mises Statistic. Ann. Stat., 10(1), 233–244.
Article MathSciNet MATH Google Scholar
Darling, D.A. (1957). The Kolmogorov-Smirnov, Cramér-von Mises tests. Ann. Math. Stat., 28(4), 823–838.
Article MATH Google Scholar
Einmahl, J. and McKeague, I. (2003). Empirical likelihood based hypothesis testing. Bernoulli, 9(2), 267–290.
Article MathSciNet MATH Google Scholar
Fernández, V., Jimènez Gamerro, M. and Muñoz Garcìa, J. (2008). A test for the two-sample problem based on empirical characteristic functions. Comput. Stat. Data An., 52, 3730–3748.
Article MathSciNet MATH Google Scholar
Fisz, M. (1960). On a result by M. Rosenblatt concerning the von Mises - Smirnov Test. Ann. Math. Stat., 31(2), 427–429.
Article MathSciNet MATH Google Scholar
Genest, C., Quessy, J.F. and Rémillard, B. (2007). Asymptotic local efficiency of Cramér-von Mises tests for multivariate independence. Ann. Stat., 35(1), 166–191.
Article MATH Google Scholar
Gretton, A., Borgwardt, K.M., Rasch, M.J., Schölkopf, B. and Smola, A. (2008). A kernel method for the two-sample problem, J. Mach. Learn. Res., 1, 1–10.
MATH Google Scholar
Gurevich, G. and Vexler, A. (2011). A two-sample empirical likelihood ratio test based on samples entropy. Stat. Comput., 21, 657–670.
Article MathSciNet MATH Google Scholar
Hsu, C.L. and Lee, W.C. (2010). Detecting differentially expressed genes in heterogeneous diseases using half Student’s t-test. International Journal of Epidemiology., 39,1597–1604.
Article Google Scholar
Lehmann, E.L. (1951). Consistency and unbiasedness of certain nonparametric tests. Ann. Math. Stat., 22, 165–179.
Article MathSciNet MATH Google Scholar
Morgenstern, D. (2001). Proof of a conjecture by Walter Deuber concerning the distances between points of two types in \(\mathbb {R}^d\), Discrete Math., 226, 347–349.
MathSciNet MATH Google Scholar
Möttönen J., Oja, H. and Tienari J. (1997). On the efficiency of multivariate spatial sign and rank tests. Ann. Stat., 25, 542–552.
Article MathSciNet MATH Google Scholar
Oja, H. (2010). Multivariate Nonparametric Methods with R: An Approach Based on Spatial Signs and Ranks. Springer, New York.
Book MATH Google Scholar
Rosenblatt, M. (1952). Limit theorems associated with variants of the von Mises statistic. Ann. Math. Stat., 23, 617–623.
Article MathSciNet MATH Google Scholar
Rubinstein, R. Y. (1981). Simulation and the Monte Carlo Method. John Wiley & Sons, New York.
Book MATH Google Scholar
Székely, G.J. and Rizzo, M.L. (2013). Energy statistics: A class of statistics based on distances. J. Stat. Plan. Infer., 143, 1249–1272.
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Georgia Gwinnett College, Lawrenceville, GA, USA
Jamye Curry
University of Mississippi, University, MS, USA
Xin Dang & Hailin Sang

Authors

Jamye Curry
View author publications
You can also search for this author in PubMed Google Scholar
Xin Dang
View author publications
You can also search for this author in PubMed Google Scholar
Hailin Sang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jamye Curry .

Editor information

Editors and Affiliations

Science Writing Advanced Academic Program, John Hopkins University, Baltimore, MD, USA
Susan D'Agostino
Department of Mathematics and Computer Science, Dickinson College, Carlisle, PA, USA
Sarah Bryant
Department of Mathematics, University of San Diego, San Diego, CA, USA
Amy Buchmann
Department of Mathematics and Computer Science, Belmont University, Nashville, TN, USA
Michelle Craddock Guinn
Division of Sciences and Mathematics, University of the District of Columbia, Washington, DC, USA
Leona Harris

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Curry, J., Dang, X., Sang, H. (2019). A Multivariate Rank-Based Two-Sample Test Statistic. In: D'Agostino, S., Bryant, S., Buchmann, A., Guinn, M., Harris, L. (eds) A Celebration of the EDGE Program’s Impact on the Mathematics Community and Beyond . Association for Women in Mathematics Series, vol 18. Springer, Cham. https://doi.org/10.1007/978-3-030-19486-4_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-19486-4_16
Published: 01 September 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19485-7
Online ISBN: 978-3-030-19486-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics