Abstract
The problem of testing whether two samples come from the same or different populations is a classical one in statistics. A new distribution-free test based on standardized ranks for the univariate two-sample problem is studied. Providing a distribution-free (nonparametric) method offers a valuable technique for analyzing data that consists of ranks or relative preferences of data, and of data that are small samples from unknown distributions. The proposed test statistic examines the difference between the average of between-group rank distances and the average of within-group rank distances. This test statistic is closely related to the classical two-sample Cramér–von Mises criterion; however, they are different empirical versions of the same quantity for testing the equality of two population distributions. The advantage of the proposed rank-based test over the classical one is its ease to generalize to the multivariate case. In addition, the motivation of the proposed test is its application to microarray analyses in identifying sets of genes that are differentially expressed in various biological states, such as diseased versus non-diseased. A numerical study is conducted to compare the power performance of the rank formulation test with other commonly used nonparametric tests.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Anderson, T.W. (1962). On the distribution of the two-sample Cramér-von Mises criterion. Ann. Math. Statist., 33(3), 1148–1159.
Baringhaus, L. and Franz, C. (2004). On a new multivariate two-sample test. J. Multivariate Anal., 88, 190–206.
Chiu, S. and Liu, K. (2009). Generalized Cramér-von Mises goodness-of-fit tests for multivariate distributions. Comput. Stat. Data An., 53, 3817–3834.
Cotterill, D. and Csörgő, M. (1982). On the limiting distribution of and critical values for the multivariate Cramér-von Mises Statistic. Ann. Stat., 10(1), 233–244.
Darling, D.A. (1957). The Kolmogorov-Smirnov, Cramér-von Mises tests. Ann. Math. Stat., 28(4), 823–838.
Einmahl, J. and McKeague, I. (2003). Empirical likelihood based hypothesis testing. Bernoulli, 9(2), 267–290.
Fernández, V., Jimènez Gamerro, M. and Muñoz Garcìa, J. (2008). A test for the two-sample problem based on empirical characteristic functions. Comput. Stat. Data An., 52, 3730–3748.
Fisz, M. (1960). On a result by M. Rosenblatt concerning the von Mises - Smirnov Test. Ann. Math. Stat., 31(2), 427–429.
Genest, C., Quessy, J.F. and Rémillard, B. (2007). Asymptotic local efficiency of Cramér-von Mises tests for multivariate independence. Ann. Stat., 35(1), 166–191.
Gretton, A., Borgwardt, K.M., Rasch, M.J., Schölkopf, B. and Smola, A. (2008). A kernel method for the two-sample problem, J. Mach. Learn. Res., 1, 1–10.
Gurevich, G. and Vexler, A. (2011). A two-sample empirical likelihood ratio test based on samples entropy. Stat. Comput., 21, 657–670.
Hsu, C.L. and Lee, W.C. (2010). Detecting differentially expressed genes in heterogeneous diseases using half Student’s t-test. International Journal of Epidemiology., 39,1597–1604.
Lehmann, E.L. (1951). Consistency and unbiasedness of certain nonparametric tests. Ann. Math. Stat., 22, 165–179.
Morgenstern, D. (2001). Proof of a conjecture by Walter Deuber concerning the distances between points of two types in \(\mathbb {R}^d\), Discrete Math., 226, 347–349.
Möttönen J., Oja, H. and Tienari J. (1997). On the efficiency of multivariate spatial sign and rank tests. Ann. Stat., 25, 542–552.
Oja, H. (2010). Multivariate Nonparametric Methods with R: An Approach Based on Spatial Signs and Ranks. Springer, New York.
Rosenblatt, M. (1952). Limit theorems associated with variants of the von Mises statistic. Ann. Math. Stat., 23, 617–623.
Rubinstein, R. Y. (1981). Simulation and the Monte Carlo Method. John Wiley & Sons, New York.
Székely, G.J. and Rizzo, M.L. (2013). Energy statistics: A class of statistics based on distances. J. Stat. Plan. Infer., 143, 1249–1272.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 The Author(s) and the Association for Women in Mathematics
About this chapter
Cite this chapter
Curry, J., Dang, X., Sang, H. (2019). A Multivariate Rank-Based Two-Sample Test Statistic. In: D'Agostino, S., Bryant, S., Buchmann, A., Guinn, M., Harris, L. (eds) A Celebration of the EDGE Program’s Impact on the Mathematics Community and Beyond . Association for Women in Mathematics Series, vol 18. Springer, Cham. https://doi.org/10.1007/978-3-030-19486-4_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-19486-4_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19485-7
Online ISBN: 978-3-030-19486-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)