Reliability and efficiency of algorithms for computing the significance of the Mann–Whitney test

Nagarajan, Niranjan; Keich, Uri

doi:10.1007/s00180-009-0148-x

Original Paper
Published: 29 March 2009

Computational Statistics, Volume 24, pages 605–622 (2009)

Niranjan Nagarajan¹ & Uri Keich²

Abstract

Motivated by recent applications of the Mann–Whitney U test to large data sets, we took a critical look at current methods for computing its significance. Surprisingly, we found that the two fastest and most popular tools for exact computation of the test significance, Dinneen and Blakesley’s and Harding’s, can exhibit large numerical errors even in moderately large datasets. Another method, proposed by Pagano and Tritchler, suffers from a similar numerical instability and can also produce inaccurate results. This motivated our development of a new algorithm, mw-sFFT, for the exact computation of the Mann–Whitney test with no ties. Among the class of exact algorithms that are numerically stable, mw-sFFT has the best complexity: O(m²n) versus O(m²n²) for others, where m and n are the two sample sizes. This asymptotic efficiency is also reflected in the practical runtime of the algorithm. We also present a rigorous analysis of the propagation of numerical errors in mw-sFFT to derive an error guarantee for the values computed by the algorithm. The reliability and efficiency of mw-sFFT make it a valuable tool in computational applications, and we plan to provide open-source libraries for it in C++ and Matlab.
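To make concrete what "exact computation of the test significance" with no ties involves, the following is a minimal Python sketch. It is not mw-sFFT, nor any of the algorithms compared in the paper. It is a plain counting recurrence over rank subsets, and it sidesteps floating-point error by using exact integers and rationals, which is an assumption of this illustration rather than the authors' approach. The function names `mann_whitney_null_counts` and `exact_lower_tail` are hypothetical.

```python
from fractions import Fraction
from math import comb


def mann_whitney_null_counts(m, n):
    """Return counts[u]: number of ways the first sample (size m) can occupy
    ranks 1..m+n so that the Mann-Whitney statistic equals u (no ties)."""
    max_u = m * n
    offset = m * (m + 1) // 2      # smallest possible rank sum of the first sample
    max_sum = offset + max_u       # largest possible rank sum
    # dp[k][s]: number of k-subsets of the ranks processed so far with sum s
    dp = [[0] * (max_sum + 1) for _ in range(m + 1)]
    dp[0][0] = 1
    for r in range(1, m + n + 1):
        # Go through subset sizes in decreasing order so that dp[k - 1]
        # still excludes rank r when it is read.
        for k in range(min(r, m), 0, -1):
            row, prev = dp[k], dp[k - 1]
            for s in range(r, max_sum + 1):
                row[s] += prev[s - r]
    # U = (rank sum of the first sample) - m(m+1)/2
    return [dp[m][offset + u] for u in range(max_u + 1)]


def exact_lower_tail(u, m, n):
    """Exact P(U <= u) under the null hypothesis, as a Fraction."""
    counts = mann_whitney_null_counts(m, n)
    return Fraction(sum(counts[: u + 1]), comb(m + n, m))


if __name__ == "__main__":
    print(mann_whitney_null_counts(2, 2))
    print(exact_lower_tail(0, 3, 3))
```

Because every count is an arbitrary-precision integer, the result carries no rounding error, at the cost of a running time that grows much faster than the O(m²n) reported for mw-sFFT. The sketch is therefore only suitable as a reference check on small samples.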


References

Bickel DR (2004) Degrees of differential gene expression: detecting biologically significant expression differences and estimating their magnitudes. Bioinformatics 20(5): 682–688
Buckle N, Kraft C, van Eeden C (1969) An approximation to the Wilcoxon–Mann–Whitney distribution. J Am Stat Assoc 64: 591–599
Dembo A, Zeitouni O (1998) Large deviation techniques and applications. Springer, New York
Di Bucchianico A (1999) Combinatorics, computer-algebra and the Wilcoxon–Mann–Whitney test. J Stat Plann Infer 79: 349–364
Dinneen LC, Blakesley BC (1973) Algorithm AS 62: a generator for the sampling distribution of the Mann–Whitney U statistic. Appl Stat 22(2): 269–273
Fix E, Hodges JL (1955) Significance probabilities of the Wilcoxon test. Ann Math Stat 26(2): 301–312
Froda S, van Eeden C (2000) A uniform saddlepoint expansion for the null-distribution of the Wilcoxon–Mann–Whitney statistic. Can J Stat 1: 137–149
Harding EF (1984) An efficient, minimal-storage procedure for calculating the Mann–Whitney U, generalized U and similar distributions. Appl Stat 33(1): 1–6
Hodges JL, Ramsey P, Wechsler S (1990) Improved significance probabilities of the Wilcoxon test. J Educ Stat 15(3): 249–265
Jin R, Robinson J (1999) Saddlepoint approximation near the endpoints of the support. Stat Prob Lett 45(4): 295–303
Jin R, Robinson J (2003) Saddlepoint approximations of the two-sample Wilcoxon statistic. Institute of Mathematical Statistics, pp 149–158
Karanam S, Moreno CS (2004) CONFAC: automated application of comparative genomic promoter analysis to DNA microarray datasets. Nucleic Acids Res 32(Web Server issue): W475–W484
Keich U (2005) Efficiently computing the p-value of the entropy score. J Comput Biol 12(4): 416–430
Keich U, Nagarajan N (2006) A fast and numerically robust method for exact multinomial goodness-of-fit test. J Comput Graph Stat 15(4): 779–802
Kifer D, Ben-David S, Gehrke J (2004) Detecting change in data streams. In: Proceedings of the 30th VLDB conference, pp 180–191
Mann H, Whitney D (1947) On a test whether one of two random variables is stochastically larger than the other. Ann Math Stat 18: 50–60
Mehta CR, Patel NR, Tsiatis AA (1984) Exact significance testing to establish treatment equivalence with ordered categorical data. Biometrics 40(3): 819–825
Nagarajan N, Jones N, Keich U (2005) Computing the p-value of the information content from an alignment of multiple sequences. In: Proceedings of the 13th ISMB conference, pp 311–318
Pagano M, Tritchler D (1983) On obtaining permutation distribution in polynomial time. Am Stat Assoc 83: 435–440
Press W, Teukolsky S, Vetterling W, Flannery B (1992) Numerical recipes in C. The art of scientific computing, 2nd edn. Cambridge University Press, New York
Streitberg B, Rohmel J (1984) Exact nonparametrics in APL. In: Proceedings of the APL conference, ACM, New York, pp 313–325
Tasche M, Zeuner H (2002) Improved roundoff error analysis for precomputed twiddle factors. J Comput Anal Appl 4(1): 1–18
Troyanskaya OG, Garber ME, Brown PO, Botstein D, Altman RB (2002) Nonparametric methods for identifying differentially expressed genes in microarray data. Bioinformatics 18(11): 1454–1461
van Dantzig D (1947–1950) Kader cursus Mathematische Statistiek, Mathematisch Centrum, pp 301–304
Van de Wiel MA, Smeets SJ, Brakenhoff RH, Ylstra B (2005) CGHMultiArray: exact p-values for multi-array comparative genomic hybridization data. Bioinformatics 21: 3193–3194

Author information

Uri Keich
Present address: School of Mathematics and Statistics, University of Sydney, Sydney, NSW, 2006, Australia

Authors and Affiliations

CBCB, UMIACS, University of Maryland, College Park, MD, 20742, USA
Niranjan Nagarajan
Department of Computer Science, Cornell University, Ithaca, NY, 14850, USA
Uri Keich

Corresponding author

Correspondence to Niranjan Nagarajan.

About this article

Cite this article

Nagarajan, N., Keich, U. Reliability and efficiency of algorithms for computing the significance of the Mann–Whitney test. Comput Stat 24, 605–622 (2009). https://doi.org/10.1007/s00180-009-0148-x

Received: 21 March 2008
Accepted: 10 March 2009
Published: 29 March 2009
Issue Date: December 2009
DOI: https://doi.org/10.1007/s00180-009-0148-x
