Abstract
When comparing the central values of two independent groups, should a t-test be performed, or should the observations be transformed into their ranks and a Wilcoxon-Mann-Whitney test performed? This paper argues that neither should automatically be chosen. Instead, provided that software for conducting randomisation tests is available, the chief concern should be with obtaining data values that are a good reflection of scientific reality and appropriate to the objective of the research; if necessary, the data values should be transformed so that this is so. The subsequent use of a randomisation (permutation) test will mean that failure of the transformed data values to satisfy assumptions such as normality and equality of variances will not be of concern.
Similar content being viewed by others
References
Abt, K. (1996). How statistics can ’lie’. European Journal of Anesthesiology, 13, 427–431.
Chatfield, C. (1995). Problem Solving. A Statistician’s Guide. 2nd Edition. Chapman and Hall, London.
Gøtzsche, P.C. (1996). Blinding during data analysis and writing of manuscripts. Controlled Clinical Trials, 17, 285–293 (including comments by G.L. Knatterud and S.J. Pocock).
Grayson, D., Pattison, P. and Robins, G. (1997). Evidence, inference, and the ’rejection’ of the significance test. Australian Journal of Psychology, 49, 64–70.
Harris, R.J. (1994). ANOVA: An Analysis of Variance Primer. Peacock, Itasca, Illinois.
Marsh, C. (1988). Exploring Data. An Introduction to Data Analysis for Social Scientists. Polity Press, Cambridge, U.K.
May, W.L. and Johnson, W.D. (1997). A SASR macro for the multivariate extension of the Kruskal-Wallis test including multiple comparisons: Randomization and x2 criteria. Computational Statistics and Data Analysis, 22, 239–250.
Mehta, C. and Patel, N. (1995). StatXact 3 for Windows. Cytel Software Corporation, Cambridge, Massachusetts.
Mielke, P.W. (1986). Non-metric statistical analyses: Some metric alternatives. Journal of Statistical Planning and Inference, 13, 377–387.
Mielke, P.W. and Berry, K.J. (1983). Asymptotic clarifications, generalizations, and concerns regarding an extended class of matched pairs tests based on powers of ranks. Psychometrika, 48, 483–485.
Routledge, R.D. (1997). P-values from permutation and F-tests. Computational Statistics and Data Analysis, 24, 379–386.
Slauson, W.L., Cade, B.S. and Richards, J.D. (1999). User Manual for BLOSSOM Statistical Software. Midcontinent Ecological Science Center, Fort Collins, Colorado.
Author information
Authors and Affiliations
Additional information
The authors gratefully acknowledge helpful advice from two referees.
Rights and permissions
About this article
Cite this article
Hutchinson, T.P., Cairns, D. & Chekaluk, E. The construction of data to reflect the research objective, and how randomisation tests make such data usable. Statistical Papers 43, 349–359 (2002). https://doi.org/10.1007/s00362-002-0109-8
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/s00362-002-0109-8