Abstract
This paper surveys different types of tests (parametric, nonparametric, robustified and adaptive) and applies them to the two-sided c-sample location problem. Some concepts of robustness are discussed, such as breakdown point, influence function, gross-error sensitivity and especially α- and β-robustness. A Monte Carlo study examines the robustness of the level α under heteroscedasticity and nonnormal distributions and compares the power of all the tests considered. It turns out that robustified versions of the F-test and the Welch test, in which the original observations are replaced by their ranks, behave well over a broad class of distributions, both symmetric ones with different tail weights and asymmetric ones. On the whole, however, an adaptive test is to be preferred.
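To make the idea of a Monte Carlo study of the level α concrete, the following is a minimal sketch and not the simulation design of the paper. It compares the classical one-way F statistic with a rank-transformed version in which the pooled observations are replaced by their midranks. The normal samples, the group size of 10, the number of replications, and the use of a simulated critical value in place of F-distribution tables are all assumptions made for illustration. The names `f_statistic`, `rank_transform`, `rank_f_statistic` and `estimated_level` are hypothetical helpers.

```python
import random


def f_statistic(groups):
    """One-way ANOVA F statistic for a list of samples."""
    n = sum(len(g) for g in groups)
    c = len(groups)
    means = [sum(g) / len(g) for g in groups]
    grand = sum(sum(g) for g in groups) / n
    between = sum(len(g) * (m - grand) ** 2 for g, m in zip(groups, means)) / (c - 1)
    within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g) / (n - c)
    return between / within


def rank_transform(groups):
    """Replace every observation by its midrank in the pooled sample."""
    pooled = sorted(x for g in groups for x in g)
    rank_of = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        rank_of[pooled[i]] = (i + 1 + j) / 2  # average of ranks i+1, ..., j
        i = j
    return [[rank_of[x] for x in g] for g in groups]


def rank_f_statistic(groups):
    """F statistic computed on the ranks instead of the original observations."""
    return f_statistic(rank_transform(groups))


def simulate(stat, sigmas, n_per_group, reps, rng):
    """Draw `reps` sets of centred normal samples and return the statistic values."""
    values = []
    for _ in range(reps):
        groups = [[rng.gauss(0.0, s) for _ in range(n_per_group)] for s in sigmas]
        values.append(stat(groups))
    return values


def estimated_level(stat, sigmas, n_per_group=10, reps=2000, alpha=0.05, seed=1):
    """Estimate the actual level of `stat` when group standard deviations are `sigmas`.

    Assumption: the critical value is the empirical (1 - alpha) quantile of the
    statistic under equal variances, obtained by simulation rather than from tables.
    """
    rng = random.Random(seed)
    null_values = sorted(simulate(stat, [1.0] * len(sigmas), n_per_group, reps, rng))
    critical = null_values[int((1 - alpha) * reps)]
    values = simulate(stat, sigmas, n_per_group, reps, rng)
    return sum(v > critical for v in values) / reps


if __name__ == "__main__":
    for name, stat in [("F", f_statistic), ("rank F", rank_f_statistic)]:
        print(name, estimated_level(stat, sigmas=[1.0, 1.0, 3.0]))
```

All location parameters are equal in every simulated sample, so the printed rejection rates estimate the actual level under unequal variances. A test would be called α-robust in this setting if those rates stay close to the nominal 0.05. Replacing `rng.gauss` by a heavy-tailed or skewed generator would extend the sketch to nonnormal distributions.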
Büning, H. Robustness and power of parametric, nonparametric, robustified and adaptive tests—The multi-sample location problem. Statistical Papers 41, 381–407 (2000). https://doi.org/10.1007/BF02925759