Abstract
Hypothesis testing and confidence set estimation coexist as two related but disparate methods of statistical inference. As is well known, a confidence set is equivalent to a family of hypothesis tests—a relationship often exploited to develop confidence sets. In this paper, an extension of hypothesis testing is considered which often makes testing equivalent to confidence estimation. In the extension, with roots in partition multiple testing, all parameter points are tested—not only those corresponding to the null hypotheses. Consequently, the conclusion of a test is a confidence set, whether or not the null hypothesis is rejected, and the more specific inference of a confidence set is obtained with no loss of power to reject the null hypothesis. The methodology, known to be applicable for individual tests, is extended here to multiple testing, or multiple comparisons.
The literature on confidence estimation is reviewed in view of this extension of testing, to explore the nature and extent of the resulting stronger connections between these two methods of inference. Cases considered include basic t-tests and related extensions, the Tukey-Kramer and Scheffè single-step multiple tests, and certain step-down partition tests. Compelling evidence is provided that the traditional approach to hypothesis testing should in almost all cases be replaced with the partition multiple test extension presented here, so as to yield the same specific conclusions as corresponding confidence sets.
Similar content being viewed by others
References
Berger, R.L., Hsu, J.C., 1999. Stepwise confidence intervals without multiplicity adjustment for dose-response and toxicity studies. Journal of the American Statistical Association, 94, 68–482.
Bofinger, E., 1987. Stepdown procedures for comparison with a control. Australian Journal of Statistics, 29, 248–364.
Casella, G., Berger, R.L., 1990. Statistical Inference. Duxbury Press, California.
Finner, H., 1994. Two-sided tests and one-sided confidence bounds. Annals of Statistics, 22, 1502–1516.
Finner, H., 2008. Preface to the translation of Eckart Sonnemann’s 1982 paper on general solutions to multiple testing problems. Biometrical Journal, 50, 640.
Finner, H., Strassburger, K., 2002. The partitioning principle: A powerful tool in multiple decision theory. Annals of Statistics, 30, 1194–1213.
Guilbaud, O., 2008. Simultaneous confidence regions corresponding to Holm’s step-down procedure and other closed-testing procedures. Biometrical Journal, 50, 678–692.
Hayter, A.J., 2007. A combination multiple comparisons and subset selection procedure to identify treatments that are strictly inferior to the best. Journal of Statistical Planning and Inference, 137, 2115–2126.
Hayter, A.J., Hsu, J.C., 1994. On the relationship between stepwise decision procedures and confidence sets. Journal of the American Statistical Association, 89, 128–136.
Hayter, A.J., Miwa, T., Liu, W., 2000. Combining the advantages of one-sided and two-sided procedures for comparing several treatments with a control. Journal of Statistical Planning and Inference, 86, 81–99.
Lehmann, E.L., 1986. Testing Statistical Hypotheses (2nd edition). John Wiley, New York.
Maihara, H., 2007. Multiple decision problem on signs of mean vectors in three-variate normal case. Journal of the Japanese Statistical Society, 37, 29–52.
Miwa, T., Hayter, A.J., 1999. Combining the advantages of one-sided and two-sided test procedures for comparing several treatment effects. Journal of the American Statistical Association, 94, 302–307.
Scheffé, H., 1959. The Analysis of Variance. Wiley, New York.
Sonnemann, E., 1982. Allgemeine Lösungen multipler testprobleme. EDV in Medizin und Biologie, 13(4), 120–128 (in German), (Translation into English by
Finner, H. (2008), Biometrical Journal, 50, 641–656.
Stefansson, G., Kim, W.-C., Hsu, J.C., 1988. On confidence sets in multiple comparisons. In Statistical Decision Theory and Related Topics (IV), Gupta S.S., and Berger, J.O. (Editors), Academic Press, New York. 2, 89–104.
Strassburger, K., Bretz, F., 2008. Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni-based closed tests. Statistics in Medicine, 27, 4914–4927.
Takeuchi, K., 1973. Studies in Some Aspects of Theoretical Foundations of Statistical Data Analysis (In Japanese). Toyo Keizai Inc., Tokyo.
Voss, D.T., 2008. Partition testing: A generalization of hypothesis testing. Statistics and Applications, 6, 25–34.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Voss, D.T. Testing is Confidence Estimation: Partition Multiple Testing. J Stat Theory Pract 4, 559–569 (2010). https://doi.org/10.1080/15598608.2010.10412004
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1080/15598608.2010.10412004