Strong model dependence in statistical analysis: goodness of fit is not enough for model choice

Copas, John; Eguchi, Shinto

doi:10.1007/s10463-018-0691-8

Strong model dependence in statistical analysis: goodness of fit is not enough for model choice

Published: 03 October 2018

Volume 72, pages 329–352, (2020)
Cite this article

Annals of the Institute of Statistical Mathematics Aims and scope Submit manuscript

John Copas¹ &
Shinto Eguchi²

411 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Most statistical methods are based on models, but most practical applications ignore the fact that the results depend on the model as well as on the data. This paper examines the size of this model dependence, and finds that there can be very considerable variation between the results of fitting different models to the same data, even if the models being considered are restricted to those which give an acceptable fit to the data. Under reasonable regularity conditions, we show that different empirically acceptable models can give rise to non-overlapping confidence intervals for the same parameter. Application papers need to recognize that the validity of conventional statistical results rests on the assumption that the underlying model is known to be correct, and that this is a much stronger requirement than merely confirming that the model gives a good fit to the data. The problem of model dependence is only partially resolved by using formal methods of model selection or model averaging.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

RMSEA, CFI, and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods

Article 04 June 2018

A new criterion for assessing discriminant validity in variance-based structural equation modeling

Article Open access 22 August 2014

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Article Open access 01 April 2016

References

Box, G. E. P. (1976). Science and statistics. Journal of the American Statistical Association, 71, 791–799.
Article MathSciNet Google Scholar
Claeskens, G., Hjort, N. L. (2008). Model selection and model averaging. Cambridge: Cambridge University Press.
Cox, D. R. (1970). Analysis of binary data. London: Chapman and Hall/CRC.
Cox, D. R. (1995). Contribution to the discussion of the paper by Draper. Journal of the Royal Statistical Society, Series B, 57, 78.
Draper, D. (1995). Assessment and propagation of model uncertainty (with discussion). Journal of the Royal Statistical Society, Series B, 57, 45–97.
MathSciNet MATH Google Scholar
Efron, B. (2014). Estimation and accuracy after model selection. Journal of the American Statistical Association, 109, 991–1007.
Article MathSciNet Google Scholar
Everitt, B. S. (1977). The analysis of contingency tables. London: Chapman and Hall/CRC.
Ferrari, D., Yang, Y. (2015). Confidence sets for model selection by F-testing. Statistica Sinica, 25, 1637–1658.
Hjort, N. L., Claeskens, G. (2003). Frequentist model average estimators. Journal of the American Statistical Association, 98, 879–899.
Article MathSciNet Google Scholar
Hodges, J. S. (1987). Uncertainty, policy analysis and statistics. Statistical Science, 2, 259–291.
Article Google Scholar
Hoeting, J. A., Madigan, D., Raftery, A. E., Volinsky, C. T. (1999). Bayesian model averaging: A tutorial. Statistical Science, 14, 382–417.
Ioannidis, J. P. A. (2005). Why most published research findings are false. PLoS Medicine, 2(8), e124. https://doi.org/10.1371/journal.pmed.0020124.t001.
Langford, J. (2005). Tutorial on practical prediction theory for classification. Journal of Machine Learning Research, 6, 273–306.
Leeb, H., Potscher, B. M. (2005). Model selection and inference: Facts and fiction. Econometric Theory, 21, 21–59.
Miller, A. J. (2002). Subset selection in regression (2nd ed.). London: Chapman and Hall/CRC.
Nan, Y., Yang, Y. (2014). Variable selection diagnostic measures for high-dimensional regression. Journal of Computational and Grahical Statistics, 23, 636–656.
Article MathSciNet Google Scholar
Penrose, K., Nelson, A., Fisher, A. (1985). Generalized body composition prediction equation for men using simple measurement techniques (abstract). Medicine and Science in Sports and Exercise, 17, 189.
Article Google Scholar
Potscher, B. M. (1991). Effects of model selection on inference. Econometric Theory, 7, 163–185.
Article MathSciNet Google Scholar
Royston, P., Sauerbrei, W. (2008). Multivariate model-building. Chichester: Wiley.
Simmons, J. P., Nelson, L. D., Simonsohn, U. (2011). False-positive psychology: Undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychological Science, 20, 1–8.
Wadman, M. (2013). NIH mulls for validating key results. Nature, 500, 14–16.
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank the editors and referees for their very helpful comments on an earlier version of this paper.

Author information

Authors and Affiliations

Department of Statistics, University of Warwick, Coventry, CV4 7AL, UK
John Copas
Institute of Statistical Mathematics, Midori-cho 10-3, Tachikawa, Tokyo, 190-8562, Japan
Shinto Eguchi

Authors

John Copas
View author publications
You can also search for this author in PubMed Google Scholar
Shinto Eguchi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to John Copas.

Additional information

The online version of this article contains supplementary material.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material

Supplementary Appendix A, giving the proof of equation (17) in Section 3.3, is available online at the journal website. Similarly, Supplementary Appendix B gives the proof of equation (30) in Section 4.2.(PDF 84KB)

About this article

Cite this article

Copas, J., Eguchi, S. Strong model dependence in statistical analysis: goodness of fit is not enough for model choice. Ann Inst Stat Math 72, 329–352 (2020). https://doi.org/10.1007/s10463-018-0691-8

Download citation

Received: 02 February 2018
Revised: 25 June 2018
Published: 03 October 2018
Issue Date: April 2020
DOI: https://doi.org/10.1007/s10463-018-0691-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Strong model dependence in statistical analysis: goodness of fit is not enough for model choice

Abstract

Access this article

Similar content being viewed by others

RMSEA, CFI, and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods

A new criterion for assessing discriminant validity in variance-based structural equation modeling

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material

About this article

Cite this article

Keywords

Navigation

Strong model dependence in statistical analysis: goodness of fit is not enough for model choice

Abstract

Access this article

Similar content being viewed by others

RMSEA, CFI, and TLI in structural equation modeling with ordered categorical data: The story they tell depends on the estimation methods

A new criterion for assessing discriminant validity in variance-based structural equation modeling

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material

About this article

Cite this article

Share this article

Keywords

Search

Navigation