Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In B. N. Petrov & F. Csaki (Eds.), Second international symposium on information theory. Budapest: Akademiai Kiado.
Berger, J. (2013). Statistical decision theory and Bayesian analysis. New York: Springer.
Bernardo, J., & Smith, A. F. M. (2000). Bayesian theory. New York: Wiley.
Breiman, L. (1996). Stacked regressions. Machine Learning, 24, 49–64.
Brier, G. W. (1950). Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78, 1–3.
Burnham, K. P., & Anderson, D. R. (2002). Model selection and multimodel inference: A practical information-theoretic approach (2nd ed.). New York: Springer.
Claeskens, G., & Hjort, N. L. (2008). Model selection and model averaging. Cambridge: Cambridge University Press.
Clarke, B. S., & Clarke, J. L. (2018). Predictive statistics: Analysis and inference beyond models. Cambridge: Cambridge University Press.
Clyde, M. A. (1999). Bayesian model averaging and model search strategies. Bayesian statistics (Vol. 6, pp. 157–185). Oxford: Oxford University Press.
Clyde, M. A. (2003). Model averaging. In S. J. Press (Ed.), Subjective and objective Bayesian statistics: Principles, models, and applications (pp. 320–335). Hoboken, NJ: Wiley-Interscience.
Clyde, M. A. (2017). BAS: Bayesian adaptive sampling for Bayesian model averaging [Computer software manual]. (R package version 1.4.7).
Clyde, M. A., & George, E. I. (2004). Model uncertainty. Statistical Science, 19, 81–94.
Clyde, M. A., & Iversen, E. S. (2013). Bayesian model averaging in the M-open framework. Bayesian theory and applications (pp. 483–498). Oxford: Oxford University Press.
Dawid, A. P. (1982). The well-calibrated Bayesian. Journal of the American Statistical Association, 77, 605–610.
Dawid, A. P. (1984). Statistical theory: The prequential approach. Journal of the Royal Statistical Society, Series A, 147, 202–278.
de Finetti, B. (1962). Does it make sense to speak of good probability appraisers. In I. J. Good (Ed.), The scientist speculates: An anthology of partly-baked ideas (pp. 357–364). London: Heinemann.
Draper, D. (1995). Assessment and propagation of model uncertainty (with discussion). Journal of the Royal Statistical Society (Series B), 57, 55–98.
Draper, D. (2013). Bayesian model specification: Heuristics and examples. Bayesian theory and applications (pp. 483–498). Oxford: Oxford University Press.
Draper, D., Hodges, J. S., Leamer, E. E., Morris, C. N., & Rubin, D. B. (1987). A Research Agenda for Assessment and Propagation of Model Uncertainty (Tech. Rep.). Santa Monica, CA: Rand Corporation. Retrieved from https://www.rand.org/pubs/notes/N2683.html (N-2683-RC).
Eicher, T. S., Papageorgiou, C., & Raftery, A. E. (2011). Default priors and predictive performance in Bayesian model averaging, with application to growth determinants. Journal of Applied Econometrics, 26(1), 30–55.
Feldkircher, M., & Zeugner, S. (2009). Benchmark priors revisited: On adaptive shrinkage and the supermodel effect in Bayesian model averaging (No. 9-202). International Monetary Fund.
Fernández, C., Ley, E., & Steel, M. F. J. (2001). Benchmark priors for Bayesian model averaging. Journal of Econometrics, 100, 381–427.
Fernández, C., Ley, E., & Steel, M. F. J. (2001). Model uncertainty in cross-country growth regressions. Journal of Applied Econometrics, 16, 563–576.
Fletcher, D. (2018). Model averaging. Berlin: Springer.
Foster, D. P., & George, E. I. (1994). The risk inflation criterion for multiple regression. Annals of Statistics, 22, 1947–1975.
Furnival, G. M., & Wilson, R. W., Jr. (1974). Regressions by leaps and bounds. Technometrics, 16, 499–511.
Geisser, S., & Eddy, W. F. (1979). A predictive approach to model selection. Journal of the American Statistical Association, 74, 153–160.
Gelfand, A. E. (1996). Model determination using sampling-based methods. In W. R. Gilks, S. Richardson, & D. J. Spiegelhalter (Eds.), Markov Chain Monte Carlo in practice (pp. 145–161). Boca Raton: Chapman & Hall.
Gelman, A., Meng, X.-L., & Stern, H. (1996). Posterior predictive assessment of model fitness via realized discrepancies: With commentary. Statistica Sinica, 6, 733–807.
George, E., & Foster, D. (2000). Calibration and empirical Bayes variable selection. Biometrika, 87(4), 731–747. https://doi.org/10.1093/biomet/87.4.731.
Gilks, W. R., Richardson, S., & Spiegelhalter, D. J. (Eds.). (1996). Markov Chain Monte Carlo in practice. London: Chapman and Hall.
Gneiting, T., & Raftery, A. E. (2007). Strictly proper scoring rules, prediction, and estimation. Journal of the American Statistical Association, 102, 359–378.
Good, I. J. (1952). Rational decisions. Journal of the Royal Statistical Society Series B (Methodological), 14, 107–114.
Goodrich, B., Gabry, J., Ali, I., & Brilleman, S. (2020). rstanarm: Bayesian applied regression modeling via Stan. Retrieved from https://mc-stan.org/rstanarm (R package version 2.21.1)
Hannan, E. J., & Quinn, B. G. (1979). The determination of the order of an autoregression. Journal of the Royal Statistical Society Series B (Methodological), 41(2), 190–195.
Hansen, M. H., & Yu, B. (2001). Model selection and the principle of minimum description length. Journal of the American Statistical Association, 96, 746–774.
Heckman, J. J., & Kautz, T. (2012). Hard evidence on soft skills. Labour Economics, 19, 451–464.
Hinne, M., Gronau, Q. F., van den Bergh, D., & Wagenmakers, E.-J. (2020). A conceptual introduction to Bayesian model averaging. Advances in Methods and Practices in Psychological Science, 3, 200–215.
Hjort, N. L., & Claeskens, G. (2003). Frequentist model average estimators. Journal of the American Statistical Association, 98, 879–899.
Hoerl, A. E., & Kennard, R. W. (1970). Ridge regression: Biased estimation for nonorthogonal problems. Technometrics, 12(1), 55–67.
Hoerl, R. W. (1985). Ridge analysis 25 years later. The American Statistician, 39(3), 186–192.
Hoeting, J. A., Madigan, D., Raftery, A. E., & Volinsky, C. T. (1999). Bayesian model averaging: A tutorial. Statistical Science, 14, 382–417.
Hsiang, T. C. (1975). A Bayesian View on Ridge Regression. Journal of the Royal Statistical Society, D (The Statistician), 24, 267–268.
Jose, V. R. R., Nau, R. F., & Winkler, R. L. (2008). Scoring rules, generalized entropy, and utility maximization. Operations Research, 56, 1146–1157.
Kaplan, D., & Chen, J. (2014). Bayesian model averaging for propensity score analysis. Multivariate Behavioral Research, 49, 505–517.
Kaplan, D., & Huang, M. (under review). Bayesian probabilistic forecasting with state NAEP data.
Kaplan, D., & Kuger, S. (2016). The methodology of PISA: Past, present, and future. In S. Kuger, E. Klieme, N. Jude, & D. Kaplan (Eds.), Assessing contexts of learning world-wide—extended context assessment frameworks. Dordrecht: Springer.
Kaplan, D., & Lee, C. (2015). Bayesian model averaging over directed acyclic graphs with implications for the predictive performance of structural equation models. Structural Equation Modeling. https://doi.org/10.1080/10705511.2015.1092088.
Kaplan, D., & Yavuz, S. (2019). An approach to addressing multiple imputation model uncertainty using Bayesian model averaging. Multivariate Behavioral Research, 1, 21. https://doi.org/10.1080/00273171.2019.1657790.
Kass, R. E., & Raftery, A. E. (1995). Bayes factors. Journal of the American Statistical Association, 90, 773–795.
Kuger, S., Klieme, E., Jude, N., & Kaplan, D. (2016). Assessing contexts of learning: An international perspective. Dordrecht: Springer.
Kullback, S. (1959). Information theory and statistics. New York: Wiley.
Kullback, S. (1987). The Kullback–Leibler distance. The American Statistician, 41, 340–341.
Kullback, S., & Leibler, R. A. (1951). On information and sufficiency. Annals of Mathematical Statistics, 22, 79–86.
Le, T., & Clarke, B. (2017). A Bayes interpretation of stacking for \(\mathcal{M}\)-complete and \(\mathcal{M}\)-open settings. Bayesian Analysis, 12, 807–829.
Leamer, E. E. (1978). Specification searches: Ad hoc inference with nonexperimental data. New York: Wiley.
Ley, E., & Steel, M. F. J. (2009). On the effect of prior assumptions in Bayesian model averaging with applications to growth regression. Journal of Applied Econometrics, 24, 651–674.
Li, Q., & Lin, N. (2010). The Bayesian elastic net. Bayesian Analysis, 5, 151–170. https://doi.org/10.1214/10-BA506.
Liang, F., Paulo, R., Molina, G., Clyde, M. A., & Berger, J. (2008). Mixtures of \(g\)-priors for Bayesian variable selection. Journal of the American Statistical Association, 103, 410–423.
Lindley, D. (1991). Making Decisions. London: Wiley.
Madigan, D., & Raftery, A. E. (1994). Model selection and accounting for model uncertainty in graphical models using Occam’s window. Journal of the American Statistical Association, 89, 1535–1546.
Madigan, D., & York, J. (1995). Bayesian graphical models for discrete data. International Statistical Review, 63, 215–232.
Merkle, E. C., & Steyvers, M. (2013). Choosing a strictly proper scoring rule. Decision Analysis, 10, 292–304.
Mislevy, R. J. (1991). Randomization-based inference about latent variables from complex samples. Psychometrika, 56, 177–196.
Mislevy, R. J., Beaton, A. E., Kaplan, B., & Sheehan, K. M. (1992). Estimating population characteristics from sparse matrix samples of item responses. Journal of Educational Measurement, 29, 133–161.
Montgomery, J. M., & Nyhan, B. (2010). Bayesian model averaging: Theoretical developments and practical applications. Political Analysis, 18, 245–270.
OECD. (2002). PISA 2000 Technical Report. Paris: Organization for Economic Cooperation and Development.
OECD. (2009). PISA 2009 assessment framework: Key competencies in reading, mathematics and science. Paris: Organization for Economic Cooperation and Development.
OECD. (2017). PISA 2015 Technical Report. Paris: OECD.
OECD. (2018). Equity in Education: Breaking Down Barriers to Social Mobility (Tech. Rep.). Paris. https://doi.org/10.1787/9789264073234-en.
Park, T., & Casella, G. (2008). The Bayesian lasso. Journal of the American Statistical Association, 103, 681–686.
Piironen, J., & Vehtari, A. (2017). Comparison of Bayesian prediction methods for model selection. Statistics and Computing, 27, 711–735.
Raftery, A. E. (1995). Bayesian model selection in social research (with discussion). In P. V. Marsden (Ed.), Sociological Methodology (Vol. 25, pp. 111–196). New York: Blackwell.
Raftery, A. E. (1996). Approximate Bayes factors and accounting for model uncertainty in generalized linear models. Biometrika, 83, 251–266.
Raftery, A. E., Gneiting, T., Balabdaoui, F., & Polakowski, M. (2005). Using Bayesian model averaging to calibrate forecast ensembles. Monthly Weather Review, 133, 1155–1174.
Raftery, A. E., Hoeting, J., Volinsky, C., Painter, I., & Yeung, K. (2015). BMA: Bayesian model averaging [Computer software manual]. Retrieved from http://CRAN.R-project.org/package=BMA (R package version 3.18.1).
Raftery, A. E., Madigan, D., & Hoeting, J. A. (1997). Bayesian model averaging for linear regression models. Journal of the American Statistical Association, 92, 179–191.
Raftery, A. E., & Zheng, Y. (2003). Discussion: Performance of Bayesian model averaging. Journal of the American Statistical Association, 98, 931–938.
Rights, J., Sterba, S., Cho, S.-J., & Preacher, K. (2018). Addressing model uncertainty in item response theory person scores through model averaging. Behaviormetrika, 45, 495–503. https://doi.org/10.1007/s41237-018-0052-1.
Rubin, D. B. (1981). The Bayesian bootstrap. The Annals of Statistics, 9, 130–134.
Sloughter, J. M., Gneiting, T., & Raftery, A. E. (2013). Probabilistic wind vector forecasting using ensembles and Bayesian model averaging. Monthly Weather Review, 141, 2107–2119.
Steel, M. F. J. (2020). Model averaging and its use in economics. Journal of Economic Literature, 58, 644–719.
Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society Series B (Methodological), 58, 267–288.
Tierney, L., & Kadane, J. B. (1986). Accurate approximations for posterior moments and marginal densities. Journal of the American Statistical Association, 81, 82–86.
Vehtari, A., Gabry, J., Yao, Y., & Gelman, A. (2019). loo: Efficient leave-one-out cross-validation and WAIC for Bayesian models. Retrieved from https://CRAN.R-project.org/package=loo (R package version 2.1.0).
Vehtari, A., Gelman, A., & Gabry, J. (2017). Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC. Statistics and Computing, 27, 1413–1432. https://doi.org/10.1007/s11222-016-9696-4.
Vehtari, A., & Ojanen, J. (2012). A survey of Bayesian predictive methods for model assessment, selection and comparison. Statistics Surveys, 6, 142–228. https://doi.org/10.1214/12-SS102.
Watanabe, S. (2010). Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory. Journal of Machine Learning Research, 11, 3571–3594.
Winkler, R. L. (1996). Scoring rules and the evaluation of probabilities. Test, 5, 1–60.
Wolpert, D. H. (1992). Stacked generalization. Neural Networks, 5, 241–259.
Yao, Y., Vehtari, A., Simpson, D., & Gelman, A. (2018). Using stacking to average Bayesian predictive distributions (with discussion). Bayesian Analysis, 13, 917–1007. https://doi.org/10.1214/17-BA1091.
Yeung, K. Y., Bumgarner, R. E., & Raftery, A. E. (2005). Bayesian model averaging: Development of an improved multi-class, gene selection, and classification tool for microarray data. Bioinformatics, 21, 2394–2402.
Zellner, A. (1986). On assessing prior distributions and Bayesian regression analysis with \(g\) prior distributions. In P. Goel & A. Zellner (Eds.), Bayesian Inference and Decision Techniques: Essays in Honor of Bruno de Finetti. Studies in Bayesian Econometrics (pp. 233–243). New York: Elsevier.
Zeugner, S., & Feldkircher, M. (2015). Bayesian model averaging employing fixed and flexible priors: The BMS package for R. Journal of Statistical Software, 68(4), 1–37. https://doi.org/10.18637/jss.v068.i04.
Zou, H., & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 67, 301–320. https://doi.org/10.1111/j.1467-9868.2005.00503.x.