Abstract
The paper introduces two new data augmentation algorithms for sampling the parameters of a binary or multinomial logit model from their posterior distribution within a Bayesian framework. The new samplers are based on rewriting the underlying random utility model in such away that only differences of utilities are involved. As a consequence, the error term in the logit model has a logistic distribution. If the logistic distribution is approximated by a finite scale mixture of normal distributions, auxiliary mixture sampling can be implemented to sample from the posterior of the regression parameters. Alternatively, a data augmented Metropolis–Hastings algorithm can be formulated by approximating the logistic distribution by a single normal distribution. A comparative study on five binomial and multinomial data sets shows that the new samplers are superior to other data augmentation samplers and to Metropolis–Hastings sampling without data augmentation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Albert, J. H. & Chib, S. (1993). Bayesian analysis of binary and polychotomous response data, Journal of the American Statistical Association 88: 669–679.
Andrews, D. F. & Mallows, C. L. (1974). Scale mixtures of normal distributions, Journal of the Royal Statistical Society, Ser. B 36: 99–102.
Balakrishnan, N. (ed.) (1992). Handbook of the Logistic Distribution, Marcel Dekker, New York.
Chib, S. (1995). Marginal likelihood from the Gibbs output, Journal of the American Statistical Association 90: 1313–1321.
Dellaportas, P. & Smith, A. F. M. (1993). Bayesian inference for generalized linear and proportional hazards models via Gibbs sampling, Applied Statistics 42: 443–459.
Fahrmeir, L. & Kaufmann, H. (1986a). Asymptotic inference in discrete response models, Statistical Papers 27: 179–205.
Fahrmeir, L. & Kaufmann, H. (1986b). Consistency and asymptotic normality of the maximum likelihood estimator in generalized linear models, The Annals of Statistics 13: 342–368.
Fahrmeir, L. & Tutz, G. (2001). Multivariate Statistical Modelling based on Generalized Linear Models, Springer Series in Statistics, 2nd ed., Springer, New York/Berlin/Heidelberg.
Frühwirth-Schnatter, S. & Frühwirth, R. (2007). Auxiliary mixture sampling with applications to logistic models, Computational Statistics and Data Analysis 51: 3509–3528.
Frühwirth-Schnatter, S., Frühwirth, R., Held, L. & Rue, H. (2009). Improved auxiliary mixture sampling for hierarchical models of non-Gaussian data, Statistics and Computing 19, forthcoming.
Frühwirth-Schnatter, S. & Wagner, H. (2009). Stochastic model specification search for Gaussian and partially non-Gaussian state space models, Journal of Econometrics, forthcoming.
Gamerman, D. (1997). Sampling from the posterior distribution in generalized linear mixed models, Statistics and Computing 7: 57–68.
Geyer, C. (1992). Practical Markov chain Monte Carlo, Statistical Science 7: 473–511.
Holmes, C. C. & Held, L. (2006). Bayesian auxiliary variable models for binary and multinomial regression, Bayesian Analysis 1: 145–168.
Imai, K. & van Dyk, D. A. (2005). A Bayesian analysis of the multinomial probit model using marginal data augmentation, Journal of Econometrics 124: 311–334.
Kass, R. E., Carlin, B., Gelman, A. & Neal, R. (1998). Markov chain Monte Carlo in practice: A roundtable discussion, The American Statistician 52: 93–100.
Lenk, P. J. & DeSarbo, W. S. (2000). Bayesian inference for finite mixtures of generalized linear models with random effects, Psychometrika 65: 93–119.
McCulloch, R. E., Polson, N. G. & Rossi, P. E. (2000). A Bayesian analysis of the multinomial probit model with fully identified parameters, Journal of Econometrics 99: 173–193.
McFadden, D. (1974). Conditional logit analysis of qualitative choice behaviour, in P. Zarembka (ed.), Frontiers of Econometrics, Academic, New York, pp. 105–142.
Monahan, J. F. & Stefanski, L. A. (1992). Normal scale mixture approximations to F*(z) and computation of the logistics normal integral, in N. Balakrishnan (ed.), Handbook of the Logistic Distribution, Marcel Dekker, New York, pp. 529–549.
Roberts, G. O. & Rosenthal, J. S. (2001). Optimal scaling for various Metropolis–Hastings algorithms, Statistical Science 16: 351–367.
Rossi, P. E., Allenby, G. M. & McCulloch, R. (2005). Bayesian Statistics and Marketing, Wiley, Chichester.
Scott, S. L. (2009). Data augmentation and the Bayesian analysis of multinomial logit models, Statistical Papers, forthcoming.
Tüchler, R. (2008). Bayesian variable selection for logistic models using auxiliary mixture sampling, Journal of Computational and Graphical Statistics 17: 76–94.
Zeger, S. L. & Karim, M. (1991). Generalized linear models with random effects: a Gibbs sampling approach, Journal of the American Statistical Association 86: 79–86.
Zellner, A. & Rossi, P. E. (1984). Bayesian analysis of dichotomous quantal response models, Journal of Econometrics 25: 365–393.
Acknowledgments
The first author’s research is supported by the Austrian Science Foundation (FWF) under the grant S 10309-G14 (NRN “The Austrian Center for Labor Economics and the Analysis of the Welfare State”, Subproject “Bayesian Econometrics”).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Frühwirth-Schnatter, S., Frühwirth, R. (2010). Data Augmentation and MCMC for Binary and Multinomial Logit Models. In: Kneib, T., Tutz, G. (eds) Statistical Modelling and Regression Structures. Physica-Verlag HD. https://doi.org/10.1007/978-3-7908-2413-1_7
Download citation
DOI: https://doi.org/10.1007/978-3-7908-2413-1_7
Published:
Publisher Name: Physica-Verlag HD
Print ISBN: 978-3-7908-2412-4
Online ISBN: 978-3-7908-2413-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)