The order of variables, simulation noise, and accuracy of mixed logit estimates

Palma, Marco A.; Vedenov, Dmitry V.; Bessler, David

doi:10.1007/s00181-018-1609-2


Published: 29 November 2018

Volume 58, pages 2049–2083 (2020)

Empirical Economics

Marco A. Palma¹,
Dmitry V. Vedenov¹ &
David Bessler¹


Abstract

The simulated choice probabilities in mixed logit models are usually approximated numerically using Halton or random draws from a multivariate mixing distribution for the random parameters. Theoretically, the order in which the estimated variables enter the model should not matter. In practice, however, simulation “noise” inherent in the numerical procedure makes the magnitude of the estimated coefficients depend on the arbitrary order in which the random variables are estimated. The problem is exacerbated when a small number of draws is used or when correlation among coefficients is allowed. In particular, the Cholesky factorization procedure, which is used to incorporate correlation into the model, propagates simulation noise in the estimate of one coefficient to the estimates of all subsequent coefficients in the model. Ignoring potential ordering effects in simulated maximum likelihood estimation may seriously compromise the ability to replicate results and can inadvertently influence policy recommendations. We find that better estimation accuracy is achieved with Halton draws using small prime numbers, as is the case for small integrating dimensions; random draws, however, provide better accuracy than Halton draws from large prime numbers, as is normally the case in high integrating dimensions. With correlation, the estimated standard deviations fluctuate widely depending on the order of the variables, which affects the conclusions regarding heterogeneity of preferences.
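The ordering mechanism described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the covariance matrix, the number of draws, and the helper names `cholesky`, `correlated_draws`, and `sample_cov` are assumptions chosen for illustration. The same set of independent draws is pushed through the Cholesky factor under two variable orderings. In the population both orderings imply the same covariance, but with a finite number of draws the simulated values attached to each coefficient differ.

```python
import math
import random

def cholesky(sigma):
    """Return the lower-triangular factor L with L L^T = sigma."""
    n = len(sigma)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(sigma[i][i] - s)
            else:
                L[i][j] = (sigma[i][j] - s) / L[j][j]
    return L

def correlated_draws(sigma, eta, order):
    """Turn independent draws `eta` into correlated ones for a given ordering.

    Column j of each row of `eta` goes to the j-th variable in `order`,
    mimicking a simulator that assigns draw dimensions by position.
    Results are returned under the original variable labels.
    """
    n = len(order)
    sigma_perm = [[sigma[order[i]][order[j]] for j in range(n)] for i in range(n)]
    L = cholesky(sigma_perm)
    out = []
    for row in eta:
        x = [0.0] * n
        for i in range(n):
            # The variable in position i mixes the draws of positions 0..i,
            # so error in an earlier dimension carries into every later one.
            x[order[i]] = sum(L[i][k] * row[k] for k in range(i + 1))
        out.append(x)
    return out

def sample_cov(draws, i, j):
    """Sample covariance between columns i and j of a list of draws."""
    r = len(draws)
    mi = sum(d[i] for d in draws) / r
    mj = sum(d[j] for d in draws) / r
    return sum((d[i] - mi) * (d[j] - mj) for d in draws) / (r - 1)

random.seed(0)
# Hypothetical covariance of three random coefficients (assumption).
sigma = [[1.0, 0.6, 0.3],
         [0.6, 1.0, 0.5],
         [0.3, 0.5, 1.0]]
# Deliberately few draws, so the simulation noise is visible.
eta = [[random.gauss(0.0, 1.0) for _ in range(3)] for _ in range(100)]

for order in ([0, 1, 2], [2, 1, 0]):
    draws = correlated_draws(sigma, eta, order)
    variances = [round(sample_cov(draws, k, k), 3) for k in range(3)]
    print(order, variances)
```

The printed sample variances should all be near 1 but need not agree across the two orderings, because each ordering assigns a different linear combination of the same finite draws to each coefficient.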


Notes

The first numbers in Halton sequences are highly correlated because of the way they are constructed. To reduce this problem, it is recommended to discard (“burn”) at least the first n draws, where n is the largest prime number used.
The models were also estimated with 200, 500, and 1000 draws. The results were consistent across the number of draws and are available in the Appendix.
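The construction behind the first note can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation used in the article: the function names `radical_inverse` and `halton` are hypothetical, and the default burn length simply follows the note's recommendation of the largest prime used.

```python
def radical_inverse(index, base):
    """Reflect the base-`base` digits of `index` about the radix point."""
    result, f = 0.0, 1.0 / base
    while index > 0:
        index, digit = divmod(index, base)
        result += digit * f
        f /= base
    return result

def halton(n_draws, primes, burn=None):
    """Return `n_draws` Halton points, one coordinate per prime base.

    The first `burn` points are discarded; by default `burn` equals the
    largest prime, following the recommendation in the note above.
    """
    if burn is None:
        burn = max(primes)
    return [[radical_inverse(i, p) for p in primes]
            for i in range(burn + 1, burn + n_draws + 1)]

# Two dimensions with small primes, default burn of 3 points.
low = halton(n_draws=5, primes=[2, 3])

# Two dimensions with larger primes and no burn (for illustration only).
high = halton(n_draws=5, primes=[29, 31], burn=0)

for point in high:
    print([round(u, 4) for u in point])
```

With bases 29 and 31 and no burn, the first 28 points are exactly i/29 and i/31, so the two coordinates lie on a straight line. This is the kind of correlation across dimensions that the note refers to, and it is more pronounced for large primes.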


Author information

Authors and Affiliations

Department of Agricultural Economics, Texas A&M University, 2124 TAMU, College Station, TX, 77843, USA
Marco A. Palma, Dmitry V. Vedenov & David Bessler


Corresponding author

Correspondence to Marco A. Palma.

Appendix

See Tables 7, 8, 9, 10, 11, 12, 13, 14, and 15.

Table 7 Average parameter estimates of the mean and standard deviations representing the “simulation noise” over all 120 possible orderings without correlation for different numbers of Halton draws

Table 8 Average parameter estimates of the mean and standard deviations representing the “simulation noise” over all 120 possible orderings without correlation for different numbers of random draws

Table 9 Average parameter estimates of the mean and standard deviations representing the “simulation noise” over all 120 possible orderings without correlation for different numbers of Halton High Primes draws

Table 10 Average parameter estimates of the mean and standard deviations representing the “simulation noise” over all 120 possible orderings with correlation for different numbers of Halton draws

Table 11 Average parameter estimates of the mean and standard deviations representing the “simulation noise” over all 120 possible orderings with correlation for different numbers of random draws

Table 12 Average parameter estimates of the mean and standard deviations representing the “simulation noise” over all 120 possible orderings with correlation for different numbers of Halton High Primes draws

Table 13 Average parameter estimates of the mean and standard deviations representing the “Cholesky factorization effect” over all 120 possible orderings with correlation for different numbers of Halton draws

Table 14 Average parameter estimates of the mean and standard deviations representing the “Cholesky factorization effect” over all 120 possible orderings with correlation for different numbers of random draws

Table 15 Average parameter estimates of the mean and standard deviations representing the “Cholesky factorization effect” over all 120 possible orderings with correlation for different numbers of Halton High Primes draws


About this article

Cite this article

Palma, M.A., Vedenov, D.V. & Bessler, D. The order of variables, simulation noise, and accuracy of mixed logit estimates. Empir Econ 58, 2049–2083 (2020). https://doi.org/10.1007/s00181-018-1609-2


Received: 12 April 2017
Accepted: 30 October 2018
Published: 29 November 2018
Issue Date: May 2020
DOI: https://doi.org/10.1007/s00181-018-1609-2
