Skip to main content
Log in

Developing and microsimulating demographic dynamics for an integrated urban model: a comparison between logistic regression and machine learning techniques

  • Published:
Transportation Aims and scope Submit manuscript

Abstract

Studies have shown that sociodemographic attributes significantly influence individuals' transportation choices. However, not all travel demand models do not account for this effect when predicting future travel scenarios. On the other hand, current integrated urban models (IUMs) that incorporate demographic dynamics mostly rely on conventional logit models and rule-based models. These models may not be optimal for complex modeling since they do not fully capture the non-linear relationship between inputs and output. In this research, we explore the feasibility of utilizing machine learning (ML) models to enhance the prediction of demographic dynamics within our proposed IUM—known as ‘STELARS’, in conjunction with conventional logit models. To address the challenge of the black-box nature of ML, we employ an explainable AI technique (xAI) to gain insights into the influence of the factors and compare them with the interpretation revealed by the logit models. Three demographic components are considered: marriage/common-law formation, separation and divorce, and childbirth events, while other components were developed using rate-based models. The results (on the testing dataset) indicate that ML models outperform conventional logit models in terms of overall accuracy by a margin of up-to 3%. However, when considering the true positive accuracy (correctly predicting the event of interest), a significant improvement of 30–48% is observed. Additionally, the xAI analysis reveals consistent interpretation with the logit model. Subsequently, we implemented our demographic dynamics module within our integrated urban modeling system to predict population changes in the Okanagan region of Canada. The multi-year validation of the simulation results against Census data suggests a reasonably close prediction of the observed population. We also optimize the runtime of the demographic dynamics module using vectorization, reducing the simulation time for the demographic changes in our study area (comprising approximately 200,000 individuals living in 85,000 households) to just about 100 s for the total 10 years of simulation. The development and implementation of this advanced demographic dynamics module to accurately predict the life events of individuals adds a fundamental capacity to the STELARS to be built as an event-based microsimulation model.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15

Similar content being viewed by others

References

Download references

Acknowledgements

The authors are grateful for the Discovery Grant provided by the National Sciences and Engineering Research Council. The authors would also like to thank Nathan Nichol from the University of British Columbia for proof editing the manuscript and offering some interesting suggestions to better present the results.

Author information

Authors and Affiliations

Authors

Contributions

Research Idea: MAK and MF, Methodology: MAK and MF, Data Collection: MAK and MF, Model Development: MAK, MF, and MO. Writing—First Draft: MAK and MF, Writing—Review: MF and MAK, Project Administration: MF, Acquisition of Funding: MF.

Corresponding author

Correspondence to Mohamad Ali Khalil.

Ethics declarations

Competing interests

We would like to declare that there is no conflict of interest between us.

AI-assisted technologies

During the preparation of this work, the authors used ChatGPT by OpenAI and Bing by Microsoft to proofread some sentences to enhance their readability. After using these tools, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Khalil, M.A., Fatmi, M.R. & Orvin, M. Developing and microsimulating demographic dynamics for an integrated urban model: a comparison between logistic regression and machine learning techniques. Transportation (2024). https://doi.org/10.1007/s11116-024-10468-7

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11116-024-10468-7

Keywords

Navigation