Abstract
Population densities provide valuable spatial information to identify populations at risk, quantify mobility, and improve our understanding of future urban settlements. Advancements in machine learning algorithms open up new horizons to face these challenges. This research proposes a supervised machine learning approach, Random Forest, for population density appraisal in a large and dense developing city. We studied Bogotá, where functional integration with neighboring municipalities exists, although they have different governments and uncoordinated urban development plans. As a starting point, we use simulated residential land-use patterns, classified according to socioeconomic levels, from a cellular automata-based model. We estimate population density with reliable land-use change models and nine simple representations of the urban structure, such as land values and the distance to urban amenities. Therefore, combining a cellular automata model with a classification model, considering both continuous and categorical variables, demonstrates this methodology’s potential and promises a reliable assessment of population density. Finally, we present a trip generation model integrated with densities and spatial location. A comprehensive results discussion suggests this study’s importance in urban planning and the accuracy of the proposed methodology to support decision-making processes and policy evaluation.
Similar content being viewed by others
References
Archila Bustos, M. F., Hall, O., Niedomysl, T., & Ernstson, U. (2020). A pixel level evaluation of five multitemporal global gridded population datasets: A case study in Sweden, 1990–2015. Population and Environment, 42(2), 255–277. https://doi.org/10.1007/s11111-020-00360-8
Bassolas, A., Barbosa-Filho, H., Dickinson, B., Dotiwalla, X., Eastham, P., Gallotti, R., et al. (2019). Hierarchical organization of urban mobility and its connection with city livability. Nature Communications, 10(1), 4817. https://doi.org/10.1038/s41467-019-12809-y
Batty, M., & Longley, P. (1994). Fractal cities: A geometry of form and function. Academic Press.
Belgiu, M., & Drăguţ, L. (2016). Random forest in remote sensing: A review of applications and future directions. ISPRS Journal of Photogrammetry and Remote Sensing, 114, 24–31. https://doi.org/10.1016/j.isprsjprs.2016.01.011
Bettencourt, L. M. A., Lobo, J., Helbing, D., Kuhnert, C., & West, G. B. (2007). Growth, innovation, scaling, and the pace of life in cities. Proceedings of the National Academy of Sciences, 104(17), 7301–7306. https://doi.org/10.1073/pnas.0610172104
Boulange, C., Gunn, L., Giles-Corti, B., Mavoa, S., Pettit, C., & Badland, H. (2017). Examining associations between urban design attributes and transport mode choice for walking, cycling, public transport and private motor vehicle trips. Journal of Transport and Health, 6, 155–166. https://doi.org/10.1016/j.jth.2017.07.007
Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32. https://doi.org/10.1023/A:1010933404324.
Cantillo-García, V., Guzman, L. A., & Arellana, J. (2019). Socioeconomic strata as proxy variable for household income in transportation research. Evaluation for Bogotá, Medellín, Cali and Barranquilla. DYNA, 86(211), 258–267. https://doi.org/10.15446/dyna.v86n211.81821.
Carlino, G. A., Chatterjee, S., & Hunt, R. M. (2007). Urban density and the rate of invention. Journal of Urban Economics, 61(3), 389–419. https://doi.org/10.1016/j.jue.2006.08.003
Cervero, R. B. (2013). Linking urban transport and land use in developing countries. Journal of Transport and Land Use, 6(1), 7. https://doi.org/10.5198/jtlu.v6i1.425
Chen, Y. (2010). A new model of urban population density indicating latent fractal structure. International Journal of Urban Sustainable Development, 1(1–2), 89–110. https://doi.org/10.1080/19463131003661025
Chi, G. (2009). Can knowledge improve population forecasts at subcounty levels? Demography, 46(2), 405–427. https://doi.org/10.1353/dem.0.0059
Chi, G., & Voss, P. R. (2011). Small-area population forecasting: Borrowing strength across space and time. Population, Space and Place, 17(5), 505–520. https://doi.org/10.1002/psp.617
Chi, G., Zhou, X., & Voss, P. R. (2011). Small-area population forecasting in an urban setting: A spatial regression approach. Journal of Population Research, 28(2–3), 185–201. https://doi.org/10.1007/s12546-011-9053-6
Fernández-Aracil, P., & Ortuño-Padilla, A. (2016). Costs of providing local public services and compact population in Spanish urbanised areas. Land Use Policy, 58, 234–240. https://doi.org/10.1016/j.landusepol.2016.07.032
Gao, J., & O’Neill, B. C. (2020). Mapping global urban land for the 21st century with data-driven simulations and Shared Socioeconomic Pathways. Nature Communications, 11(1), 2302. https://doi.org/10.1038/s41467-020-15788-7
Gaughan, A. E., Stevens, F. R., Linard, C., Jia, P., & Tatem, A. J. (2013). High Resolution Population Distribution Maps for Southeast Asia in 2010 and 2015. PLoS ONE, 8(2), e55882. https://doi.org/10.1371/journal.pone.0055882
Gerland, P., Raftery, A. E., Ševčíková, H., Li, N., Gu, D., Spoorenberg, T., et al. (2014). World population stabilization unlikely this century. Science, 346(6206), 234–237. https://doi.org/10.1126/science.1257469
Güneralp, B., Reba, M., Hales, B. U., Wentz, E. A., & Seto, K. C. (2020). Trends in urban land expansion, density, and land transitions from 1970 to 2010: A global synthesis. Environmental Research Letters, 15(4), 044015. https://doi.org/10.1088/1748-9326/ab6669
Guzman, L. A. (2019). A strategic and dynamic land-use transport interaction model for Bogotá and its region. Transportmetrica b: Transport Dynamics, 7(1), 707–725. https://doi.org/10.1080/21680566.2018.1477636
Guzman, L. A., Arellana, J., Cantillo-García, V., & de Dios Ortúzar, J. (2021). Revisiting the benefits of combining data of a different nature: Strategic forecasting of new mode alternatives. Journal of Advanced Transportation, 2021, 1–15. https://doi.org/10.1155/2021/6672961.
Guzman, L. A., & Bocarejo, J. P. (2017). Urban form and spatial urban equity in Bogota, Colombia. Transportation Research Procedia, 25, 4491–4506. https://doi.org/10.1016/j.trpro.2017.05.345
Guzman, L. A., Escobar, F., Peña, J., & Cardona, R. (2020a). A cellular automata-based land-use model as an integrated spatial decision support system for urban planning in developing cities: The case of the Bogotá region. Land Use Policy, 92(104445), 104445. https://doi.org/10.1016/j.landusepol.2019.104445
Guzman, L. A., Oviedo, D., & Bocarejo, J. P. (2017a). City profile: The Bogotá Metropolitan Area that never was. Cities, 60, 202–215. https://doi.org/10.1016/j.cities.2016.09.004
Guzman, L. A., Oviedo, D., & Rivera, C. (2017b). Assessing equity in transport accessibility to work and study: The Bogotá region. Journal of Transport Geography, 58, 236–246. https://doi.org/10.1016/j.jtrangeo.2016.12.016
Guzman, L. A., Peña, J., & Carrasco, J. A. (2020b). Assessing the role of the built environment and sociodemographic characteristics on walking travel distances in Bogotá. Journal of Transport Geography, 88, 102844. https://doi.org/10.1016/j.jtrangeo.2020.102844
H2O.ai. (2021). H2O.ai. Distributed Random Forest (DRF). http://docs.h2o.ai/h2o/latest-stable/h2o-docs/data-science/drf.html
Jiao, J., Chen, Y., & Azimian, A. (2021). Exploring temporal varying demographic and economic disparities in COVID-19 infections in four U.S. areas: Based on OLS, GWR, and random forest models. Computational Urban Science, 1(1), 27. https://doi.org/10.1007/s43762-021-00028-5.
Jochem, W. C., Bird, T. J., & Tatem, A. J. (2018). Identifying residential neighbourhood types from settlement points in a machine learning approach. Computers, Environment and Urban Systems, 69, 104–113. https://doi.org/10.1016/j.compenvurbsys.2018.01.004
Kajimoto, M., & Susaki, J. (2013). Urban density estimation from polarimetric SAR images based on a POA correction method. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 6(3), 1418–1429. https://doi.org/10.1109/JSTARS.2013.2255584
Kamusoko, C., & Gamba, J. (2015). Simulating urban growth using a random forest-cellular automata (RF-CA) model. ISPRS International Journal of Geo-Information, 4(2), 447–470. https://doi.org/10.3390/ijgi4020447
Khosravi, K., Pham, B. T., Chapi, K., Shirzadi, A., Shahabi, H., Revhaug, I., et al. (2018). A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Science of the Total Environment, 627, 744–755. https://doi.org/10.1016/j.scitotenv.2018.01.266
Lewis, S. (2018). Neighborhood density and travel mode: New survey findings for high densities. International Journal of Sustainable Development & World Ecology, 25(2), 152–165. https://doi.org/10.1080/13504509.2017.1321052
Liu, X. H., Kyriakidis, P. C., & Goodchild, M. F. (2008). Population-density estimation using regression and area-to-point residual kriging. International Journal of Geographical Information Science, 22(4), 431–447. https://doi.org/10.1080/13658810701492225
Onda, K., Sinha, P., Gaughan, A. E., Stevens, F. R., & Kaza, N. (2019). Missing millions: Undercounting urbanization in India. Population and Environment, 41(2), 126–150. https://doi.org/10.1007/s11111-019-00329-2
Ortúzar J, & Willumsen, L. G. (2011). Modelling transport (4th Editio.). Chichester: Wiley.
Puissant, A., Rougier, S., & Stumpf, A. (2014). Object-oriented mapping of urban trees using Random Forest classifiers. International Journal of Applied Earth Observation and Geoinformation, 26, 235–245. https://doi.org/10.1016/j.jag.2013.07.002
Qiang, Y., Xu, J., & Zhang, G. (2020). The shapes of US cities: Revisiting the classic population density functions using crowdsourced geospatial data. Urban Studies, 57(10), 2147–2162. https://doi.org/10.1177/0042098019871191
Rahmati, O., Pourghasemi, H. R., & Melesse, A. M. (2016). Application of GIS-based data driven random forest and maximum entropy models for groundwater potential mapping: A case study at Mehran Region, Iran. CATENA, 137, 360–372. https://doi.org/10.1016/j.catena.2015.10.010
Raymer, J., Abel, G. J., & Rogers, A. (2012). Does specification matter? Experiments with simple multiregional probabilistic population projections. Environment and Planning a: Economy and Space, 44(11), 2664–2686. https://doi.org/10.1068/a4533
Sarmiento, O. L., Useche, A. F., Guzman, L. A., Rodriguez, D. A., Dronova, I., Guaje, O., et al. (2021). Built environment profiles for Latin American urban settings: The SALURBAL study. PLoS ONE, 16(10), e0257528. https://doi.org/10.1371/journal.pone.0257528
SDP - Secretaría Distrital de Planeación de. (2018). Análisis demográfico y proyecciones poblacionales de Bogotá. Bogotá. http://www.sdp.gov.co/sites/default/files/demografia_proyecciones_2017_0.pdf
Sinha, P., Gaughan, A. E., Stevens, F. R., Nieves, J. J., Sorichetta, A., & Tatem, A. J. (2019). Assessing the spatial sensitivity of a random forest model: Application in gridded population modeling. Computers, Environment and Urban Systems, 75, 132–145. https://doi.org/10.1016/j.compenvurbsys.2019.01.006
Smith, S. K., Tayman, J., & Swanson, D. A. (2013). A Practitioner’s Guide to State and Local Population Projections. (S. K. Smith, J. Tayman, & D. A. Swanson, Eds.) (Vol. 37). Dordrecht: Springer Netherlands. https://doi.org/10.1007/978-94-007-7551-0
Stevens, F. R., Gaughan, A. E., Linard, C., & Tatem, A. J. (2015). Disaggregating census data for population mapping using random forests with remotely-sensed and ancillary data. PLoS ONE, 10(2), e0107042. https://doi.org/10.1371/journal.pone.0107042
Stevens, F. R., Gaughan, A. E., Nieves, J. J., King, A., Sorichetta, A., Linard, C., & Tatem, A. J. (2020). Comparisons of two global built area land cover datasets in methods to disaggregate human population in eleven countries from the global South. International Journal of Digital Earth, 13(1), 78–100. https://doi.org/10.1080/17538947.2019.1633424
UAECD. (2019). UAECD. Unidad Administrativa Especial de Catastro Digital. https://datosabiertos.bogota.gov.co/dataset?q=UAECD
van Delden, H., & Vanhout, R. (2018). A short presentation of Metronamica. In Geomatic Approaches for Modeling Land Change Scenarios (pp. 511–519). https://doi.org/10.1007/978-3-319-60801-3_38
van Vliet, J., Hurkens, J., White, R., & van Delden, H. (2012). An activity-based cellular automaton model to simulate land-use dynamics. Environment and Planning b: Planning and Design, 39(2), 198–212. https://doi.org/10.1068/b36015
Wu, C., & Murray, A. T. (2005). A cokriging method for estimating population density in urban areas. Computers, Environment and Urban Systems, 29(5), 558–579. https://doi.org/10.1016/j.compenvurbsys.2005.01.006
Zheng, H. W., Shen, G. Q., Wang, H., & Hong, J. (2015). Simulating land use change in urban renewal areas: A case study in Hong Kong. Habitat International, 46, 23–34. https://doi.org/10.1016/j.habitatint.2014.10.008
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Guzman, L.A., Camacho, R., Herrera, A.R. et al. Modeling population density guided by land use-cover change model: a case study of Bogotá. Popul Environ 43, 553–575 (2022). https://doi.org/10.1007/s11111-022-00400-5
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11111-022-00400-5