Abstract
Machine learning (ML) models, particularly decision tree (DT)-based algorithms, are being increasingly utilized for flood susceptibility mapping. To evaluate the advantages of DT-based ML models over traditional statistical models on flood susceptibility assessment, a comparative study is needed to systematically compare the performances of DT- based ML models with that of traditional statistical models. New Orleans, which has a long history of flooding and is highly susceptible to flooding, is selected as the test bed. The primary purpose of this study is to compare the performance of multiple DT-based ML models namely DT, Adaptive Boosting (AdaBoost), Gradient Boosting (GdBoost), Extreme Gradient Boosting (XGBoost) and Random Forest (RF) models with a traditional statistical model known as Frequency Ratio (FR) model in New Orleans. This study also aims to identify the main drivers contributing to flooding in New Orleans using the best performing model. Based on the most recent Hurricane Ida-induced flood inventory map and nine crucial flood conditioning factors, the models’ accuracies are tested and compared using multiple evaluation metrics. The findings of this study indicate that all DT-based ML models perform better compared to FR. The RF model emerges as the best model (AUC = 0.85) among all DT-based ML models in every evaluation metrics. This study then adopts the RF model to simulate flood susceptibility map (FSM) of New Orleans and compares it with the prediction of FR model. The RF model also demonstrates that low elevation and higher precipitation are the main factors responsible for flooding in New Orleans. Therefore, this comparative approach offers a significant understanding about the advantages of advanced ML models over traditional statistical models in local flood susceptibility assessment.
Similar content being viewed by others
References
Abedi R, Costache R, Shafizadeh-Moghadam H, Pham QB (2022) Flash-flood susceptibility mapping based on XGBoost, random forest and boosted regression trees. Geocarto Int 37(19):5479–5496
Akinci H, Zeybek M (2021) Comparing classical statistic and machine learning models in landslide susceptibility mapping in Ardanuc (Artvin). Turk Nat Hazards 108(2):1515–1543
Al-Abadi AM (2018) Mapping flood susceptibility in an arid region of southern Iraq using ensemble machine learning classifiers: a comparative study. Arab J Geosci 11:1–19
Ali SA, Khatun R, Ahmad A, Ahmad SN (2019) Application of GIS-based analytic hierarchy process and frequency ratio model to flood vulnerable mapping and risk area estimation at Sundarban region. India Modeling Earth Syst Environ 5(3):1083–1102
Amare S, Langendoen E, Keesstra S, Ploeg MVD, Gelagay H, Lemma H, van der Zee SE (2021) Susceptibility to gully erosion: applying random forest (RF) and frequency ratio (FR) approaches to a small catchment in Ethiopia. Water 13(2):216
Ayalew L, Yamagishi H (2005) The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko mountains central Japan. Geomorphology 65(1–2):15–31
Beven, J.L., Hagan, A., and Berg, R. (2022). Hurricane Ida 26 August – 1 September 2021. National Hurricane Center Tropical Cyclone Report, 163 pp. https://www.nhc.noaa.gov/data/tcr/AL092021_Ida.pdf
Breiman L (2001) Random forests. Mach Learn 45:5–32
Brown VM, Keim BD, Black A (2019) Climatology and trends in hourly precipitation for the Southeast United States. J Hydrometeorol 20(8):1737–1755
Campanella, R. (2006). Geographies of New Orleans: Urban Fabrics Before the Storm. Center for Louisiana Studies, University of Louisiana-Lafayette, Lafayette, Louisisana, 433 pp
Cai H, Lam N, Zou L, Qiang Y, Li K (2016) Assessing community resilience to coastal hazards in the lower Mississippi River Basin. Water 8(2):46–64
Cass E, Shao W, Hao F, Moradkhani H, Yeates E (2023) Identifying trends in interpretation and responses to hurricane and climate change communication tools. Int J Disaster Risk Reduct. https://doi.org/10.1016/j.ijdrr.2023.103752
Costache R, Pham QB, Avand M, Linh NTT, Vojtek M, Vojteková J, Dung TD (2020) Novel hybrid models between bivariate statistics, artificial neural networks and boosting algorithms for flood susceptibility assessment. J Environ Manag 265:110485
Dano UL, Balogun AL, Matori AN, Wan Yusouf K, Abubakar IR, Said Mohamed MA, Pradhan B (2019) Flood susceptibility mapping using GIS-based analytic network process: a case study of Perlis Malaysia. Water 11(3):615
Das S (2020) Flood susceptibility mapping of the Western Ghat coastal belt using multi-source geospatial data and analytical hierarchy process (AHP). Remote Sens Appl: Soc Environ 20:100379
Elkhrachy I, Pham QB, Costache R, Mohajane M, Rahman KU, Shahabi H, Anh DT (2021) Sentinel-1 remote sensing data and hydrologic engineering centres river analysis system two-dimensional integration for flash flood detection and modelling in New Cairo City Egypt. J Flood Risk Manag 14(2):e12692
Elliott J, Pais J (2006) Race, class, and Hurricane Katrina: Social differences in human responses to disaster. Soc Sci Res 35(2):295–321
Elmahdy SI, Mohamed MM, Ali TA, Abdalla JED, Abouleish M (2020) Land subsidence and sinkholes susceptibility mapping and analysis using random forest and frequency ratio models in Al Ain UAE. Geocarto Int 37(1):315–331
Falah, F., Rahmati, O., Rostami, M., Ahmadisharaf, E., Daliakopoulos, I. N., & Pourghasemi, H. R. (2019). Artificial neural networks for flood susceptibility mapping in data-scarce urban areas. In Spatial modeling in GIS and R for Earth and Environmental Sciences (pp. 323–336). Elsevier.
Farhadi H, Najafzadeh M (2021) Flood risk mapping by remote sensing data and random forest technique. Water 13(21):3115
FEMA (2006) Hurricane Katrina in the Gulf Coast (FEMA 548). Federal Emergency Management Agency, Washington DC
Finch C, Emrich CT, Cutter SL (2010) Disaster disparities and differential recovery in new Orleans. Popul Environ 31:179–202
Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow. " O'Reilly Media, Inc.".
Ghanim AA, Shaf A, Ali T, Zafar M, Al-Areeq AM, Alyami SH, Rahman S (2023) An improved flood susceptibility assessment in Jeddah, Saudi Arabia, using advanced machine learning techniques. Water 15(14):2511
Goetz JN, Brenning A, Petschko H, Leopold P (2015) Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling. Comput Geosci 81:1–11
Gorelick N, Hancher M, Dixon M, Ilyushchenko S, Thau D, Moore R (2017) Google earth engine: planetary-scale geospatial analysis for everyone. Remote Sens Environ 202:18–27
Hallegatte S, Green C, Nicholls RJ, Corfee-Morlot J (2013) Future flood losses in major coastal cities. Nat Clim Chang 3:802–806. https://doi.org/10.1038/nclimate1979
Han J, Kim J, Park S, Son S, Ryu M (2020) Seismic vulnerability assessment and mapping of Gyeongju, South Korea using frequency ratio, decision tree, and random forest. Sustainability 12(18):7787
Hasanuzzaman M, Islam A, Bera B, Shit PK (2022) A comparison of performance measures of three machine learning algorithms for flood susceptibility mapping of river Silabati (tropical river, India). Phys Chem Earth Parts a/b/c 127:103198
Held IM, Soden BJ (2006) Robust responses of the hydrological cycle to global warming. J Clim 19(21):5686–5699
Hengl T (2018). Soil texture classes (USDA system) for 6 soil depths (0, 10, 30, 60, 100 and 200 cm) at 250 m (Version v02) [Data set]. Zenodo. 10.5281/zenodo.1475451
Hoque MAA, Tasfia S, Ahmed N, Pradhan B (2019) Assessing spatial flood vulnerability at Kalapara Upazila in Bangladesh using an analytic hierarchy process. Sensors 19(6):1302
Islam ARMT, Talukdar S, Mahato S, Kundu S, Eibek KU, Pham QB, Linh NTT (2021) Flood susceptibility modelling using advanced ensemble machine learning models. Geosci Front 12(3):101075
Islam MM, Sado K (2000) Development of flood hazard maps of Bangladesh using NOAA-AVHRR images with GIS. Hydrol Sci J 45(3):337–355
Jenks GF (1967) The data model concept in statistical mapping. Int Yearb Cartogr 7:186–190
Keim, B. D., & Muller, R. A. (2009). Hurricanes of the Gulf of Mexico. LSU Press. 232 pp.
Keim BD, Muller RA (1993) Frequency of heavy rainfall events in New Orleans, Louisiana, 1900 to 1991. Southeast Geogr 33(2):159–171
Keim BD, Muller RA (1992) Magnitude Fluctuations of Heavy Rainfall in New Orleans, Louisiana: 1871–1991. Water Resour Bull 28(4):721–730
Khosravi K, Pham BT, Chapi K, Shirzadi A, Shahabi H, Revhaug I, Bui DT (2018) A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Sci Total Environ 627:744–755
Khosravi K, Pourghasemi HR, Chapi K, Bahri M (2016) Flash flood susceptibility analysis and its mapping using different bivariate models in Iran: a comparison between Shannon’s entropy, statistical index, and weighting factor models. Environ Monit Assess 188:1–21
Kia MB, Pirasteh S, Pradhan B, Mahmud AR, Sulaiman WNA, Moradi A (2012) An artificial neural network model for flood simulation using GIS: Johor River Basin Malaysia. Environ Earth Sci 67(1):251–264
Kopp RE, DeConto RM, Bader DA, Hay CC, Horton RM, Kulp S, Oppenheimer M, Pollard D, Strauss BH (2017) Evolving understanding of Antarctic ice-sheet physics and ambiguity in probabilistic sea-level projections. Earth’s Future 5:1217–1233. https://doi.org/10.1002/2017EF000663
Lee, M.-J., Kang, J.-e., & Jeon, S. (2012). Application of frequency ratio model and validation for predictive flooded area susceptibility mapping using GIS. Paper presented at the 2012 IEEE international geoscience and remote sensing symposium.
Lee S, Kim JC, Jung HS, Lee MJ, Lee S (2017) Spatial prediction of flood susceptibility using random-forest and boosted-tree models in Seoul metropolitan city, Korea. Geomat Nat Haz Risk 8(2):1185–1203
Liang Z, Wang CM, Zhang ZM, Khan KUJ (2020) A comparison of statistical and machine learning methods for debris flow susceptibility mapping. Stoch Env Res Risk Assess 34:1887–1907
Madhuri R, Sistla S, Srinivasa Raju K (2021) Application of machine learning algorithms for flood susceptibility assessment and risk management. J Water Clim Change 12(6):2608–2623
Manandhar, B. (2010). Flood plain analysis and risk assessment of Lothar Khola. Master of Science Thesis in Watershed Management. Tribhuvan University Institute of Forestry Pokhara, Nepal.
National Centers for Environmental Information. (2019). Assessing the U.S. Climate in 2019. https://www.ncei.noaa.gov/news/national-climate-201912
National Academies of Sciences, Engineering, and Medicine (2019). Framing the Challenge of Urban Flooding in the United States. Washington, DC: The National Academies Press. https://doi.org/10.17226/25381.
Ogden FL, Pradhan NR, Downer CW, Zahner JA (2011) Relative importance of impervious area, drainage density, width function, and subsurface storm drainage on flood runoff from an urbanized catchment. Water Resour Res 47:W12503. https://doi.org/10.1029/2011WR010550
Oliveira S, Oehler F, San-Miguel-Ayanz J, Camia A, Pereira JM (2012) Modeling spatial patterns of fire occurrence in Mediterranean Europe using multiple regression and random forest. For Ecol Manage 275:117–129
Omer, S (2021). 2021 Hurricane Ida: Facts, FAQs, and how to help. World Vision. https://www.worldvision.org/disaster-relief-news-stories/2021-hurricane-ida-facts (Accessed on February 28, 2022).
Paul GC, Saha S, Hembram TK (2019) Application of the GIS-based probabilistic models for mapping the flood susceptibility in Bansloi sub-basin of Ganga-Bhagirathi river and their comparison. Remote Sens Earth Syst Sci 2(2):120–146
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Duchesnay É (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830
Pekel JF, Cottam A, Gorelick N, Belward AS (2016) High-resolution mapping of global surface water and its long-term changes. Nature 540(7633):418–422
Pistrika AK, Jonkman SN (2010) Damage to residential buildings due to flooding of New Orleans after hurricane Katrina. Nat Hazards 54:413–434
Powell EJ, Keim BD (2015) Trends in Daily Temperature and Precipitation Extremes for the Southeastern United States: 1948–2012. J Climate 28:1592–1612. https://doi.org/10.1175/JCLI-D-14-00410.1
Quader MA, Dey H, Malak MA, Sajib AM (2021) Rohingya refugee flooding and changes of the physical and social landscape in Ukhiya, Bangladesh. Environ Dev Sustain 23:4634–4658
Rahman M, Ningsheng C, Islam MM, Dewan A, Iqbal J, Washakh RMA, Shufeng T (2019) Flood susceptibility assessment in Bangladesh using machine learning and multi-criteria decision analysis. Earth Syst Environ 3(3):585–601
Rahman M, Ningsheng C, Mahmud GI, Islam MM, Pourghasemi HR, Ahmad H, Dewan A (2021) Flooding and its relationship with land cover change, population growth, and road density. Geosci Front 12(6):101224
Rahmati O, Pourghasemi HR, Zeinivand H (2016) Flood susceptibility mapping using frequency ratio and weights-of-evidence models in the Golastan Province Iran. Geocarto Int 31(1):42–70
Rodrigues M, De la Riva J (2014) An insight into machine-learning algorithms to model human-caused wildfire occurrence. Environ Model Softw 57:192–201
Sahana M, Rehman S, Sajjad H, Hong H (2020) Exploring effectiveness of frequency ratio and support vector machine models in storm surge flood susceptibility assessment: a study of Sundarban biosphere reserve. India Catena 189:104450
Samanta RK, Bhunia GS, Shit PK, Pourghasemi HR (2018) Flood susceptibility mapping using geospatial frequency ratio technique: a case study of Subarnarekha River Basin India. Modeling Earth Syst Environ 4(1):395–408
Sarkar D, Mondal P (2020) Flood vulnerability mapping using frequency ratio (FR) model: a case study on Kulik river basin Indo-Bangladesh Barind Region. Appl Water Sci 10(1):1–13
Shahabi H, Shirzadi A, Ghaderi K, Omidvar E, Al-Ansari N, Clague JJ, Ahmad A (2020) Flood detection and susceptibility mapping using sentinel-1 remote sensing data and a machine learning approach: hybrid intelligence of bagging ensemble based on k-nearest neighbor classifier. Remote Sens 12(2):266
Shahabi H, Shirzadi A, Ronoud S, Asadi S, Pham BT, Mansouripour F, Bui DT (2021) Flash flood susceptibility mapping using a novel deep learning model based on deep belief network, back propagation and genetic algorithm. Geosci Front 12(3):101100
Sweet, W.V., B.D. Hamlington, R.E. Kopp, C.P. Weaver, P.L. Barnard, D. Bekaert, W. Brooks, M. Craghan, G. Dusek, T. Frederikse, G. Garner, A.S. Genz, J.P. Krasting, E. Larour, D. Marcy, J.J. Marra, J. Obeysekera, M. Osler, M. Pendleton, D. Roman, L. Schmied, W. Veatch, K.D. White, and C. Zuzak, (2022). Global and Regional Sea Level Rise Scenarios for the United States: Updated Mean Projections and Extreme Water Level Probabilities Along U.S. Coastlines. NOAA Technical Report NOS 01. National Oceanic and Atmospheric Administration, National Ocean Service, Silver Spring, MD, 111 pp. https://oceanservice.noaa.gov/hazards/sealevelrise/noaa-nostechrpt01- global-regional-SLR-scenarios-US.pdf
Tehrany MS, Pradhan B, Jebur MN (2013) Spatial prediction of flood susceptible areas using rule based decision tree (DT) and a novel ensemble bivariate and multivariate statistical models in GIS. J Hydrol 504:69–79
Tehrany MS, Lee MJ, Pradhan B, Jebur MN, Lee S (2014) Flood susceptibility mapping using integrated bivariate and multivariate statistical models. Environ Earth Sci 72(10):4001–4015
Tehrany MS, Pradhan B, Jebur MN (2015a) Flood susceptibility analysis and its verification using a novel ensemble support vector machine and frequency ratio method. Stoch Env Res Risk Assess 29:1149–1165
Tehrany MS, Pradhan B, Mansor S, Ahmad N (2015b) Flood susceptibility assessment using GIS-based support vector machine model with different kernel types. CATENA 125:91–101
Tehrany MS, Kumar L (2018) The application of a Dempster–Shafer-based evidential belief function in flood susceptibility mapping and comparison with frequency ratio and logistic regression methods. Environ Earth Sci 77(13):1–24
Tellman B, Sullivan JA, Kuhn C, Kettner AJ, Doyle CS, Brakenridge GR, Slayback DA (2021) Satellite imaging reveals increased proportion of population exposed to floods. Nature 596(7870):80–86
Thanh NN, Chotpantarat S, Trung NH, Ngu NH (2022) Mapping groundwater potential zones in Kanchanaburi Province, Thailand by integrating of analytic hierarchy process, frequency ratio, and random forest. Ecol Ind 145:109591
Tien Bui D, Ho TC, Pradhan B, Pham BT, Nhu VH, Revhaug I (2016) GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks. Environ Earth Sci 75:1–22
Tripathi R, Sengupta SK, Patra A, Chang H, Jung IW (2014) Climate change, urban development, and community perception of an extreme flood: A case study of Vernonia, Oregon, USA. Appl Geogr 46:137–146
USGCRP (2018): Impacts, Risks, and Adaptation in the United States: Fourth National Climate Assessment, Volume II: [Reidmiller, D.R., C.W. Avery, D.R. Easterling, K.E. Kunkel, K.L.M. Lewis, T.K. Maycock, and B.C. Stewart (eds.)]. U.S. Global Change Research Program, Washington, DC, USA, 1515 pp. doi: https://doi.org/10.7930/NCA4.2018.
Wang Y, Fang Z, Hong H, Peng L (2020a) Flood susceptibility mapping using convolutional neural network frameworks. J Hydrol 582:124482
Wang Y, Sun D, Wen H, Zhang H, Zhang F (2020b) Comparison of random forest model and frequency ratio model for landslide susceptibility mapping (LSM) in Yunyang County (Chongqing, China). Int J Environ Res Public Health 17(12):4206
Wang Z, Liu Q, Liu Y (2020c) Mapping landslide susceptibility using machine learning algorithms and GIS: A case study in Shexian county, Anhui province China. Symmetry 12(12):1954
Wing OEJ, Lehman W, Bates PD et al (2022) Inequitable patterns of US flood risk in the Anthropocene. Nat Clim Chang. https://doi.org/10.1038/s41558-021-01265-6
Wong, P.P., Losada, I.J., Gattuso, J.-P., Hinkel, J., Khattabi, A., Mclnnes, K.L., Saito, Y., Sallenger, A., (2014). Coastal Systems and Low-lying Area (Retrieved from Cambridge, United Kingdom and New York, NY, USA).
Youssef AM, Pourghasemi HR, El-Haddad BA (2022) Advanced machine learning algorithms for flood susceptibility modeling—performance comparison: Red Sea. Egypt Environ Sci Pollut Res 29(44):66768–66792
Funding
Funding was provide by National Oceanic and Atmospheric Administration (Grant No. PO-0000182731).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interests
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Dey, H., Shao, W., Moradkhani, H. et al. Urban flood susceptibility mapping using frequency ratio and multiple decision tree-based machine learning models. Nat Hazards (2024). https://doi.org/10.1007/s11069-024-06609-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11069-024-06609-x