Abstract
Flash flooding is considered one of the most lethal natural hazards in the USA as measured by the ratio of fatalities to people affected. However, the occurrence of injuries and fatalities during flash flooding was found to be rare (about 2% occurrence rate) based on our analysis of 6,065 flash flood events that occurred in Texas over a 15-year period (2005 to 2019). This article identifies climatic, environmental, and situational factors that affect the occurrence of fatalities and injuries in flash flood events and provides a predictive model to estimate the likelihood of these occurrences. Due to the highly imbalanced dataset, three forms of logit models were investigated to achieve unbiased estimations of the model coefficients. The rare event logistic regression (Relogit) model was found to be the most suitable model. The model considers ten independent situational, climatic, and environmental variables that could affect human safety in flash flood events. Vehicle-related activities during flash flooding exhibited the greatest effect on the probability of human harm occurrence, followed by the event’s time (daytime vs. nighttime), precipitation amount, location with respect to the flash flood alley, median age of structures in the community, low water crossing density, and event duration. The application of the developed model as a simulation tool for informing flash flood mitigation planning was demonstrated in two study cases in Texas.
Similar content being viewed by others
Data availability
The data and code are available at the following DesignSafe-ci.org DOI: https://doi.org/10.17603/ds2-e91y-cv92.
References
Ahmadalipour A, Moradkhani H (2019) A data-driven analysis of flash flood hazard, fatalities, and damages over the CONUS during 1996–2017. J Hydrol 578:124106
Albert A, Anderson JA (1984) On the existence of maximum likelihood estimates in logistic regression models. Biometrika 71(1):1–10
Anderson DG (1970) Effects of urban development on floods in northern Virginia. US Government Printing Office, p 22
Ashley ST, Ashley WS (2008) Flood fatalities in the United States. J Appl Meteorol Climatol 47(3):805–818
Bailey JF (1989) Estimation of flood-frequency characteristics and the effects of urbanization for streams in the Philadelphia, Pennsylvania area. Department of the Interior, US Geological Survey, pp 87–4194
Baker VR (1975) Flood hazards along the Balcones escarpment in central Texas; alternative approaches to their recognition, mapping, and management. Virtual Landscapes of Texas
Benight CC, Gruntfest EC, Hayden M, Barnes L (2007) Trauma and short-fuse weather warning perceptions. Environ Hazards 7(3):220–226
Bull SB, Mak C, Greenwood CM (2002) A modified score function estimator for multinomial logistic regression in small samples. Comput Stat Data Anal 39(1):57–74
Caran SC, Baker VR (1986) Flooding along the balcones escarpment, central Texas. KIP Articles. 2088
Choirat C, Honaker J, Imai K, King G, Lau O (2020) Zelig: everyone’s statistical software. Version 5.1.7, https://zeligproject.org/
Clauset A, Woodard R (2013) Estimating the historical and future probabilities of large terrorist events. Annal Appl Stat 7(4):1838–1865
Clogg CC, Rubin DB, Schenker N, Schultz B, Weidman L (1991) Multiple imputation of industry and occupation codes in census public-use samples using bayesian logistic regression. J Am Stat Assoc 86(413):68–78
Cosslett SR (1981) Maximum likelihood estimator for choice-based samples. Econom J Econom Soc 49:1289–1316
Cox DR, Hinkley DV (1979) Theoretical statistics. CRC Press, Florida
Diakakis M (2020) Types of behavior of flood victims around floodwaters. Correlation with situational and demographic factors. Sustainability 12(11):4409
Firth D (1993) Bias reduction of maximum likelihood estimates. Biometrika 80(1):27–38
Gao S, Shen J (2007) Asymptotic properties of a double penalized maximum likelihood estimator in logistic regression. Stat Probab Lett 77(9):925–930
Greene W (1993) Econometric analysis, 2nd edn. Macmillan, New York
Guns M, Vanacker V (2012) Logistic regression applied to natural hazards: rare event logistic regression with replications. Nat Hazards Earth Syst Sci 12(6):1937–1947
Hamilton K, Peden AE, Pearson M, Hagger MS (2016) Stop there’s water on the road! Identifying key beliefs guiding people’s willingness to drive through flooded waterways. Saf Sci 89:308–314. https://doi.org/10.1016/j.ssci.2016.07.004
Heinze G, Schemper M (2002) A solution to the problem of separation in logistic regression. Stat Med 21(16):2409–2419
Heinze G, Ploner M, Dunkler D, Southworth H (2013) logistf: firth’s bias reduced logistic regression. R package version 1.20. Available at: http://cran.r-project.org/web/packages/logistf/index.html
Imai K, King G, Lau O (2008) Toward a common framework for statistical analysis and development. J Comput Graphical Stat 17(4):892–913
Imbens GW (1992) An efficient method of moments estimator for discrete choice models with choice-based sampling. Econom J Econom Soc 60:1187–1214
Jeffreys H (1946) An invariant form for the prior probability in estimation problems. Proc R Soc London Ser A Math Phys Sci 186(1007):453–461
Kelsch M, Carporali E, Lanza LG (2001) Hydrometeorology of flash floods. In: Gruntfest E, Handmer J (eds) Coping with flash floods. Kluwer Academic Publishers, Dordrecht, pp 19–35
King G, Zeng L (2001a) Explaining rare events in international relations. Int Org 55(3):693–715
King G, Zeng L (2001b) Logistic regression in rare events data. Political Anal 9(2):137–163
Konrad CP, Booth DB (2002) Hydrologic trends associated with urban development for selected streams in the Puget Sound Basin, Western Washington, vol 2. US Geological Survey, 4040
Lancaster T, Imbens G (1996) Case-control studies with contaminated controls. J Econ 71(1–2):145–160
Leitgöb H (2020) Analysis of rare events. SAGE Publications Limited
Lindell MK, Perry RW (1992) Behavioral foundations of community emergency planning. Hemisphere Publishing Corp
Lindell MK, Prater C, Perry RW (2006) Wiley pathways introduction to emergency management. Wiley
Masterson JH, Peacock WG, Van Zandt SS, Grover H, Schwarz LF, Cooper JT (2014) Planning for community resilience: a handbook for reducing vulnerability to disasters. Island Press
McCullagh P, Nelder JA (1989) Generalized linear models II
NWS (2019) NWS Preliminary US Flood Fatality Statistics (2019). https://www.weather.gov/arx/usflood
NWS (2022) Storm events database. Available online: https://www.ncdc.noaa.gov/stormevents/ftp.jsp. Accessed Novemb 11, 2022
Pinker E (2018) Reporting accuracy of rare event classifiers. NPJ Digit Med 1(1):1–2
Quenouille MH (1949) Problems in plane sampling. Ann Math Stat 20:355–375
Quenouille MH (1956) Notes on bias in estimation. Biometrika 43(3/4):353–360
Ruin I, Gaillard JC, Lutoff C (2007) How to get there? Assessing motorists' flash flood risk perception on daily itineraries. Environ Hazards 7(3):235–244
Saito T, Rehmsmeier M (2015) The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PloS one 10(3):e0118432
Sanders DEA, Brix A, Duffy P, Forster W, Hartington T, Jones G, Wilkinson M (2002) The management of losses arising from extreme events. Convention general insurance study group GIRO, London
Shah V, Kirsch KR, Cervantes D, Zane DF, Haywood T, Horney JA (2017) Flash flood swift water rescues, Texas, 2005–2014. Clim Risk Manage 17:11–20
Sharif HO, Hossain MM, Jackson T, Bin-Shafique S (2012) Person-place-time analysis of vehicle fatalities caused by flash floods in Texas. Geomatics Nat Hazards Risk 3(4):311–323
Sharif HO, Jackson TL, Hossain MM, Zane D (2015) Analysis of flood fatalities in Texas. Nat Hazards Rev 16(1):04014016
Sofaer HR, Hoeting JA, Jarnevich CS (2019) The area under the precision-recall curve as a performance metric for rare binary events. Methods Ecol Evol 10(4):565–577
Terti G, Ruin I, Anquetin S, Gourley JJ (2015) Dynamic vulnerability factors for impact-based flash flood prediction. Nat Hazards 79(3):1481–1497
Terti G, Ruin I, Anquetin S, Gourley JJ (2017) A situation-based analysis of flash flood fatalities in the United States. Bull Am Meteorol Soc 98(2):333–345. https://doi.org/10.1175/BAMS-D-15-00276.1
Terti G, Ruin I, Gourley JJ, Kirstetter P, Flamig Z, Blanchet J, Anquetin S (2019) Toward probabilistic prediction of flash flood human impacts. Risk Anal 39(1):140–161
U.S. Census Bureau (2022) Glossary. Available online: https://www.census.gov/programs-surveys/geography/about/glossary.html#par_textimage_13. Accessed Novemb 9, 2022
Walsh J, Wuebbles D, Hayhoe K, Kossin J, Kunkel K, Stephens G, Somerville R (2014) Our changing climate. Climate change impacts in the United States: The third national climate assessment, 19, 67
Wang J, Zhang X (2008) Downscaling and projection of winter extreme daily precipitation over North America. J Clim 21(5):923–937
Wobus C, Lawson M, Jones R, Smith J, Martinich J (2014) Estimating monetary damages from flooding in the United States under a changing climate. J Flood Risk Manag 7(3):217–229
Zahran S, Brody SD, Peacock WG, Vedlitz A, Grover H (2008) Social vulnerability and the natural and built environment: a model of flood casualties in Texas. Disasters 32(4):537–560
Funding
This material is based on work supported by the National Science Foundation (NSF) under Grant # 1931301. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.
Author information
Authors and Affiliations
Contributions
All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by SC and RSW. The first draft of the manuscript was written by SC and all authors commented on previous versions of the manuscript. Dr. Nasir Gharaibeh performed review, editing, and project administration.
Corresponding author
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chang, S., Wilkho, R.S., Gharaibeh, N. et al. Environmental, climatic, and situational factors influencing the probability of fatality or injury occurrence in flash flooding: a rare event logistic regression predictive model. Nat Hazards 116, 3957–3978 (2023). https://doi.org/10.1007/s11069-023-05845-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11069-023-05845-x