Skip to main content

Advertisement

Log in

Environmental, climatic, and situational factors influencing the probability of fatality or injury occurrence in flash flooding: a rare event logistic regression predictive model

  • Original Paper
  • Published:
Natural Hazards Aims and scope Submit manuscript

Abstract

Flash flooding is considered one of the most lethal natural hazards in the USA as measured by the ratio of fatalities to people affected. However, the occurrence of injuries and fatalities during flash flooding was found to be rare (about 2% occurrence rate) based on our analysis of 6,065 flash flood events that occurred in Texas over a 15-year period (2005 to 2019). This article identifies climatic, environmental, and situational factors that affect the occurrence of fatalities and injuries in flash flood events and provides a predictive model to estimate the likelihood of these occurrences. Due to the highly imbalanced dataset, three forms of logit models were investigated to achieve unbiased estimations of the model coefficients. The rare event logistic regression (Relogit) model was found to be the most suitable model. The model considers ten independent situational, climatic, and environmental variables that could affect human safety in flash flood events. Vehicle-related activities during flash flooding exhibited the greatest effect on the probability of human harm occurrence, followed by the event’s time (daytime vs. nighttime), precipitation amount, location with respect to the flash flood alley, median age of structures in the community, low water crossing density, and event duration. The application of the developed model as a simulation tool for informing flash flood mitigation planning was demonstrated in two study cases in Texas.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Data availability

The data and code are available at the following DesignSafe-ci.org DOI: https://doi.org/10.17603/ds2-e91y-cv92.

References

  • Ahmadalipour A, Moradkhani H (2019) A data-driven analysis of flash flood hazard, fatalities, and damages over the CONUS during 1996–2017. J Hydrol 578:124106

    Article  Google Scholar 

  • Albert A, Anderson JA (1984) On the existence of maximum likelihood estimates in logistic regression models. Biometrika 71(1):1–10

    Article  Google Scholar 

  • Anderson DG (1970) Effects of urban development on floods in northern Virginia. US Government Printing Office, p 22

  • Ashley ST, Ashley WS (2008) Flood fatalities in the United States. J Appl Meteorol Climatol 47(3):805–818

    Article  Google Scholar 

  • Bailey JF (1989) Estimation of flood-frequency characteristics and the effects of urbanization for streams in the Philadelphia, Pennsylvania area. Department of the Interior, US Geological Survey, pp 87–4194

  • Baker VR (1975) Flood hazards along the Balcones escarpment in central Texas; alternative approaches to their recognition, mapping, and management. Virtual Landscapes of Texas

  • Benight CC, Gruntfest EC, Hayden M, Barnes L (2007) Trauma and short-fuse weather warning perceptions. Environ Hazards 7(3):220–226

    Article  Google Scholar 

  • Bull SB, Mak C, Greenwood CM (2002) A modified score function estimator for multinomial logistic regression in small samples. Comput Stat Data Anal 39(1):57–74

    Article  Google Scholar 

  • Caran SC, Baker VR (1986) Flooding along the balcones escarpment, central Texas. KIP Articles. 2088

  • Choirat C, Honaker J, Imai K, King G, Lau O (2020) Zelig: everyone’s statistical software. Version 5.1.7, https://zeligproject.org/

  • Clauset A, Woodard R (2013) Estimating the historical and future probabilities of large terrorist events. Annal Appl Stat 7(4):1838–1865

    Google Scholar 

  • Clogg CC, Rubin DB, Schenker N, Schultz B, Weidman L (1991) Multiple imputation of industry and occupation codes in census public-use samples using bayesian logistic regression. J Am Stat Assoc 86(413):68–78

    Article  Google Scholar 

  • Cosslett SR (1981) Maximum likelihood estimator for choice-based samples. Econom J Econom Soc 49:1289–1316

    Google Scholar 

  • Cox DR, Hinkley DV (1979) Theoretical statistics. CRC Press, Florida

    Book  Google Scholar 

  • Diakakis M (2020) Types of behavior of flood victims around floodwaters. Correlation with situational and demographic factors. Sustainability 12(11):4409

    Article  Google Scholar 

  • Firth D (1993) Bias reduction of maximum likelihood estimates. Biometrika 80(1):27–38

    Article  Google Scholar 

  • Gao S, Shen J (2007) Asymptotic properties of a double penalized maximum likelihood estimator in logistic regression. Stat Probab Lett 77(9):925–930

    Article  Google Scholar 

  • Greene W (1993) Econometric analysis, 2nd edn. Macmillan, New York

    Google Scholar 

  • Guns M, Vanacker V (2012) Logistic regression applied to natural hazards: rare event logistic regression with replications. Nat Hazards Earth Syst Sci 12(6):1937–1947

    Article  Google Scholar 

  • Hamilton K, Peden AE, Pearson M, Hagger MS (2016) Stop there’s water on the road! Identifying key beliefs guiding people’s willingness to drive through flooded waterways. Saf Sci 89:308–314. https://doi.org/10.1016/j.ssci.2016.07.004

    Article  Google Scholar 

  • Heinze G, Schemper M (2002) A solution to the problem of separation in logistic regression. Stat Med 21(16):2409–2419

    Article  Google Scholar 

  • Heinze G, Ploner M, Dunkler D, Southworth H (2013) logistf: firth’s bias reduced logistic regression. R package version 1.20. Available at: http://cran.r-project.org/web/packages/logistf/index.html

  • Imai K, King G, Lau O (2008) Toward a common framework for statistical analysis and development. J Comput Graphical Stat 17(4):892–913

    Article  Google Scholar 

  • Imbens GW (1992) An efficient method of moments estimator for discrete choice models with choice-based sampling. Econom J Econom Soc 60:1187–1214

    Google Scholar 

  • Jeffreys H (1946) An invariant form for the prior probability in estimation problems. Proc R Soc London Ser A Math Phys Sci 186(1007):453–461

    Google Scholar 

  • Kelsch M, Carporali E, Lanza LG (2001) Hydrometeorology of flash floods. In: Gruntfest E, Handmer J (eds) Coping with flash floods. Kluwer Academic Publishers, Dordrecht, pp 19–35

    Chapter  Google Scholar 

  • King G, Zeng L (2001a) Explaining rare events in international relations. Int Org 55(3):693–715

    Article  Google Scholar 

  • King G, Zeng L (2001b) Logistic regression in rare events data. Political Anal 9(2):137–163

    Article  Google Scholar 

  • Konrad CP, Booth DB (2002) Hydrologic trends associated with urban development for selected streams in the Puget Sound Basin, Western Washington, vol 2. US Geological Survey, 4040

  • Lancaster T, Imbens G (1996) Case-control studies with contaminated controls. J Econ 71(1–2):145–160

    Article  Google Scholar 

  • Leitgöb H (2020) Analysis of rare events. SAGE Publications Limited

    Google Scholar 

  • Lindell MK, Perry RW (1992) Behavioral foundations of community emergency planning. Hemisphere Publishing Corp

    Google Scholar 

  • Lindell MK, Prater C, Perry RW (2006) Wiley pathways introduction to emergency management. Wiley

    Google Scholar 

  • Masterson JH, Peacock WG, Van Zandt SS, Grover H, Schwarz LF, Cooper JT (2014) Planning for community resilience: a handbook for reducing vulnerability to disasters. Island Press

    Book  Google Scholar 

  • McCullagh P, Nelder JA (1989) Generalized linear models II

  • NWS (2019) NWS Preliminary US Flood Fatality Statistics (2019). https://www.weather.gov/arx/usflood

  • NWS (2022) Storm events database. Available online: https://www.ncdc.noaa.gov/stormevents/ftp.jsp. Accessed Novemb 11, 2022

  • Pinker E (2018) Reporting accuracy of rare event classifiers. NPJ Digit Med 1(1):1–2

    Article  Google Scholar 

  • Quenouille MH (1949) Problems in plane sampling. Ann Math Stat 20:355–375

    Article  Google Scholar 

  • Quenouille MH (1956) Notes on bias in estimation. Biometrika 43(3/4):353–360

    Article  Google Scholar 

  • Ruin I, Gaillard JC, Lutoff C (2007) How to get there? Assessing motorists' flash flood risk perception on daily itineraries. Environ Hazards 7(3):235–244

    Article  Google Scholar 

  • Saito T, Rehmsmeier M (2015) The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PloS one 10(3):e0118432

    Article  Google Scholar 

  • Sanders DEA, Brix A, Duffy P, Forster W, Hartington T, Jones G, Wilkinson M (2002) The management of losses arising from extreme events. Convention general insurance study group GIRO, London

    Google Scholar 

  • Shah V, Kirsch KR, Cervantes D, Zane DF, Haywood T, Horney JA (2017) Flash flood swift water rescues, Texas, 2005–2014. Clim Risk Manage 17:11–20

    Article  Google Scholar 

  • Sharif HO, Hossain MM, Jackson T, Bin-Shafique S (2012) Person-place-time analysis of vehicle fatalities caused by flash floods in Texas. Geomatics Nat Hazards Risk 3(4):311–323

    Article  Google Scholar 

  • Sharif HO, Jackson TL, Hossain MM, Zane D (2015) Analysis of flood fatalities in Texas. Nat Hazards Rev 16(1):04014016

    Article  Google Scholar 

  • Sofaer HR, Hoeting JA, Jarnevich CS (2019) The area under the precision-recall curve as a performance metric for rare binary events. Methods Ecol Evol 10(4):565–577

    Article  Google Scholar 

  • Terti G, Ruin I, Anquetin S, Gourley JJ (2015) Dynamic vulnerability factors for impact-based flash flood prediction. Nat Hazards 79(3):1481–1497

    Article  Google Scholar 

  • Terti G, Ruin I, Anquetin S, Gourley JJ (2017) A situation-based analysis of flash flood fatalities in the United States. Bull Am Meteorol Soc 98(2):333–345. https://doi.org/10.1175/BAMS-D-15-00276.1

    Article  Google Scholar 

  • Terti G, Ruin I, Gourley JJ, Kirstetter P, Flamig Z, Blanchet J, Anquetin S (2019) Toward probabilistic prediction of flash flood human impacts. Risk Anal 39(1):140–161

    Article  Google Scholar 

  • U.S. Census Bureau (2022) Glossary. Available online: https://www.census.gov/programs-surveys/geography/about/glossary.html#par_textimage_13. Accessed Novemb 9, 2022

  • Walsh J, Wuebbles D, Hayhoe K, Kossin J, Kunkel K, Stephens G, Somerville R (2014) Our changing climate. Climate change impacts in the United States: The third national climate assessment, 19, 67

  • Wang J, Zhang X (2008) Downscaling and projection of winter extreme daily precipitation over North America. J Clim 21(5):923–937

    Article  Google Scholar 

  • Wobus C, Lawson M, Jones R, Smith J, Martinich J (2014) Estimating monetary damages from flooding in the United States under a changing climate. J Flood Risk Manag 7(3):217–229

    Article  Google Scholar 

  • Zahran S, Brody SD, Peacock WG, Vedlitz A, Grover H (2008) Social vulnerability and the natural and built environment: a model of flood casualties in Texas. Disasters 32(4):537–560

    Article  Google Scholar 

Download references

Funding

This material is based on work supported by the National Science Foundation (NSF) under Grant # 1931301. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Material preparation, data collection and analysis were performed by SC and RSW. The first draft of the manuscript was written by SC and all authors commented on previous versions of the manuscript. Dr. Nasir Gharaibeh performed review, editing, and project administration.

Corresponding author

Correspondence to Shi Chang.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Chang, S., Wilkho, R.S., Gharaibeh, N. et al. Environmental, climatic, and situational factors influencing the probability of fatality or injury occurrence in flash flooding: a rare event logistic regression predictive model. Nat Hazards 116, 3957–3978 (2023). https://doi.org/10.1007/s11069-023-05845-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11069-023-05845-x

Keywords

Navigation