Skip to main content

Advertisement

Log in

Toward the probabilistic forecasting of cyclone-induced marine flooding by overtopping at Reunion Island aided by a time-varying random-forest classification approach

  • Original Paper
  • Published:
Natural Hazards Aims and scope Submit manuscript

Abstract

In 2017, Irma and Maria highlighted the vulnerability of small islands to cyclonic events and the necessity of advancing the forecast techniques for cyclone-induced marine flooding. In this context, this paper presents a generic approach to deriving time-varying inundation forecasts from ensemble track and intensity forecasts applied to the case of Reunion Island in the Indian Ocean. The challenge for volcanic islands is to account for the full complexity of wave overtopping processes while also ensuring a robustness and timeliness that are compatible with emergency requirements. The challenge is addressed by following a hybrid approach relying on the combination of process-based models with a statistical model (herein, a random-forest classifier) trained with a precalculated database. The latter enables one to translate any time series of coastal marine conditions into the time-varying probability of inundation for different sectors. The application detailed for the case of Cyclone Dumile at Sainte-Suzanne city shows that the proposed approach enables quick discrimination, in both space and time, thereby identifying safe and exposed areas and demonstrating that probabilistic forecasting of marine flooding by overtopping is feasible. The whole method can be easily adapted to other territories and scales provided that validated process-based models are available. Beyond early warning applications, the developed database and statistical models may also be used for informing risk prevention and adaptation strategies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Amante C, Eakins BW (2009) ETOPO1 1 arc-minute global relief model: procedures, data sources and analysis. NOAA Technical Memorandum NESDIS NGDC-24, p 19

  • Ardhuin F, Rogers WE, Babanin AV, Filipot J, Magne R, Roland A, Van der Westhuysen A, Queffeulou P, Lefevre J, Aouf L, Collard F (2010) Semiempirical dissipation source functions for ocean waves. Part I: definition, calibration, and validation. J Phys Oceanogr 40(1):917–1941

    Google Scholar 

  • Bhaskaran PK, Nayak S, Subba Reddy B, Murty PLN, Sen D (2013) Performance and validation of a coupled parallel ADCIRC-SWAN model for THANE cyclone in the Bay of Bengal. Environ Fluid Mech 13:601–623

    Article  Google Scholar 

  • Bhaskaran PK, Rao AD, Murty T (2020) Tropical cyclone-induced storm surges and wind waves in the Bay of Bengal. In: Srivastava PK, Singh SK, Mohanty UC, Murty T (eds) Techniques for disaster risk management and mitigation, 1st edn. Wiley, New York, pp 239–294

    Google Scholar 

  • Bolle A, das Neves L, Smets S, Mollaert J, Buitrago S (2017) An impact-oriented early warning and Bayesian-based decision support system for flood risks in Zeebrugge harbour. Coast Eng. https://doi.org/10.1016/j.coastaleng.2017.10.006

    Article  Google Scholar 

  • Bonnardot F, Quetelard H, Jumeaux G, Leroux MD (2018) Probabilistic forecasts of tropical cyclone tracks and intensities in the Southwest Indian Ocean Basin. Q J R Meteorol Soc. https://doi.org/10.1002/qj.3459

    Article  Google Scholar 

  • Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159

    Article  Google Scholar 

  • Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  Google Scholar 

  • Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and regression trees. Chapman and Hall/CRC, New York

    Google Scholar 

  • Brodtkorb PA, Johannesson P, Lindgren G, Rychlik I, Rydén J, Sjö E (2000) WAFO—a Matlab toolbox for analysis of random waves and loads. In: Proceedings of the 10th international offshore and polar engineering conference, Seattle, Vol III, pp 343–350

  • Brooks W, Corsi S, Fienen M, Carvin R (2016) Predicting recreational water quality advisories: a comparison of statistical methods. Environ Model Softw 76:81–94

    Article  Google Scholar 

  • Chateauminois E, De La Torre, Y, Le Roy S, Pedreros R, Druon J (2013) Caractérisation et cartographie des aléas côtiers pour l'élaboration du Plan de Prévention des Risques Littoraux des communes du Sud-Ouest de La Réunion. Rapport final. BRGM/RP-62172-FR, 69 p, 4 ann

  • Cloke HL, Pappenberger F (2009) Ensemble flood forecasting: a review. J Hydrol 375(3–4):613–626

    Article  Google Scholar 

  • Cutler DR, Edwards TC, Beard KH, Cutler A, Hess KT, Gibson J, Lawler JJ (2007) Random forests for classification in ecology. Ecology 88(11):2783–2792. https://doi.org/10.1890/07-0539.1

    Article  Google Scholar 

  • Cyriac R, Dietrich JC, Fleming JG, Blanton BO, Kaiser C, Dawson CN, Luettich RA (2018) Variability in coastal flooding predictions due to forecast errors during Hurricane Arthur. Coast Eng 137:59–78. https://doi.org/10.1016/j.coastaleng.2018.02.008

    Article  Google Scholar 

  • Davis JR, Paramygin VA, Forrest D, Sheng YP (2010) Probabilistic simulation of storm surge and inundation in a limited resource environment. Mon Wea Rev 138(7):2953–2974

    Article  Google Scholar 

  • Dean RG, Dalrymple RA (1991) Water wave mechanics for engineers and scientists, 2nd edn. World Scientific, Singapore, p 353

    Book  Google Scholar 

  • Dietrich JC, Dawson CN, Proft JM, Howard MT, Wells G, Fleming JG, Luettich RA Jr, Westerink JJ, Cobell Z, Vitse M, Lander H, Blanton BO, Szpilka CM, Atkinson JH (2013) Real-time forecasting and visualization of hurricane waves and storm surge using SWAN+ADCIRC and FigureGen. In: Dawson C, Gerritsen M (eds) Computational challenges in the geosciences. The IMA volumes in mathematics and its applications, vol 156. Springer, New York

    Google Scholar 

  • Efron B, Tibshirani RJ (1993) An introduction to the bootstrap. Chapman & Hall, New York

    Book  Google Scholar 

  • EurOtop (2016) Manual on wave overtopping of sea defences and related structures. An overtopping manual largely based on European research, but for worldwide application

  • Fiedler JW, Smit PB, Brodie KL, McNinch J, Guza RT (2018) Numerical modeling of wave runup on steep and mildly sloping natural beaches. Coast Eng 106–113:0378–3839. https://doi.org/10.1016/j.coastaleng.2017.09.004

    Article  Google Scholar 

  • Filippini AG, De Brye S, Perrier V, Marche F, Ricchiuto M, Lannes D, Bonneton P (2018) UHAINA: a parallel high performance unstructured near-shore wave model. XVèmes Journées Nationales Génie Côtier – Génie Civil La Rochelle, 29 au 31 mai 2018

  • French Hydrographic and Oceanographic Service (2014) Marine altimetry references. https://refmar.shom.fr/fr/sea_level_news_2014/2014_t4/edition-2014-du-produit-ram-shom.

  • Geurts P, Ernst D, Wehenkel L (2006) Extremely randomized trees. Mach Learn 63:3–42. https://doi.org/10.1007/s10994-006-6226-1

    Article  Google Scholar 

  • Harper B, Hardy T, Mason L, Fryar R (2009) Developments in storm tide modelling and risk assessment in the Australian region. Nat Hazards 51(1):225–238

    Article  Google Scholar 

  • Hashemi MR, Spaulding ML, Shaw A, Farhadi H, Lewis M (2016) An efficient artificial intelligence model for prediction of tropical storm surge. Nat Hazards 82(1):471–491

    Article  Google Scholar 

  • Holland GJ (1980) An analytic model of the wind and pressure profiles in hurricanes. Mon Weather Rev 108(8):1212–1218

    Article  Google Scholar 

  • Irish JL, Song YK, Chang K-A (2011) Probabilistic hurricane surge forecasting using parameterized surge response functions. Geophys Res Lett 38(3):L03606

    Article  Google Scholar 

  • Jäger WS, Christie EK, Hanea AM, den Heijer C, Spencer T (2018) A Bayesian network approach for coastal risk analysis and decision making. Coast Eng 48–61:0378–3839. https://doi.org/10.1016/j.coastaleng.2017.05.004

    Article  Google Scholar 

  • Jia G, Taflanidis AA (2013) Kriging metamodeling for approximation of high-dimensional wave and surge responses in real-time storm/hurricane risk assessment. Comput Methods Appl Mech Eng 261:24–38

    Article  Google Scholar 

  • Kennedy AB, Westerink JJ, Smith J, Taflanidis AA, Hope M, Hartman M, Tanaka S, Westerink H, Cheung KF, Smith T, Hamman M, Minamide M, Ota A (2012) Tropical cyclone inundation potential on the Hawaiian islands of Oahu and Kauai. Ocean Model 52–53:54–68

    Article  Google Scholar 

  • Kim SW, Melby JA, Nadal-Caraballo NC, Ratcliff J (2015) A time-dependent surrogate model for storm surge prediction based on an artificial neural network using high-fidelity synthetic hurricane modeling. Nat Hazards 76(1):565–585

    Article  Google Scholar 

  • Lafore JP, Stein J, Asencio N, Bougeault P, Ducrocq V, Duron J, Fischer C, Hereil P, Mascart P, Pinty JP, Redelsperger JL, Richard E, Vila-Guerau de Arellano J (1998) The Meso-NH Atmospheric Simulation System. Part I: Adiabatic formulation and control simulations. Ann Geophys 16:90–109

    Article  Google Scholar 

  • Lavell A, Oppenheimer M, Diop C, Hess J, Lempert R, Li J, Muir-Wood R, Myeong S (2012) Climate change: new dimensions in disaster risk, exposure, vulnerability, and resilience. In: Managing the risks of extreme events and disasters to advance climate change adaptation, a special report of working groups I and II of the Intergovernmental Panel on Climate Change (IPCC). Cambridge University Press, Cambridge, UK, and New York, NY, USA, pp 25–64

  • Lazure P, Dumas F (2008) An external–internal mode coupling for a 3D hydrodynamical model for applications at regional scale (MARS). Adv Water Resour 31(233–250):2008

    Google Scholar 

  • Le Roy S, Pedreros R, André C, Paris F, Lecacheux S, Marche F, Vinchon C (2015) Coastal flooding of urban areas by overtopping: dynamic modelling application to the Johanna storm (2008) in Gâvres (France). Nat Hazards Earth Syst Sci 15:2497–2510. https://doi.org/10.5194/nhess-15-2497-2015

    Article  Google Scholar 

  • Lecacheux S, Pedreros R, Le Cozannet G, Thiebot J, De La Torre Y, Bulteau T (2012) A method to characterize the different extreme waves for islands exposed to various wave regimes: a case study devoted to Reunion Island. Nat Hazards Earth Syst Sci. https://doi.org/10.5194/nhess-12-2425-2012

    Article  Google Scholar 

  • Lecacheux FB, Rousseau M, Pedreros FPR, Lerma AN, Quetelard H, Barbary D (2018) Probabilistic forecast of coastal waves for flood warning applications at Reunion Island (Indian Ocean). In: Shim J-S, Chun I, Lim H-S (eds) Proceedings from the international coastal symposium (ICS) 2018 (Busan, Republic of Korea). Journal of Coastal Research, Special Issue No. 85, pp 776–780. Coconut Creek (Florida), ISSN 0749-0208

  • Liu X, Guillas S (2017) Dimension reduction for Gaussian process emulation: an application to the influence of bathymetry on tsunami heights. SIAM/ASA J Uncertain Quantif 5(1):787–812

    Article  Google Scholar 

  • Liu H, Xie L, Pietrafesa LJ, Bao S (2007) Sensitivity of wind waves to hurricane wind characteristics. Ocean Model 18(1):37–52. https://doi.org/10.1016/j.ocemod.2007.03.004

    Article  Google Scholar 

  • Luitel B, Villarini G, Vecchi GA (2018) Verification of the skill of numerical weather prediction models in forecasting rainfall from U.S. landfalling tropical cyclones. J Hydrol 556:1026–1037

    Article  Google Scholar 

  • Lumbroso DM, Suckall NR, Nicholls RJ, White KD (2017) Enhancing resilience to coastal flooding from severe storms in the USA: international lessons. Nat Hazards Earth Syst Sci 17(8):1357

    Article  Google Scholar 

  • Lyard F, Lefevre F, Letellier T, Francis O (2006) Modelling the global ocean tides: modern insights from FES2004 Ocean Dyn, 56, 394–415

  • Malley JD, Kruppa J, Dasgupta A, Malley KG, Ziegler A (2012) Probability machines: consistent probability estimation using nonparametric learning machines. Methods Inf Med 51(1):74

    Article  Google Scholar 

  • Mastrandrea MD, Field CB, Stocker TF, Edenhofer O, Ebi KL, Frame DJ, Held H, Kriegler E, Mach KJ, Matschoss PR, Plattner GK, Yohe GW, Zwiers FW (2010) Guidance note for lead authors of the IPCC fifth assessment report on consistent treatment of un IPCC cross-working group meeting on consistent treatment of uncertainties Jasper Ridge, CA, USA, pp 6–7

  • Murty PLN, Sandhya KG, Bhaskaran PK, Jose F, Gayathri R, Balakrishnan Nair TM, Srinivasa Rao T, Shenoi SSC (2014) A coupled hydrodynamic modeling system for PHAILIN cyclone in the Bay of Bengal. Coast Eng 93:71–81

    Article  Google Scholar 

  • Nicolae Lerma A, Bulteau T, Elineau S, Paris F, Durand P, Anselme B, Pedreros R (2018) High-resolution marine flood modelling coupling overflow and overtopping processes: framing the hazard based on historical and statistical approaches. Nat Hazards Earth Syst Sci 18:207–229. https://doi.org/10.5194/nhess-18-207-2018

    Article  Google Scholar 

  • Nuissier O, Roger RF, Roux F (2005) A numerical simulation of Hurricane Bret on 22–23 August 1999 initialized with airborne Doppler radar and dropsonde data. Q J R Meteorol Soc 131:155–194

    Article  Google Scholar 

  • Pappenberger F, Stephens E, Thielen J, Salamon P, Demeritt D, Van Andel SJ, Alfieri L (2013) Visualizing probabilistic flood forecast information: expert preferences and perceptions of best practice in uncertainty communication. Hydrol Process 27(1):132–146

    Article  Google Scholar 

  • Poelhekke L, Jäger WS, Van Dongeren A, Plomaritis TA, McCall R, Ferreira O (2016) Predicting coastal hazards for sandy coasts with a Bayesian network. Coast Eng 118:21–34. https://doi.org/10.1016/j.coastaleng.2016.08.011

    Article  Google Scholar 

  • Queffleulou P, Croizé-Fillon D (2009) La mesure satellite de hauteur de vague par altimètre. Etat des lieux, application à la climatologie et à la modélisation des états de mer. Ateliers modélisation de l’atmosphère

  • Rodrigues M, de la Riva J (2014) An insight into machine-learning algorithms to model human-caused wildfire occurrence. Environ Model Softw 57:192–201

    Article  Google Scholar 

  • Rohmer J, Lecacheux S, Pedreros R, Bonnardot F, Quetelard H (2016) Dynamic parameter sensitivity in numerical modeling of cyclone-induced waves: a multi-look approach using advanced meta-modelling techniques. Nat Hazards. https://doi.org/10.1007/s11069-016-2513-8

    Article  Google Scholar 

  • Rohmer J, Idier D, Paris F, Pedreros R, Louisor J (2018) Casting light on forcing and breaching scenarios that lead to marine inundation: combining numerical simulations with a random-forest classification approach. Environ Model Softw 104:64–80

    Article  Google Scholar 

  • Sahoo B, Bhaskaran PK (2017) A comprehensive data set for tropical cyclone storm surge-induced inundation for the east coast of India. Int J Climaolt. https://doi.org/10.1002/joc.5184

    Article  Google Scholar 

  • Sahoo B, Bhaskaran PK (2019) Prediction of storm surge and coastal inundation using Artificial Neural Network—A case study for 1999 Odisha Super Cyclone. Weather Clim Extremes 23:100196

    Article  Google Scholar 

  • Smith JM, Westerink JJ, Kennedy AB, Taflanidis AA, Cheung KF, Smith T (2011) SWIMS Hawaii hurricane wave, surge, and runup inundation fast forecasting tool. In: Proceedings of the 2011 solutions to coastal disasters. ASCE, pp 89–98

  • Stansby PK (2013) Coastal hydrodynamics—present and future. J Hydraul Res 51:341–350. https://doi.org/10.1080/00221686.2013.821678

    Article  Google Scholar 

  • Suzuki T, Verwaest T, Veale W, Trouw K, Zijlema M (2012) A numerical study on the effect of beach nourishment on wave overtopping in shallow foreshores. Coast Eng Proc 1:50. https://doi.org/10.9753/icce.v33.waves.50

    Article  Google Scholar 

  • Taylor AA, Glahn B (2008) Probabilistic guidance for hurricane storm surge. In: 19th conference on probability and statistics, pp 7–4

  • Tolman (2014) User manual and system documentation of WAVEWATCH- III version 4.18. NOAA/NWS/NCEP/MMABTech

  • Wager S, Hastie T, Efron B (2014) Confidence intervals for random forests: the jackknife and the infinitesimal jackknife. J Mach Learn Res 15(1):1625–1651

    Google Scholar 

  • Wright MN, Ziegler A (2017) A fast implementation of random forests for high dimensional data in C++ and R. J Stat Softw 77:1–17. https://doi.org/10.8637/jss.v077.i01

    Article  Google Scholar 

  • Zhang J, Taflanidis AA, Nadal-Caraballo NC, Melby JA, Diop F (2018) Advances in surrogate modeling for storm surge prediction: storm selection and addressing characteristics related to climate change. Nat Hazards 94(3):1225–1253

    Article  Google Scholar 

  • Zijlema M, Stelling GS, Smit PB (2011) SWASH: an operational public domain code for simulating wave fields and rapidly varied flows in coastal waters. Coast Eng 58:992–1012

    Article  Google Scholar 

Download references

Acknowledgements

This work is supported by the French National Research Agency within the SPICy project (ANR – 14 – CE03 – 0013). The authors thank the Region of Reunion Island for providing the wave measurements, the SHOM for providing the bathymetry and the tide gauge measurements and the IGN for providing the LIDAR data. We also thank the employees of Sainte-Suzanne city, the DEAL service and the emergency services for their high level of involvement during the project and their support and instructive information.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to S. Lecacheux.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix: Random forest method

Appendix: Random forest method

Based on the set of n training data D = (Xi; yi)i=1,…,n, the objective of setting up a classification model (classifier) is to identify the rules for assigning a new “yet-unseen” input scenario into class 1 or 0 without resorting to a long-running numerical simulation. A basic classification model can rely on the concept of a decision tree (Breiman et al. 1984), which is recursively built using binary partitioning of the domain space into regions that are increasingly homogeneous with respect to the class variable. As an illustration, Fig. 

Fig. 10
figure 10

Schematic overview of a tree model for binary classification (0 and 1). X1–4 correspond to the input variables; C1–6 correspond to the cut-off values for partitioning

10 schematically depicts a tree model for binary classification (class label 0 or 1). The homogeneous regions correspond to the nodes. The splitting process aims to subdivide one parent node into two subnodes. Different splitting rules exist, the most popular of which relies on the maximization of node homogeneity, which is measured by the Gini impurity index denoted G (Breiman et al. 1984) defined as follows:

$$G = 1 - p_{1}^{2} - p_{0}^{2}$$
(3)

where \(p_{k} = \frac{{n_{k} }}{n}\) is the fraction of the nk samples from class k = {0,1}. The decrease of G is calculated as Gparent − Gsplit1 − Gsplit2 after splitting the parent node into two subnodes that depends on the cut-off value C. At each step of the construction, the algorithm selects the cut-off value C that results in the largest decrease of G. This process is performed until subdivision no longer decreases the Gini index, or until a minimum node size is reached (typically 1 for classification).

The random forest (RF) builds on the same principles as classification decision tree models but extends them by adding a random character to the construction process at the following two levels: (1) each tree is constructed using a different bootstrap sample of the data (Breiman 2001), which enables overcoming the instability issue of classical decision tree (i.e., the tree structure and resulting predicted classes might not be stable when new samples are provided); (2) each node is split using the best among a subset of input parameters randomly chosen at that node, which enables handling situations which have highly correlated input parameters (Cutler et al. 2007). Figure 

Fig. 11
figure 11

Schematic overview of the algorithm for RF setup (0 and 1). The original training dataset D = {Xi; yi} is randomly split into ntree bootstrap samples, which are used to set up the subsequent classification tree model. The out-of-bag samples DC are used to measure the prediction error Err. The ensemble of classification tree models constitutes the RF model (i.e., the aggregation of the different tree models)

11 schematically depicts the principles of the RF algorithm, as follows:

  • Step 1: Select ntree bootstrap samples Dj (using random sampling with replacement) from the original training data D = {Xi; yi}i=1…n;

  • Step 2: For each of the bootstrap samples Dj, grow a classification tree by splitting until all the observations in each terminal node come from the same class (“pure”). The split can rely, for instance, on the Gini index (an original formulation of Breiman 2001) or can be fully random (Geurts et al. 2006). At each node, rather than choosing the best split among all predictors (input parameters), choose it among mtry randomly selected predictors. For each tree j, the classification error is measured by the misclassification rate denoted Errj as follows:

    $${\text{Err}}_{j} = \frac{1}{{\left| {{\varvec{D}}_{j}^{C} } \right|}}\sum\limits_{{i \in {\varvec{D}}_{j}^{C} }}^{{}} {I(y_{i} \ne \hat{y}_{i} )}$$
    (4)

    where DC is the set of samples which are not used for the construction of the considered classification decision tree; this set constitutes the out-of-bag (OOB) samples so that \({\mathbf{D}}_{j}^{C} \cup {\mathbf{D}}_{j} = {\mathbf{D}};\,|{\varvec{D}}_{j}^{C} |\) is the size of the jth OOB set; yi is the ith class indicator within the jth OOB set; \(\hat{y}_{i}\) is the corresponding class prediction using the jth classification tree;

  • Step 3: Predict new data by aggregating the predictions of the ntree trees. The class of the “yet-unseen” input parameters is estimated as the one with the maximum of occurrence among the different trees (votes). A class probability (probability of belonging to class 1 or 0 otherwise) can be estimated either by relying on a voting-based approach (by counting the proportion of the considered class estimated using the ntree trees) or by using a regression-based RF model as described by Malley et al. (2012).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lecacheux, S., Rohmer, J., Paris, F. et al. Toward the probabilistic forecasting of cyclone-induced marine flooding by overtopping at Reunion Island aided by a time-varying random-forest classification approach. Nat Hazards 105, 227–251 (2021). https://doi.org/10.1007/s11069-020-04307-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11069-020-04307-y

Keywords

Navigation