One-dimensional deep learning driven geospatial analysis for flash flood susceptibility mapping: a case study in North Central Vietnam

Hoa, Pham Viet; Binh, Nguyen An; Hong, Pham Viet; An, Nguyen Ngoc; Thao, Giang Thi Phuong; Hanh, Nguyen Cao; Ngo, Phuong Thao Thi; Bui, Dieu Tien

doi:10.1007/s12145-024-01285-8

Research
Open access
Published: 06 July 2024

Earth Science Informatics (2024)

Pham Viet Hoa¹,
Nguyen An Binh¹,
Pham Viet Hong²,
Nguyen Ngoc An¹,
Giang Thi Phuong Thao¹,
Nguyen Cao Hanh¹,
Phuong Thao Thi Ngo³ &
Dieu Tien Bui⁴

Abstract

Flash floods rank among the most catastrophic natural disasters worldwide, inflicting severe socio-economic, environmental, and human impacts. Consequently, accurately identifying areas at potential risk is of paramount importance. This study investigates the efficacy of Deep 1D-Convolutional Neural Networks (Deep 1D-CNN) in spatially predicting flash floods, with a specific focus on the frequent tropical cyclone-induced flash floods in Thanh Hoa province, North Central Vietnam. The Deep 1D-CNN was structured with four convolutional layers, two pooling layers, one flattened layer, and two fully connected layers, employing the ADAM algorithm for optimization and Mean Squared Error (MSE) for loss calculation. A geodatabase containing 2540 flash flood locations and 12 influencing factors was compiled using multi-source geospatial data. The database was used to train and validate the model. The results indicate that the Deep 1D-CNN model achieved high predictive accuracy (90.2%), along with a Kappa value of 0.804 and an AUC (Area Under the Curve) of 0.969, surpassing benchmark models such as SVM (Support Vector Machine) and LR (Logistic Regression). The study concludes that the Deep 1D-CNN model is a highly effective tool for modeling flash floods.

Introduction

Flash floods continue to be recognized as a significant natural hazard (Kreibich et al. 2019), particularly in tropical regions (Hoyos et al. 2019), causing significant infrastructure damage and detrimental environmental effects and consistently resulting in fatalities each year (Al-Aizari et al. 2022, 2024; Dayan et al. 2021; Hussain et al. 2021; Lorenzo-Lacruz et al. 2019). Unlike other types of floods, such as coastal, river, and ice-jam floods, flash floods are predominantly triggered by short-duration, high-intensity rainfall events (Borga et al. 2011), notably those associated with tropical storms and depressions. Flash floods typically occur in confined geographic areas and are marked by their rapid and sudden onset, strong flow velocities (Ngo et al. 2018), nonlinear dynamics, and complex interdependencies (Wu et al. 2019). Consequently, predicting flash flood hazards remains challenging and distinct from forecasting other types of floods.

Literature surveys indicate that hazard-oriented research on flash floods encompasses a diversity of methodologies. The first segment of this research treats flash floods as external processes, concentrating on the cataloging, geographical spread, scale, and distinct attributes of these phenomena (Borga et al. 2011; Marchi et al. 2010), as well as examining their geomorphic impacts (Ozturk et al. 2018; Scorpio et al. 2018). The second segment, which is more prevalent, employs modeling methods to forecast, predict, and map the spatial likelihood of flash floods (Hussain et al. 2023; Saharia et al. 2017). This approach is critical for disaster management and preventive measures.

Over the past three decades, a variety of hydrological methods based on rainfall-runoff modeling have been developed to simulate and forecast flash floods (Douinot et al. 2016; Zhang et al. 2021). These hydrological models are categorized into three principal types based on their structural framework: empirical models, hydraulic models, and hydrologic models (Beven 2011; Devia et al. 2015). Initial research on flash flood forecasting primarily employed empirical models, such as the Soil Conservation Service Curve Number (SCS-CN) method (Reilly and Piechota 2005). This method presupposes a static relationship between rainfall and runoff, a simplification that may not adequately capture the intricate hydrological processes across various watershed conditions. Hydraulic models, e.g., 1D HEC-RAS (US Army Corps of Engineers), LISFLOOD-FP (University of Bristol), or MIKE FLOOD (Danish Hydraulic Institute, DHI), which simulate water flow based on channel topography, are extensively utilized for forecasting flash floods (Kourgialas and Karatzas 2014; Vozinaki et al. 2015). However, these models often rely on simplifications and assumptions about flow conditions, channel roughness, and sediment transport, which might not accurately represent real-world complexities in some areas.

Hydrologic models, such as TOPMODEL (Vincendon et al. 2010), SWAT (Jodar-Abellan et al. 2019), HEC-HMS (Zema et al. 2017), HEC-RAS (Munna et al. 2021), and HiResFlood-UCI (Nguyen et al. 2016), utilize a set of mathematical equations, ranging from simple empirical formulas to complex differential equations, to estimate rainfall-runoff and to implement routing schemes. They usually consider factors such as soil type, land use, topography, and antecedent moisture conditions (Zhai et al. 2018) to convert rainfall data into an estimate of surface runoff. The water movement is then simulated through the river systems or over the land surface, incorporating channel characteristics and interactions between the channel and floodplain. Generally, rainfall-runoff models are crucial tools for predicting flash floods at both spatial and temporal scales with reasonable accuracy (Bournas and Baltas 2022; Coustau et al. 2012; Tramblay et al. 2010). Nevertheless, these models require long-term monitoring data series to achieve dependable predictions. Consequently, an alternative methodology, referred to as “on-off” modeling (Tien Bui and Hoang 2017), has been considered. Therein, “On” signifies the presence or occurrence of a flash flood, while “Off” indicates its absence. This approach facilitates spatial prediction of flash floods by correlating historical flood events with conditioning factors.

With the development of Geographic Information Systems (GIS) and machine learning, various approaches employing “on-off” modeling have been explored for flash flood studies, e.g., logistic regression (Youssef et al. 2016), multilayer neural networks (Ngo et al. 2018), extreme learning machines (Bui et al. 2019), machine learning ensembles (Costache and Tien Bui 2020), CHAID tree ensembles (Nguyen et al. 2020b), Classification And Regression Tree (CART) (Liu et al. 2021), XGBoost and random forest (Abedi et al. 2022), support vector machines (Youssef et al. 2022), and stacking ensembles (Yao et al. 2022). To construct precise flash flood prediction models, these machine learning algorithms can assimilate a wide array of geospatial data sources, thereby identifying nuanced correlations and interactions among several influential factors. However, to date, no single methodology or technique for predicting flash floods has achieved universal acknowledgment for its efficacy across diverse geographical regions. This underscores the critical need for ongoing research dedicated to developing and exploring innovative algorithmic models tailored explicitly for flash flood prediction.

In recent years, deep learning has emerged as a prominent approach in flood modeling (Trong et al. 2023), including flash flood modeling. The increasing interest in deep learning is attributed to its varied structures and groundbreaking successes in multiple fields (Dhillon and Verma 2020; Zhang et al. 2019). In this context, the combination of the exceptional performance of deep learning algorithms, the availability of extensive geospatial datasets (Hu et al. 2022), advancements in computational hardware (Rasch et al. 2023), and the development of accessible frameworks (Nguyen et al. 2019) has elevated deep learning to a prominent position in contemporary artificial intelligence research and applications. Its appeal and relevance have grown particularly in the past five years, with applications including deep neural networks (Panahi et al. 2021), deep belief networks (Shahabi et al. 2021), deep learning ensembles (Costache et al. 2020), long short-term memory (LSTM) (Zhao et al. 2022), and 1D convolutional neural networks (CNN) (Tsangaratos et al. 2023). While these deep learning models have the potential to markedly enhance the accuracy of flash flood predictions, further research is essential to understand their applicability across diverse geographical contexts, both to broaden and deepen the existing knowledge base and to harness the capabilities of these models fully.

This study aims to partially fill the above gap in the existing literature by investigating the potential application of 1D-CNN for spatial predictions of flash floods in a tropical area of the Thanh Hoa province. This province belongs to North Central Vietnam, which frequently experiences flash floods after heavy rainfall during tropical storms. The subsequent sections of this paper are organized as follows. Section 2 delineates the Materials and Methods employed. Section 3 describes the proposed methodology for the spatial prediction of flash floods using Deep 1D-CNN and Multisourced Geospatial data. The Results and Analysis are presented in Sect. 4. Discussions are contained within Sect. 5, and the concluding remarks are provided in Sect. 6.

Materials and methods

Study area

The research is conducted in the Thanh Hoa province, situated in the north-central region of Vietnam. It is located approximately 110 km south of Hanoi and encompasses an area of approximately 11,080.8 km². Geographically, the coordinates of the study area span from 19°17’20” to 20°40’ North latitude and from 104°22’ to 106°04’ East longitude (Fig. 1). The study area showcases a diverse topography characterized by various landforms such as mountains, hills, plains, and coastal areas. The elevation of the region ranges from 0.0 to 1897.1 meters above sea level (m a.s.l.), further adding to its topographical heterogeneity. The western part is dominated by the Truong Son Mountain Range, which runs along the border with Laos. This mountainous terrain gradually transitions into hills and plateaus towards the east before reaching the flat plains along the coast (Fig. 1). The study area exhibits a slope variation ranging from 0.0 to 77.1° with a mean of 14.9° and a standard deviation of 12.4°. Approximately one-fourth (25%) of the study area consists of slopes less than 3°. Additionally, 13.3% of the study area comprises slopes between 3 and 7.5°, while another 13.3% features slopes ranging from 7.5 to 15°.

Concerning the soil type, the study area encompasses various soil types. The predominant one is the yellow-red soil found on clay, metamorphic, and acid-magmatic rocks, covering approximately 35.9% of the area. Following this, the yellowish-brown soil is present on basalt and limestone, accounting for approximately 12.5% of the study area. Additionally, the pale yellow soil on sandstone occupies about 11.7% of the region. Geologically, the province showcases a diverse range of more than 30 exposed formations and complexes. These geological formations exhibit distinct spatial distributions. Notably, three dominant formations, namely Dong Trau, Dong Son, and Song Ca, cover 42.7% of the study area. The main lithologies within these formations include sandstone, silty sandstone, quartz-mica sandstone, clay-sericite shale, yellow sand, silt, and motley lateritized clay.

As reported by the General Statistics Office of Vietnam (www.gso.gov.vn), the population of Thanh Hoa province in 2019 was recorded at 3,640,128 individuals, establishing it as the third most populous province in Vietnam, after Hanoi capital and Ho Chi Minh city. The population distribution within this province exhibits significant disparities between its plains and mountainous regions. The majority of the population is concentrated in urban centers, towns, coastal areas, and along riverbanks, while the mountainous areas remain sparsely populated. In particular, Thanh Hoa city stands out with a population density of more than 2,400 people per square kilometer. Similarly, Hau Loc, Hoang Hoa, and Quang Xuong districts (Fig. 1) also show relatively dense populations, exceeding 1,100 people per square kilometer. On the other hand, the mountainous districts, such as Muong Lat, Quan Son, and Quan Hoa (Fig. 1), experience significantly lower population densities, hovering around 40 people per square kilometer. This marked variation in population density underscores the contrasting settlement patterns between the plains and mountainous terrains in the province.

Thanh Hoa province is situated within the tropical monsoon climate zone, which is influenced by both the temperate climate of the Gulf of Tonkin and the North Central Coast (Nguyen et al. 2021). The region experiences two distinct seasons annually: summer and winter. The summer, from May to October, is characterized by hot, humid weather with frequent rainfall, influenced mainly by hot and dry southwesterly winds. Conversely, the winter, from November to April, brings cold conditions but little rain to the province. Temperatures range between 20 and 28 °C throughout the year, and the region receives an average annual rainfall of 1,800 mm (Nguyen et al. 2020a). The Thanh Hoa province is frequently impacted by tropical storms and depressions, resulting in numerous flash floods and associated landslides (Manh 2017; Thuy 2019). These natural events pose significant challenges to the region, necessitating careful monitoring and preparedness measures to mitigate potential risks and ensure the safety of its residents and infrastructure. For example, tropical storm Wipha occurred from the 2nd to the 4th of August 2019, bringing torrential rainfall that led to flash floods and inundation. Tragically, this event resulted in the loss of 16 lives, and the total estimated losses amounted to US$43.1 million.

Data

Historical flash-flooded location

Previously flash-flooded locations and their influencing indicators are necessary for building prediction models. Therefore, in this research, we prepared the flash-flood inventory map (Fig. 1) using 2540 flash flood polygons derived from the research project VAST05.01/21–22, funded by the Vietnam Academy of Science and Technology (VAST). These polygons, which represent flash floods that occurred in the last five years, were detected from Sentinel-1 SAR imagery using change detection methods. The Sentinel-1 mission is a notable C-band Synthetic Aperture Radar (SAR) constellation composed of two polar-orbiting satellites. Each satellite is equipped with a C-band SAR sensor capable of capturing imagery at a spatial resolution of 10 m, offering a high revisit time and ensuring imagery availability every six days in constellation mode (Tarpanelli et al. 2022).

First, we focused on assessing tropical storms that occurred within the past five years, from 2018 to 2022, and resulted in heavy rainfall and subsequent flash floods. We collected Sentinel-1 images for each storm before and after the flooding events. These images were then subjected to a rigorous processing procedure to identify and localize flash floods and inundations through change detection. Further details regarding the change detection technique employed for multi-temporal Sentinel-1A SAR image processing can be found in Trong et al. (2023), which describes the methodologies and algorithms utilized in this flash flood detection study. Finally, fieldwork was carried out to study and check flood locations.

Flash flood influencing factors

In the context of predicting flash flood inundation, a typical approach involves analyzing past flood events and their associated influencing factors. The accurate prediction of future flash floods relies heavily on the careful selection of these influencing factors. To this end, we reviewed the literature (Abedi et al. 2022; Costache and Bui 2020; Ekmekcioğlu et al. 2022; Hapuarachchi et al. 2011; Ilia et al. 2022; Youssef et al. 2022) and analyzed catchment characteristics to identify the most relevant influencing factors for our spatial prediction model. The selection of these factors is paramount in ensuring the precision and effectiveness of our flash flood prediction efforts. As a result, twelve flash flood influencing factors were considered: geology, soil type, land use/land cover (LULC), stream density, NDVI, NDWI, elevation, TWI, slope, aspect, curvature, and rainfall.

Geology is crucial in predicting flash flood inundation, primarily due to its influence on key factors such as drainage characteristics, channel formation, and capacity (Mahala 2020; Montgomery and Buffington 1997). Different types of rocks and geological formations can either facilitate or hinder the movement of water. Moreover, the geological composition of riverbeds and channels plays a crucial role in determining their capacity to carry water (Matsuda 2004). In regions with narrow or obstructed channels due to geological features, flash floods can cause rapid water accumulation and overflow, leading to destructive flooding downstream (Ba et al. 2022). In this research, the geology map (Fig. 2a) was constructed with 20 classes based on the Geological and Mineral Resources Maps at a scale of 1:200,000 provided by the Ministry of Natural Resources and Environment of Vietnam.

Soil type should be considered for spatial prediction of flash floods because it may influence infiltration patterns and runoff processes (Liu et al. 2019). Different soil types have varying levels of permeability, affecting the rate at which water can infiltrate into the ground or run off over the surface. Soils with high permeability allow water to infiltrate quickly, reducing surface runoff (Huat et al. 2006) and potentially mitigating flash flood risks. On the other hand, soils with low permeability lead to increased surface runoff (Naef et al. 2002), contributing to flash flood occurrences. In this analysis, the soil type map with 18 categories was compiled and is shown in Fig. 2b. The soil type data was extracted from national pedology maps at the scale of 1:100,000 provided by the Ministry of Agriculture and Rural Development of Vietnam.

Land Use and Land Cover (LULC) is a critical component in flash flood modeling due to its significant impact on how rainfall interacts with the ground and the subsequent movement of water (Rosso and Rulli 2002). Herein, different land cover types affect the amount of rainfall that infiltrates the soil versus that which becomes surface runoff. Therefore, incorporating LULC into flash flood modeling may provide a more comprehensive understanding of the potential flood dynamics, helping to predict flash floods better. In the conducted research, a LULC map featuring ten unique categories for the Thanh Hoa province was prepared, as illustrated in Fig. 2c. This map was constructed utilizing a LULC dataset from 2020 provided by the Japan Aerospace Exploration Agency (JAXA). The dataset, characterized by a 30-meter resolution, is accessible through JAXA’s online portal at www.eorc.jaxa.jp, which was accessed on June 15, 2023.

Stream density was selected for flash flood modeling (Dutta et al. 2023) because it provides vital insights into an area’s natural water flow patterns, drainage capacity, and overall hydrological characteristics, aiding in the accurate prediction of flash floods. In the present study, the stream density map for the Thanh Hoa province (Fig. 2d) was generated using the stream data obtained from OpenStreetMap (accessible at www.openstreetmap.org). The computation of stream density was performed utilizing the Line Density tool within ArcGIS Pro. The resulting stream density values range from 0.0 to 4.7 km per square kilometer.

NDVI and NDWI are valuable indices for spatial prediction of flash floods because they capture crucial environmental information related to vegetation and water content. NDVI relates to the health and density of vegetation across the study area. Areas with dense and healthy vegetation can slow down surface runoff by promoting water infiltration and reducing erosion (Rawat and Singh 2018). On the other hand, regions with sparse or degraded vegetation are more susceptible to rapid runoff (Miao et al. 2016), increasing the likelihood of flash floods during intense rainfall events. Regarding NDWI, its sensitivity to water content and distribution allows for more accurate and reliable flash flood predictions (Tsangaratos et al. 2023). In this research, the NDVI map (Fig. 2c) and the NDWI map (Fig. 2d) for the Thanh Hoa province were computed using the reflectance values derived from bands 4, 5, and 6 of Landsat 8 OLI (Operational Land Imager) imagery at 30 m resolution. The calculation procedure followed Eq. 1 (Defries and Townshend 1994) for NDVI and Eq. 2 (Xu 2006) for NDWI, as presented below:

$$\mathrm{NDVI} = \frac{\mathrm{Band\ 5} - \mathrm{Band\ 4}}{\mathrm{Band\ 5} + \mathrm{Band\ 4}}$$

(1)

$$\mathrm{NDWI} = \frac{\mathrm{Band\ 5} - \mathrm{Band\ 6}}{\mathrm{Band\ 5} + \mathrm{Band\ 6}}$$

(2)

The Landsat 8 OLI imagery utilized in this study is accessible through the website www.earthexplorer.usgs.gov.
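Both indices share the same normalized-difference form, which the following minimal Python sketch illustrates for a single pixel. The function names, the handling of a zero denominator, and the reflectance values are assumptions made for illustration; they are not taken from the study's processing chain.

```python
def normalized_difference(a, b):
    """Return (a - b) / (a + b), or None when the denominator is zero."""
    total = a + b
    if total == 0:
        return None  # no-data pixel: the index is undefined
    return (a - b) / total


def ndvi(band5, band4):
    """Eq. 1: NDVI from Band 5 and Band 4 reflectance."""
    return normalized_difference(band5, band4)


def ndwi(band5, band6):
    """Eq. 2: NDWI from Band 5 and Band 6 reflectance."""
    return normalized_difference(band5, band6)


# Hypothetical reflectance values for one vegetated pixel (not study data).
pixel = {"band4": 0.05, "band5": 0.45, "band6": 0.20}
print(round(ndvi(pixel["band5"], pixel["band4"]), 3))
print(round(ndwi(pixel["band5"], pixel["band6"]), 3))
```

On a full raster the same arithmetic would be applied cell by cell, with the undefined cells written as no-data.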

Topography and terrain characteristics play a significant role in determining the flow of water during rainfall events (Zevenbergen and Thorne 1987); therefore, they should be considered for flash flood modeling. In this research, the digital elevation model (DEM) for the study area was extracted from the ALOS DEM with 30 m resolution, which was provided by the Japan Aerospace Exploration Agency (JAXA) and can be accessed at www.eorc.jaxa.jp. Using the DEM, five morphometric factors were generated: elevation (Fig. 2e), TWI (Fig. 2f), slope (Fig. 2g), Aspect (Fig. 2h), and curvature (Fig. 2i).

Elevation refers to a location’s height or vertical position above sea level. As elevation changes across the study area, there is a corresponding variation in gravitational force. This relationship between elevation and gravity fundamentally shapes water flow patterns (Charlton 2007; Ullah and Zhang 2020) and influences the occurrence of flash floods; therefore, elevation was selected. TWI was used in flash flood modeling because it provides essential information regarding areas that are susceptible to runoff accumulation (Zahura et al. 2020) during rainfall events. Slope is recognized as a crucial factor because it directly affects the rate of surface runoff during rainfall events. Steeper slopes facilitate faster water flow, leading to higher runoff volumes (Abuzied et al. 2016) and increased flash flood potential. Aspect indicates the direction a slope faces and was included in this analysis because different aspects can lead to varying water flow patterns, affecting the pathways of surface runoff during rainfall events (Hinckley et al. 2014). Curvature was used because it provides valuable information about the shape and form of the terrain, which influences water flow during rainfall events. Different curvatures can create depressions and concave areas where water accumulates (Mahmoud and Gan 2018). Such areas are more prone to water pooling and potentially generating flash floods during heavy rainfall.

Rainfall plays a decisive role in the formation and movement of the water flow within a watershed, affecting the magnitude, velocity, and dynamics of flash flood flows (Bryndal et al. 2017). In this study, we investigated the most severe rainfall events that led to flash floods in Thanh Hoa province over the past five years, from 2018 to 2022. Consequently, the period from July 7, 2021, to August 31, 2021, was identified due to its association with multiple flash floods. During this time, the study area experienced the impact of a tropical depression originating from the East Sea, which resulted in heavy to very heavy rainfall. The highest recorded total rainfall was 1069.9 mm in the Trieu Son district, with the lowest being 1023.9 mm in the Tinh Gia district. In this analysis, the rainfall data was extracted from the climate data POWER project, National Aeronautics and Space Administration (NASA) (www.firms.modaps.eosdis.nasa.gov, accessed on 15 February 2023). Then, the rainfall map (Fig. 2j) was generated using the Inverse Distance Weighted (IDW) interpolation method available in ArcGIS Pro software.
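The idea behind inverse distance weighting can be sketched in a few lines of Python: each estimate is a weighted mean of the known values, with weights that decay with distance. This is only an illustration of the principle, not the ArcGIS Pro implementation. The station coordinates are hypothetical, and the distance exponent of 2 is an assumed setting that the text does not report.

```python
import math


def idw(x, y, stations, power=2.0):
    """Inverse-distance-weighted estimate at (x, y).

    stations: list of (sx, sy, value) tuples.
    power: distance exponent; 2.0 is an assumed setting, not one reported in the text.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for sx, sy, value in stations:
        dist = math.hypot(x - sx, y - sy)
        if dist == 0.0:
            return value  # the query point coincides with a station
        weight = 1.0 / dist ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total


# Hypothetical coordinates (km); the rainfall totals are the two extremes quoted above (mm).
stations = [(0.0, 0.0, 1069.9), (10.0, 0.0, 1023.9)]
print(idw(5.0, 0.0, stations))  # midpoint: equal weights, so the mean of the two values
print(idw(1.0, 0.0, stations))  # closer to the first station, so closer to its value
```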

Deep 1D-convolutional neural network

Deep learning is a modern machine learning field that employs structures with multiple processing layers to mine and represent data. In the last five years, deep learning, which encompasses various types of neural networks, e.g., deep neural networks (DNNs), convolutional neural networks (CNNs), recurrent neural networks (RNNs), and long short-term memory (LSTM) networks, has made a huge impact due to its groundbreaking success (Dhillon and Verma 2020) in various domains of our lives, e.g., energy forecasting (Wang et al. 2019), remote sensing and image classification (Paoletti et al. 2019), environmental modeling (López-Pérez et al. 2020), and flash flood prediction (Bui et al. 2020).

In this analysis, we selected CNNs, which have proven highly effective in various computer vision tasks, including image classification, object detection, and image segmentation (Feng et al. 2019; Yuan et al. 2023), for flash flood analysis. CNNs have become a fundamental tool in the field of computer vision due to their ability to automatically learn hierarchical features from images, leading to impressive performance on various visual recognition challenges (Planche and Andres 2019). CNNs consist of several convolutional layers, where filters or kernels are applied to input images to detect diverse features, including edges, textures, and shapes. The subsequent layers aggregate these features to recognize higher-level patterns and objects.

Traditional CNNs were initially developed for input data in the form of 2D images (Kabir et al. 2020). The architecture was later extended to process other types of data, including 1D data (Kiranyaz et al. 2015), e.g., time series anomaly analysis (Kim et al. 2023), and 3D data (Cawte and Bazylak 2022), e.g., video frames or volumetric medical images (Lin et al. 2022). The literature shows that 1D-CNNs have gained substantial popularity and have been applied successfully across diverse fields; therefore, they were selected for this analysis.

Considering a flash flood dataset FFDS = ($X,y$), $X$ is a vector of twelve flash flood factors, whereas $y$ is the flash flood index with values in [0,1]. A typical 1D Convolutional Neural Network (1D-CNN), shown in Fig. 3, comprises an input layer, a convolutional layer, a pooling layer, a flattened layer, a fully connected layer, and an output layer. The purpose of the 1D-CNN is to build an inference model that maps the twelve flash flood factors to flash flood indices. These indices are then used to generate a flash flood susceptibility map.

To begin, the input layer receives the twelve flash flood factors as an input matrix X and represents it as a 1D tensor. The subsequent step involves the 1D convolution layer, a fundamental building block of the 1D-CNN, which conducts convolution operations on the input data using filters (kernels). These filters slide over the input data, extracting local patterns and features. Each filter generates a feature map that emphasizes specific patterns within the input sequence (Fig. 3). The output of the convolution layer is then directed to the pooling layer, which reduces the spatial dimensions of the feature maps while preserving essential information. By selecting the maximum value from each pooling window and discarding the rest, the pooling layer effectively downsamples the feature maps (Ugli et al. 2023).
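The sliding-filter and max-pooling operations described above can be illustrated with a small pure-Python sketch. It is not the network used in the study: the input vector (twelve entries, one per factor listed in the Data section), the kernel weights, the kernel size of 3, and the pooling window of 2 are all hypothetical choices, and a single filter stands in for the many learned filters of a real layer.

```python
def conv1d(signal, kernel, bias=0.0):
    """Valid 1D convolution with stride 1 (cross-correlation, as in most deep learning libraries)."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k)) + bias
        for i in range(len(signal) - k + 1)
    ]


def max_pool1d(feature_map, pool_size=2):
    """Non-overlapping max pooling; a trailing partial window is dropped."""
    return [
        max(feature_map[i:i + pool_size])
        for i in range(0, len(feature_map) - pool_size + 1, pool_size)
    ]


# Hypothetical normalized factor values for one pixel (twelve entries, one per factor).
x = [0.10, 0.40, 0.35, 0.80, 0.60, 0.20, 0.90, 0.50, 0.30, 0.70, 0.15, 0.55]
kernel = [1.0, 0.0, -1.0]  # illustrative weights; real kernels are learned during training

feature_map = conv1d(x, kernel)   # length 12 - 3 + 1 = 10
pooled = max_pool1d(feature_map)  # length 10 // 2 = 5
print(len(feature_map), len(pooled))
```

Stacking several such convolution and pooling stages, each with many filters, yields the progressively shorter but richer feature maps that the later layers consume.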

After the convolutional and pooling layers, the output undergoes flattening to transform the 2D feature maps into a 1D vector. This flattened vector is then fed into fully connected layers (also known as dense layers) to learn high-level representations and make predictions. The final layer of the 1D-CNN is the output layer, which produces the spatial predictions of flash floods based on the learned representations. The number of nodes in the output layer depends on the specific task being solved. In this research context, the spatial prediction of flash floods is considered to be a binary classification task; therefore, the output layer has one node. The Rectified Linear Unit (ReLU) (Eq. 3) is commonly selected as the activation function (Ullah et al. 2022), whereas the sigmoid (Eq. 4) is used as the transfer function.

$$\mathrm{ActF}(x) = \max(0, x)$$

(3)

$$\mathrm{TranF}(x) = \frac{1}{1 + \exp(-x)}$$

(4)
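Equations 3 and 4 translate directly into code. The sketch below, with hypothetical hidden activations, weights, and bias, shows how a single output node could combine its inputs and squash the result into the interval (0, 1), so that it can be read as a flash flood index. It is an illustration of the two functions, not the trained model.

```python
import math


def relu(x):
    """Eq. 3: ActF(x) = max(0, x)."""
    return max(0.0, x)


def sigmoid(x):
    """Eq. 4: TranF(x) = 1 / (1 + exp(-x))."""
    return 1.0 / (1.0 + math.exp(-x))


def output_node(features, weights, bias):
    """Single output node: weighted sum of the inputs passed through the sigmoid."""
    z = sum(f * w for f, w in zip(features, weights)) + bias
    return sigmoid(z)


# Hypothetical pre-activations, weights, and bias, for illustration only.
hidden = [relu(v) for v in (-0.7, 0.2, 1.5)]  # negative input is clipped to 0.0
print(hidden)
print(output_node(hidden, weights=[0.4, -1.0, 0.8], bias=-0.5))
```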

Dropout is recommended to mitigate the risk of overfitting (Lang et al. 2020). Dropout randomly sets a fraction of the neurons’ outputs to zero during training, encouraging the network to learn more resilient features. Additionally, incorporating batch normalization can enhance stability and accelerate the training process. Batch normalization normalizes the activations of each layer, ensuring consistent mean and variance throughout the training procedure.
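Both regularization steps can be sketched in plain Python. The dropout function below uses the common "inverted" convention, in which surviving activations are divided by the keep probability so that their expected value is unchanged; this rescaling, the dropout rate of 0.5, and the epsilon term are assumptions for illustration, and the learned scale and shift parameters of batch normalization are omitted for brevity.

```python
import random


def dropout(activations, rate, rng):
    """Inverted dropout: zero each value with probability `rate`, rescale the survivors."""
    keep = 1.0 - rate
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


def batch_norm(batch, eps=1e-5):
    """Normalize one feature across a batch to zero mean and near-unit variance."""
    mean = sum(batch) / len(batch)
    var = sum((v - mean) ** 2 for v in batch) / len(batch)
    return [(v - mean) / (var + eps) ** 0.5 for v in batch]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
acts = [0.5, 1.2, 0.0, 2.0, 0.8, 1.1]
dropped = dropout(acts, rate=0.5, rng=rng)  # the rate is a hypothetical choice
normed = batch_norm(acts)
print(dropped)
print([round(v, 3) for v in normed])
```

Dropout is applied only during training; at prediction time the activations pass through unchanged.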

Proposed methodology for spatial prediction of Flash Flood using deep 1D-CNN and multi-source geospatial data

The proposed methodology for the spatial prediction of flash floods using 1D-CNN and GIS is presented in Fig. 4. In this research, the SNAP toolbox and ArcGIS Pro 3.0 were utilized for processing the Sentinel-1 SAR imagery and the flash flood influencing factors, respectively. The statistical tests were carried out using IBM SPSS Statistics 29.0. The Python code for the 1D-CNN algorithm can be found at www.tensorflow.org, whereas the authors wrote another Python script to convert the twelve influencing factors to the input format of the 1D-CNN and to convert the flash flood susceptibility indices to a GIS format that can be opened in ArcGIS Pro. The modeling process was carried out using the Deep Learning toolset in ArcGIS Pro 3.2, which utilizes the TensorFlow and Keras libraries, deep learning APIs developed by Google. For the two benchmark models, support vector machine (SVM) and logistic regression (LR), the Weka Wrapper API in Python (Reutermann 2020) was used. Both Spyder and Microsoft Visual Studio Code were used to edit and debug the Python code in this project.

Building flash flood database

This research utilized the ESRI geodatabase format (Zeiler 1999) to construct the flash-flood database because it efficiently organizes geospatial data from diverse sources. We selected the WGS 1984 UTM Zone 48 N coordinate system for the study area. As a result, the flash-flood database in this research consists of 2540 flash-flood polygons and the 12 influencing factors mentioned in Sect. 2.2. The next step involved converting all twelve influencing factors into a raster format with a spatial resolution of 30 m. Subsequently, these factors were normalized to the range [0.01, 0.99] using Eq. 5. This normalization process was performed using the Raster Calculator tool available in ArcGIS Pro.

$$\mathrm{NewFLF} = \frac{\mathrm{FLF} - \mathrm{Min}(\mathrm{FLF})}{\mathrm{Max}(\mathrm{FLF}) - \mathrm{Min}(\mathrm{FLF})} \times (0.99 - 0.01) + 0.01$$

(5)

Here, NewFLF represents the new raster value of the flash flood influencing factor, while FLF denotes its original raster value. Max(FLF) and Min(FLF) correspond to the maximum and minimum values in the influencing factor, respectively.
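As an illustration of this normalization step, the following minimal Python sketch rescales a list of factor values, assuming the intended mapping sends Min(FLF) to 0.01 and Max(FLF) to 0.99, as the stated target range implies. The function name `rescale` and the sample slope values are hypothetical; in the study the operation was carried out on rasters with the Raster Calculator.

```python
def rescale(values, low=0.01, high=0.99):
    """Min-max rescale a sequence of factor values to the range [low, high]."""
    vmin, vmax = min(values), max(values)
    if vmax == vmin:
        raise ValueError("constant factor: Max(FLF) equals Min(FLF)")
    span = high - low
    return [(v - vmin) / (vmax - vmin) * span + low for v in values]


slope_deg = [0.0, 12.5, 25.0, 50.0]   # hypothetical slope values in degrees
scaled = rescale(slope_deg)
print(scaled)
```

Keeping the bounds strictly inside (0, 1) avoids exact zeros and ones in the model input.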

This study frames the spatial prediction of flash floods as a binary pattern recognition problem. The objective is to classify each pixel in the study area into one of two categories, “non-flash flood” or “flash flood”, based on the patterns of the 12 flash flood influencing factors. To achieve this, 2540 points were randomly generated in non-flash flood areas. As a result, a total of 5080 locations were obtained, combining flash flood and non-flash flood locations. The flash flood locations were then labeled with a value of “1”, while the non-flash flood locations were labeled with a value of “0”. In the next step, a sampling process was conducted to extract the values of the twelve influencing factors for these locations, employing the Sample tool in ArcGIS Pro. Finally, the data were randomly divided in a 70/30 ratio to create the training dataset, which comprised 3556 samples, and the validation dataset, which comprised 1524 samples.
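The labeling and random 70/30 partition can be sketched as follows. This is a simplified illustration rather than the authors' script: the function name `split_samples`, the seed, and the placeholder location identifiers are assumptions, and the text does not say whether the split was stratified by class.

```python
import random


def split_samples(samples, train_fraction=0.7, seed=42):
    """Shuffle labeled samples and split them into training and validation sets."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]


# Hypothetical stand-ins: (location id, label); 1 = flash flood, 0 = non-flash flood.
flood = [(i, 1) for i in range(2540)]
non_flood = [(i, 0) for i in range(2540, 5080)]

train, valid = split_samples(flood + non_flood)
print(len(train), len(valid))
```

With 5080 locations, a 70/30 split gives the 3556 training and 1524 validation samples reported in the text.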

Multicollinearity and ranking of flash flood influencing factors

Multicollinearity checking of influencing factors is essential for flash flood modeling as it helps identify and address issues related to their intercorrelation. According to De Veaux and Ungar (1994), multicollinearity checking can help to enhance a model’s predictive accuracy and preserve its interpretability. When multicollinearity exists between the influencing factors, it becomes challenging to isolate the individual effect of each factor on flash floods. Multicollinearity can also lead to unstable coefficient estimates and inflated standard errors, making it difficult to determine the true significance of each flash flood influencing factor. To detect multicollinearity, the Variance Inflation Factor (VIF) and Tolerance (TOL) (Mansfield and Helms 1982; Miles 2014) were computed for each influencing factor. Problematic multicollinearity is indicated by VIF values exceeding 10, or equivalently TOL values below 0.1 (Menard 2002).
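The relationship between the two diagnostics can be made concrete with a short sketch. It uses the standard definitions VIF = 1 / (1 − R²), where R² comes from regressing one factor on all the others, and TOL = 1 / VIF. The function names and the example R² value are hypothetical; the study computed these statistics in SPSS.

```python
def vif_from_r2(r_squared):
    """VIF of a factor, given R^2 from regressing it on the remaining factors."""
    if not 0.0 <= r_squared < 1.0:
        raise ValueError("R^2 must be in [0, 1)")
    return 1.0 / (1.0 - r_squared)


def tolerance(vif):
    """Tolerance is the reciprocal of the VIF."""
    return 1.0 / vif


def is_problematic(vif, vif_limit=10.0, tol_limit=0.1):
    """Flag a factor using the thresholds quoted in the text."""
    return vif > vif_limit or tolerance(vif) < tol_limit


example_vif = vif_from_r2(0.5)   # hypothetical R^2 for one factor
print(example_vif, tolerance(example_vif), is_problematic(example_vif))
```

Because TOL is the reciprocal of VIF, the two thresholds (VIF > 10 and TOL < 0.1) flag the same factors.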

In addition to multicollinearity, the role of the influencing factors should be assessed to ensure all the factors are relevant before carrying out the flash flood modeling. The Random Forests-based Wrapper (RFW) method (Cardenas-Martinez et al. 2021) with a 5-fold cross-validation technique was employed for this task. We used 500 random trees in the Random Forests to search and rank each factor through various subset assessments, as suggested by Tuan et al. (2023). Herein, the Mean Absolute Error (MAE) in Eq. 6 was selected to measure the contribution of each factor.

$$\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\left|{FFL}_{i}-{FLO}_{i}\right|$$

(6)

where ${FFL}_{i}$ is the flash flood value, while ${FLO}_{i}$ denotes the flash flood output from the Random Forests-based Wrapper (RFW); $n$ is the total number of samples.
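A direct transcription of Eq. 6 into Python is given below as a minimal sketch. The function name and the four sample values are hypothetical and serve only to show how the MAE is evaluated.

```python
def mean_absolute_error(observed, predicted):
    """Mean absolute difference between observed labels and model outputs (Eq. 6)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


labels = [1, 0, 1, 0]               # hypothetical flash flood values (FFL)
outputs = [0.9, 0.2, 0.6, 0.1]      # hypothetical model outputs (FLO)
mae = mean_absolute_error(labels, outputs)
print(round(mae, 3))
```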

Designing Deep 1D-CNN model

The performance of the Deep 1D-CNN model significantly relies on its structure, activation function, transfer function, and parameter optimization, all of which require careful determination. This study proposes a specific structure for the Deep 1D-CNN model used in the spatial prediction of flash flood inundation, as depicted in Fig. 5. The model consists of an input layer, four 1D-CNN layers, two pooling layers, one flattened layer, two fully connected layers, and an output layer.

For the first 1D-CNN layer, we employed a kernel size of 1 and 32 filters, while for the second 1D-CNN layer, a kernel size of 3 and 64 filters were selected (Trong et al. 2023). The architecture incorporates pooling layers 1 and 2, each with a pool size of 2. Following these, the third 1D-CNN layer is configured with a kernel size of 1 and 64 filters, whereas the fourth 1D-CNN layer uses a kernel size of 3 and 128 filters. After the flattened layer (as depicted in Fig. 5), two densely connected layers were added, with 200 neurons in the first dense layer and 50 neurons in the second.
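To show how the kernel sizes and filter counts translate into trainable parameters, the sketch below counts the weights and biases of the four convolutional layers. It assumes that each sample enters the network as a length-12 sequence with a single channel; this input shape, and the helper name `conv1d_params`, are assumptions and are not stated in the text.

```python
def conv1d_params(kernel_size, in_channels, filters):
    """Trainable parameters of a 1D convolution: weights plus one bias per filter."""
    return kernel_size * in_channels * filters + filters


# (kernel size, filters) for the four convolutional layers described in the text.
conv_layers = [(1, 32), (3, 64), (1, 64), (3, 128)]

in_channels = 1   # assumption: one channel per sample
counts = []
for kernel_size, filters in conv_layers:
    counts.append(conv1d_params(kernel_size, in_channels, filters))
    in_channels = filters   # the next layer sees one channel per filter

for (kernel_size, filters), n_params in zip(conv_layers, counts):
    print(f"kernel={kernel_size}, filters={filters}: {n_params} parameters")
```

The parameter counts of the dense layers depend on the flattened length, which in turn depends on padding and layer ordering that the text does not specify, so the sketch stops at the convolutional layers.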

Finally, the output layer was structured with two neurons, representing “non-flash flood” and “flash flood,” respectively. Herein, a threshold of 0.5 was adopted to separate output indices into the two classes, “non-flash flood” and “flash flood”, for the model performance assessment. To support the model’s performance, we chose ReLU as the activation function and employed the sigmoid function as the transfer function. A summary of the proposed 1D-CNN model and its parameters is shown in Table 1.

Table 1 Summary of the proposed Deep 1D-CNN model with 112,994 parameters for spatial prediction of flash flood in this research


Optimizer and loss function

As shown in Table 1, a total of 112,994 parameters of the proposed Deep 1D-CNN model were identified, and in this study, they were optimized using the Adaptive Moment Estimation (ADAM) algorithm (Kingma and Ba 2015). The ADAM algorithm calculates individual learning rates for each parameter of the Deep 1D-CNN model based on their historical gradients. This adaptiveness helps the optimizer converge faster and more efficiently compared to traditional optimizers with fixed learning rates (Goodfellow et al. 2016).
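The per-parameter adaptation can be illustrated with a single-parameter sketch of the Adam update rule from Kingma and Ba (2015). The hyperparameter values are the commonly used defaults and the toy objective f(x) = x² is purely illustrative; neither is taken from this study, whose learning rate is not given in the text.

```python
import math


def adam_step(theta, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update for a single scalar parameter (step counter t starts at 1)."""
    m = beta1 * m + (1.0 - beta1) * grad          # running mean of gradients
    v = beta2 * v + (1.0 - beta2) * grad * grad   # running mean of squared gradients
    m_hat = m / (1.0 - beta1 ** t)                # bias correction for early steps
    v_hat = v / (1.0 - beta2 ** t)
    theta = theta - lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta, m, v


# Toy problem (not from the paper): minimize f(x) = x^2, whose gradient is 2x.
x, m, v = 1.0, 0.0, 0.0
history = [x]
for t in range(1, 2001):
    x, m, v = adam_step(x, 2.0 * x, m, v, t, lr=0.01)
    history.append(x)

print(history[1], history[-1])
```

Dividing by the square root of the second-moment estimate is what gives each parameter its own effective step size.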

During the training phase, the 112,994 parameters of the proposed Deep 1D-CNN model were adapted to identify the most appropriate functional mapping between the actual and the predicted values of the flash-flood and non-flash-flood classes. To measure how well the parameters fit the data, we employed the Mean Squared Error (MSE) in Eq. 7 as the loss function in this study.

$$\mathrm{MSE} = \frac{1}{n}\sum_{i=1}^{n}{\left({FFL}_{i}-{FLO}_{i}\right)}^{2}$$

(7)

where ${FFL}_{i}$ represents the flash flood value in the inventory map, while ${FLO}_{i}$ denotes the flash flood output obtained from the proposed Deep 1D-CNN model; $n$ is the total number of samples used in the analysis.
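Eq. 7 can be evaluated with a few lines of Python, shown here as a minimal sketch with hypothetical inventory labels and model outputs. Squaring the residuals penalizes confident misclassifications more heavily than small deviations.

```python
def mean_squared_error(observed, predicted):
    """Mean of squared differences between inventory labels and model outputs (Eq. 7)."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((o - p) ** 2 for o, p in zip(observed, predicted)) / len(observed)


inventory = [1, 0, 1, 0]            # hypothetical flash flood values (FFL)
outputs = [0.9, 0.2, 0.6, 0.1]      # hypothetical model outputs (FLO)
mse = mean_squared_error(inventory, outputs)
print(round(mse, 3))
```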

Performance assessment

A comprehensive set of metrics was employed to evaluate the performance of the proposed Deep 1D-CNN model for spatial prediction of flash flood inundation. The evaluation included the use of TP (true positive), FP (false positive), FN (false negative), and TN (true negative), as described by Nhu et al. (2020). Using these counts, additional metrics such as PPV (Positive Predictive Value), NPV (Negative Predictive Value), Sens (Sensitivity), Spec (Specificity), Accuracy, and F-Score were computed following López et al. (2013). In addition to the above metrics, the widely adopted ROC curve and AUC (Area Under the Curve) were employed to assess the overall generalization capability of the 1D-CNN model, following van Erkel and Pattynama (1998). Furthermore, the Kappa index (McHugh 2012) was computed to quantify the predictive accuracy of the Deep 1D-CNN model beyond chance agreement.
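The metrics listed above follow directly from the four confusion-matrix counts, as the sketch below shows. It is an illustration under stated assumptions: outputs equal to the 0.5 threshold are assigned to the flash flood class, the F-score is taken to be the F1 score, and the function names and counts are hypothetical.

```python
def confusion_counts(labels, scores, threshold=0.5):
    """Count TP, FP, FN, TN after thresholding the model outputs."""
    tp = fp = fn = tn = 0
    for label, score in zip(labels, scores):
        predicted = 1 if score >= threshold else 0   # assumption: ties count as flood
        if predicted == 1 and label == 1:
            tp += 1
        elif predicted == 1 and label == 0:
            fp += 1
        elif predicted == 0 and label == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def classification_metrics(tp, fp, fn, tn):
    """PPV, NPV, Sens, Spec, Accuracy, F-score (F1 assumed) and Cohen's kappa."""
    n = tp + fp + fn + tn
    accuracy = (tp + tn) / n
    ppv = tp / (tp + fp)
    npv = tn / (tn + fn)
    sens = tp / (tp + fn)
    spec = tn / (tn + fp)
    f_score = 2 * ppv * sens / (ppv + sens)
    # Chance agreement from the row and column totals of the confusion matrix.
    p_chance = ((tp + fp) * (tp + fn) + (tn + fn) * (tn + fp)) / (n * n)
    kappa = (accuracy - p_chance) / (1 - p_chance)
    return {"accuracy": accuracy, "ppv": ppv, "npv": npv, "sens": sens,
            "spec": spec, "f_score": f_score, "kappa": kappa}


counts = confusion_counts([1, 0, 1, 0], [0.9, 0.2, 0.4, 0.7])
metrics = classification_metrics(tp=90, fp=10, fn=10, tn=90)   # hypothetical counts
print(counts, metrics)
```

For a balanced dataset such as the one used here, chance agreement is 0.5 when predictions are also balanced, so kappa reduces to 2 × accuracy − 1.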

Benchmark model comparison

In the present research, the effectiveness of the proposed Deep 1D-Convolutional Neural Network model is demonstrated through a comparative analysis with two established flash flood models: the Support Vector Machine (SVM) and Logistic Regression (LR). They were chosen as benchmark models for flash flood modeling (Ngo et al. 2021) owing to their demonstrated proficiency in predicting areas susceptible to flash floods across a variety of studies (e.g., Costache 2019; El-Rawy et al. 2022; Pham et al. 2020; Youssef et al. 2016). The SVM analysis employed the Radial Basis Function (RBF) kernel, with the parameters C and gamma optimized through the grid search technique (Fayed and Atiya 2019). For the Logistic Regression model, the default parameters were used.

Compiling the flash flood susceptibility map

After successfully training and validating the Deep 1D-CNN model to meet the desired criteria, the model was utilized to calculate the flash flood susceptibility indices for the entire study area. The study area consists of a matrix with dimensions of 5926 columns × 5093 rows. In order to prepare the input data for the Deep 1D-CNN model, the twelve influencing factor maps were transformed into a study data matrix with a size of 30,181,118 rows × 12 columns (see Fig. 5), adhering to the model’s input format. Subsequently, these indices were converted into a GIS format to generate the final flash flood susceptibility map.
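The conversion between raster layers and the model's tabular input can be sketched on a toy grid. The function names, the row-major pixel ordering, and the 2 × 3 example rasters are assumptions for illustration; the authors' own conversion script is not reproduced in the text, and an array library would be used at the full study-area size.

```python
def rasters_to_matrix(rasters):
    """Flatten equally shaped 2D rasters into one row of factor values per pixel."""
    n_rows, n_cols = len(rasters[0]), len(rasters[0][0])
    return [
        [raster[r][c] for raster in rasters]
        for r in range(n_rows)
        for c in range(n_cols)
    ]


def vector_to_raster(values, n_rows, n_cols):
    """Reshape one index value per pixel back into a 2D grid (row-major order)."""
    if len(values) != n_rows * n_cols:
        raise ValueError("number of values does not match the grid size")
    return [values[r * n_cols:(r + 1) * n_cols] for r in range(n_rows)]


# Tiny hypothetical example: 2 factors on a grid of 2 rows x 3 columns.
factor_a = [[1, 2, 3], [4, 5, 6]]
factor_b = [[10, 20, 30], [40, 50, 60]]
matrix = rasters_to_matrix([factor_a, factor_b])
grid = vector_to_raster([row[0] for row in matrix], 2, 3)

# Study-area dimensions quoted in the text.
n_pixels = 5926 * 5093
print(len(matrix), n_pixels)
```

The product of the quoted column and row counts gives the 30,181,118 rows of the study data matrix.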

Results and analysis

Multicollinearity and ranking result

The results of the multicollinearity analysis for the 12 influencing factors are presented in Table 2. It is evident that all 12 influencing factors have Variance Inflation Factor (VIF) values below 10 and Tolerance (TOL) values above 0.1, indicating the absence of problematic multicollinearity in the dataset. Notably, among these factors, TWI exhibits the highest VIF value (2.087) and the lowest TOL value (0.479), while soil demonstrates the lowest VIF value (1.054) and the highest TOL value (0.949) (Table 2).

The 12 factors influencing flash floods in the study area have been assessed, and the findings are presented in Table 2. Notably, Slope, Topographic Wetness Index (TWI), geology, and rainfall emerged as the most influential factors. Their respective score values are 0.234, 0.159, 0.122, 0.109, and 0.086. They are followed by stream density (0.084), curvature (0.062), soil (0.057), LULC (0.041), and NDVI (0.035). Conversely, Aspect and NDWI exhibit the lowest impact on flash floods in this province, with scores of 0.005 and 0.020, respectively.

Table 2 Multicollinearity of flash flood influencing factors in this research


Model fitting and validation

Using the 3556 samples within the training dataset, the Deep 1D-CNN model was trained with the ADAM algorithm to optimize the 112,994 parameters. The outcomes are depicted in Fig. 6, Table 3, and Fig. 7. The results reveal a robust fit of the proposed Deep 1D-CNN model to the training dataset, as evidenced by a Mean Squared Error (MSE) of 0.059, an error mean of -0.016, and a standard deviation of errors (Error StD) of 0.244. In addition, the errors approximately follow a normal distribution (Fig. 6).

The detailed metrics of the Deep 1D-CNN model are presented in Table 3. The model achieved an accuracy of 91.5%, a Kappa value of 0.830, an F-score of 0.916, and an AUC of 0.977, indicating a good fit of the proposed Deep 1D-CNN model to the training data. The Positive Predictive Value (PPV) stands at 92.8%, meaning that 92.8% of the samples the model classified as flash flood are actual flash flood samples. Likewise, the Negative Predictive Value (NPV) is 90.2%, meaning that 90.2% of the samples classified as non-flash flood are actual non-flash flood samples. The sensitivity (Sens) is 90.4%, meaning that the Deep 1D-CNN model correctly identified 90.4% of the flash flood samples. Similarly, the Specificity (Spec) is 92.6%, meaning that it correctly identified 92.6% of the non-flash flood samples.

Table 3 Fitting performance of the flash flood models in the training phase


The model was then examined using the validation dataset to assess the Deep 1D-CNN model’s ability to generalize to new data and accurately predict flash flood occurrences at locations not used in training. The result is shown in Figs. 7 and 8 and Table 4. Our observations reveal an accuracy of 90.2%, a Kappa value of 0.804, an F-score of 0.903, and an AUC of 0.969, underscoring the model’s high predictive capability. Moreover, the model exhibits a low mean squared error (MSE) of 0.068, a mean error of -0.006, and a standard deviation of errors (Error StD) of 0.262, demonstrating a highly satisfactory outcome. Furthermore, the errors within the validation dataset approximately follow a normal distribution (Fig. 8). The model exhibits a PPV of 91.1%, meaning that 91.1% of the validation samples classified as flash flood are actual flash flood samples. The NPV stands at 89.4%, meaning that 89.4% of the samples classified as non-flash flood are actual non-flash flood samples. The Sens of 89.5% shows that the Deep 1D-CNN model correctly identified 89.5% of the flash flood samples, while the Spec of 86.7% shows that it correctly identified 86.7% of the non-flash flood samples. The other metrics measured for the Deep 1D-CNN model in the validation dataset are presented in Table 4.
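The reported error statistics can be cross-checked against one another, because the mean squared error equals the squared mean error plus the error variance. The sketch below applies this identity to the training and validation figures quoted in the text, assuming that Error StD denotes the standard deviation of the residuals; the function name is hypothetical.

```python
def mse_from_moments(error_mean, error_std):
    """MSE implied by the mean and standard deviation of the errors."""
    return error_mean ** 2 + error_std ** 2


# Values as quoted in the text for the two phases.
reported = {
    "training": {"mse": 0.059, "mean": -0.016, "std": 0.244},
    "validation": {"mse": 0.068, "mean": -0.006, "std": 0.262},
}

implied = {
    phase: mse_from_moments(stats["mean"], stats["std"])
    for phase, stats in reported.items()
}

for phase, value in implied.items():
    print(f"{phase}: implied MSE = {value:.4f}, reported MSE = {reported[phase]['mse']}")
```

In both phases the implied and reported values agree to within 0.001, and the near-zero mean errors indicate little systematic bias.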

Table 4 Prediction performance of the flash flood models in the validation phase


Comparative analysis and statistical evaluation

The efficacy of the proposed Deep 1D-CNN model was meticulously evaluated through a comparative analysis, pitting its performance and predictive capabilities against established benchmarks. As delineated in Sect. 3.6, this study chose support vector machine (SVM) and logistic regression (LR) as the benchmark models. In the case of the SVM model, the Radial Basis Function (RBF) kernel was applied, and a grid search approach was employed to explore the optimal parameters, specifically C (9.0) and Gamma (0.625), whereas, for the LR model, default parameters in the Weka API were used.

The fitting performance results are presented in Tables 3 and 4. It is evident that both the SVM model (accuracy = 89.86%, Kappa value = 0.797, F-score = 0.899, and AUC = 0.960) and the LR model (accuracy = 80.2%, Kappa value = 0.603, F-score = 0.800, and AUC = 0.880) demonstrate a good fit with the training data. However, the SVM model outperforms the LR model, as indicated in Table 3. Additional statistical metrics are detailed in Table 3. Overall, the performance of both the SVM model and the LR model is inferior to that of the proposed Deep 1D-CNN model.

The prediction capability of the benchmark models, as depicted in Table 4, demonstrates commendable results. Both the SVM model (accuracy = 87.7%, Kappa value = 0.755, F-score = 0.876, and AUC = 0.948) and the LR model (accuracy = 80.7%, Kappa value = 0.614, F-score = 0.798, and AUC = 0.873) show satisfactory performance. However, it is clear that the prediction performance of the SVM model is higher than that of the LR model. Nonetheless, the predictive performance of both the SVM model and the LR model is lower when compared to the predictive performance attained by the proposed Deep 1D-CNN model.

In order to ensure confident and reliable conclusions regarding the effectiveness of the proposed Deep 1D-CNN model compared to the two benchmarks in predicting flash floods, a statistical analysis using the Paired Samples T-Test was conducted. Herein, three pairs of flash flood models, Deep 1D-CNN vs. SVM, Deep 1D-CNN vs. LR, and SVM vs. LR, were considered. The null hypothesis (H0) posits that no significant difference in prediction capability exists between the two models in a pair, at the 95% confidence level. Subsequently, t-values and p-values are calculated for each pair. The null hypothesis is rejected if the t-value falls outside the range of -1.96 to +1.96 and the p-value is less than or equal to 0.05. In that case, we deemed the difference in prediction capability between the two models to be statistically significant at the 5% level of significance.
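The test statistic used in this comparison can be sketched as follows. The text does not state which paired quantity was tested, so the example pairs the outputs of two models at the same locations; the function name and the numbers are hypothetical, and the study itself used SPSS.

```python
import math
import statistics


def paired_t_statistic(a, b):
    """t statistic for the mean of the paired differences a[i] - b[i]."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two equally long samples with at least two pairs")
    diffs = [x - y for x, y in zip(a, b)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)   # sample standard deviation (n - 1)
    return mean_diff / (sd_diff / math.sqrt(len(diffs)))


# Hypothetical outputs of two models at the same five validation locations.
model_a = [0.91, 0.85, 0.78, 0.96, 0.88]
model_b = [0.85, 0.80, 0.75, 0.90, 0.84]

t_value = paired_t_statistic(model_a, model_b)
outside_range = abs(t_value) > 1.96
print(round(t_value, 3), outside_range)
```

The ±1.96 bounds are the large-sample critical values at the 5% level, which is appropriate for a validation set of 1524 samples; with only a handful of pairs, as in this toy example, the critical value from the t distribution would be larger.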

The results of the Paired Samples T-Test for the flash flood models in this research are presented in Table 5. The t-values for the two pairs, Deep 1D-CNN vs. SVM and Deep 1D-CNN vs. LR, fall outside the range of -1.96 to +1.96, and the corresponding p-values are less than 0.05 (Table 5). These findings indicate that the prediction performance of the Deep 1D-CNN model is significantly higher than that of both the SVM model and the LR model.

Table 5 Paired Samples T-Test for the flash flood models in this study


Flash-flood susceptibility map

Based on the above analysis and results, the proposed Deep 1D-CNN model proved to be the best-suited model for flash-flood susceptibility assessment in this research; it was therefore used to compute the flash-flood susceptibility index for each pixel in the study area. As a result, the susceptibility index for 30,181,118 pixels of the study area (5926 columns × 5093 rows) was determined. These pixels, with index values from 0.0001 to 0.9999, were converted to the WGS 1984 UTM Zone 48 N coordinate system to generate the flash flood susceptibility map (Fig. 9).

A visual interpretation of the susceptibility map indicates a high probability of flash floods in certain districts, namely Muong Lat, Quan Son, Ba Thuoc, and Lang Chanh. These districts frequently experience severe flash floods, attributed to the terrain’s elevated altitude and steep slopes. Conversely, in the southeast districts, such as Quang Xuong and Hoang Hoa, the flash flood index is notably lower. This is due to the relatively flat terrain and proximity to the sea, which facilitates more efficient drainage (refer to Fig. 9).

Discussion

Flash flooding persists as a perilous natural hazard, inflicting significant damage to infrastructure as well as natural and constructed environments, especially in tropical areas. Despite the challenges in forecasting flash floods, as highlighted in recent research (Brunner et al. 2021; Jay-Allemand et al. 2022; Maqtan et al. 2022; Mishra et al. 2022), identifying susceptibility in areas prone to flash floods in advance can be an effective strategy for reducing and mitigating flash flood risks. In this study, we propose a novel approach that combines 1D Deep Convolutional Neural Networks with multi-source geospatial data for modeling flash flood susceptibility, focusing on areas of the Thanh Hoa province in North Central Vietnam that have been heavily affected by flash floods in the last five years.

This study’s findings highlight that the structural design of the Deep 1D-CNN significantly impacts its predictive effectiveness. Based on this insight, our research involved configuring the deep learning model with four convolutional layers, two pooling layers, one flattened layer, and two fully connected layers. This structure follows the recommendations of Trong et al. (2023). Within this modeling approach, we employed the ADAM algorithm as the optimizer and Mean Squared Error (MSE) as the loss function. The observed high performance of the Deep 1D-CNN under this configuration suggests that the ADAM algorithm effectively optimizes the 112,994 parameters of the model and that MSE is a suitable loss function. However, asserting that this specific structure is the most suitable for our research objectives remains premature. Consequently, further investigations are necessary to identify the optimal structural design for autonomous flash flood modeling.

Compared with the SVM and LR benchmarks, the proposed Deep 1D-CNN model performs better, as confirmed by the Paired Samples T-Test. This underscores the potential of 1D-CNN as a promising tool for spatial predictions of flash floods. The finding is in line with recently reported results (e.g., Bui et al. 2020; Shahabi et al. 2021; Tsangaratos et al. 2023). The better performance of the Deep 1D-CNN model over SVM and LR in flash flood modeling can be attributed to its intrinsic ability to handle nonlinear relationships among influential factors. Logistic regression is less adept at capturing such complexities. SVM, while capable of nonlinear modeling with appropriate kernels, may not be as effective in delineating complex patterns in the given flash flood dataset. Moreover, the Deep 1D-CNN demonstrates robustness against noise and variability present in geospatial data pertinent to flash flood modeling. This robustness stems from its design focus on identifying and prioritizing the most relevant features, thereby diminishing the influence of extraneous or noisy data.

Another benefit of using the Deep 1D-CNN for flash flood modeling lies in its availability within the TensorFlow and Keras frameworks, as noted by Dürr et al. (2020). Because both frameworks are released under open-source licenses, they allow extensive customization and benefit from community-driven improvements, which enhances their utility in complex modeling tasks like flash flood prediction and ensures that Deep 1D-CNN architectures can be freely used. In this research, the Deep 1D-CNN modeling was conducted within the ArcGIS Pro 3.1.0 deep learning platform, which integrates both TensorFlow and Keras. This platform enables seamless integration with a variety of spatial analysis tools and the ArcGIS Pro model builder, thereby facilitating the automated processing of multi-sourced geospatial data. It supports activities such as data sampling, training, model validation, and the back-end creation of the flash flood susceptibility map. As a result, there was a significant reduction in the time required for data processing, modeling, and creating the susceptibility map.

The modeling process necessitates the use of a Python Integrated Development Environment (IDE), which serves as an extensive coding tool. This environment facilitates the entire Deep 1D-CNN modeling workflow for flash flood prediction, encompassing stages from data preprocessing to model deployment. While Spyder within Anaconda has been recognized as a powerful scientific environment in previous studies (Kadiyala and Kumar 2017), it proved unsuitable for flash flood modeling in this project due to compatibility issues with libraries in the ArcGIS Pro 3.1.0 deep learning environment. Consequently, Visual Studio Code (bin Uzayr 2022) was employed. This choice, however, necessitates specific knowledge for effective utilization.

Regarding the input factors, twelve influencing factors were carefully considered in this work, primarily based on an analysis of the characteristics of flash floods in the study area and the availability of geospatial data. The effective performance of the Deep 1D-CNN model suggests that the processes of selecting, processing, and integrating these influencing factors were successfully executed. Notably, slope and Topographic Wetness Index (TWI) emerged as the most critical factors for flash flood occurrences in this province. The prominence of slope is justified because the province has diverse topography in which steep slopes are common, especially in Muong Lat, Quan Son, Lang Chanh, and Trieu Son (Fig. 9). Steep slopes significantly accelerate surface runoff, reduce infiltration, direct water flow rapidly downhill, and increase the risk of soil erosion and landslides, all of which contribute to the heightened potential for flash flooding. TWI, in turn, indicates where water is likely to accumulate, such as in the areas of Muong Lat, Quan Son, and Lang Chanh. High TWI values correspond to areas with greater soil saturation (Fig. 2h), leading to flash floods during heavy rains.

A constraint of this research relates to the input data, which come from sources with varying resolutions. For instance, the DEM and its derivatives, LULC, NDVI, and NDWI have a spatial resolution of 30 m. Conversely, the soil map is derived from pedological maps at a scale of 1:100,000, while the geological data is sourced from Geological and Mineral Resources Maps at a scale of 1:200,000. This variation in scale and detail among the source maps may cause their content and precision to diverge, thereby introducing potential uncertainties in flood modeling. In order to improve the prediction accuracy of flash floods, it is recommended to utilize geospatial data of higher resolutions. This approach can offer more detailed and precise information, which is crucial for effective flash flood modeling and risk assessment.

A notable limitation of this research lies in the omission of considerations regarding the impact of climate change on the predictive capabilities of the Deep 1D-CNN model, as well as the investigation into the variability of the model’s performance. Consequently, future research will be undertaken to provide a more comprehensive evaluation and conclusions regarding the efficacy of this model in predicting flash floods, incorporating the potential effects of climate change and performance variability.

Nonetheless, on a regional scale, the flash flood susceptibility modeling conducted for Thanh Hoa province in this study carries substantial implications. The susceptibility map may help the authorities in creating strategic plans that reduce risks, improve disaster readiness, and establish policies that strengthen the resilience of both communities and infrastructure systems.

Concluding remarks

This study presents a thorough research methodology for the spatial prediction of flash floods, incorporating Deep 1D-CNN and multi-sourced geospatial data to offer an innovative approach that improves predictive accuracy. Its contributions establish a basis for future enhancements in flash flood management practices, underscoring the necessity for continued refinement of the model, an investigation into additional predictive variables, and the pragmatic application of these models. Moreover, this research not only advances our comprehension of flash flood dynamics but also accentuates the capacity of deep learning techniques to enhance disaster preparedness and mitigation strategies. From the findings of this study, several key conclusions can be drawn:

The Deep 1D-CNN model, utilizing the ADAM optimizer and MSE (Mean Squared Error) loss function, has demonstrated its capability to generate accurate flash flood susceptibility maps.
Comparatively, the performance of the proposed Deep 1D-CNN model exceeded that of the SVM (Support Vector Machine) and LR (Logistic Regression) models, which served as benchmarks in this study. This finding underscores the potential and effectiveness of 1D-CNN as an advanced tool in susceptibility mapping for flash floods.
In the context of this study, slope and Topographic Wetness Index (TWI) have emerged as the most significant factors influencing flash flood occurrences.
For future expansions of this research, the exploration of other advanced metaheuristic algorithms for training deep learning models is recommended. Additionally, innovative methods for autonomously determining the structure of deep learning models warrant further investigation.

Data availability

No datasets were generated or analysed during the current study.

References

Abedi R, Costache R, Shafizadeh-Moghadam H, Pham QB (2022) Flash-flood susceptibility mapping based on XGBoost, random forest and boosted regression trees. Geocarto Int 37(19):5479–5496
Abuzied S, Yuan M, Ibrahim S, Kaiser M, Saleem T (2016) Geospatial risk assessment of flash floods in Nuweiba area, Egypt. J Arid Environ 133:54–72
Al-Aizari AR, Al-Masnay YA, Aydda A, Zhang J, Ullah K, Islam ARMT, Habib T, Kaku DU, Nizeyimana JC, Al-Shaibah B, Khalil YM, AL-Hameedi WMM, Liu X (2022) Assessment Analysis of Flood susceptibility in Tropical Desert Area: a case study of Yemen. Remote Sens 14(16):4050
Al-Aizari AR, Alzahrani H, AlThuwaynee OF, Al-Masnay YA, Ullah K, Park H-J, Al-Areeq NM, Rahman M, Hazaea BY, Liu X (2024) Uncertainty reduction in Flood susceptibility mapping using Random Forest and eXtreme Gradient Boosting algorithms in two Tropical Desert cities, Shibam and Marib, Yemen. Remote Sens 16(2):336
Ba LH, Nam TV, Hung L (2022) Knowledge of Flash floods and related problems. Flash floods in Vietnam: causes, impacts, and solutions. Springer, pp 9–34
Beven KJ (2011) Rainfall-runoff modelling: the primer. Wiley
bin Uzayr S (2022) Mastering Visual Studio Code: A Beginner’s Guide. CRC
Borga M, Anagnostou EN, Blöschl G, Creutin JD (2011) Flash flood forecasting, warning and risk management: the HYDRATE project. Environ Sci Policy 14(7):834–844
Bournas A, Baltas E (2022) Investigation of the gridded flash flood Guidance in a peri-urban basin in greater Athens area, Greece. J Hydrol 610:127820
Brunner MI, Slater L, Tallaksen LM, Clark M (2021) Challenges in modeling and predicting floods and droughts: a review. Wiley Interdisciplinary Reviews: Water, 8(3), e1520
Bryndal T, Franczak P, Kroczak R, Cabaj W, Kołodziej A (2017) The impact of extreme rainfall and flash floods on the flood risk management process and geomorphological changes in small Carpathian catchments: a case study of the Kasiniczanka river (outer carpathians, Poland). Nat Hazards 88(1):95–120
Bui DT, Ngo P-TT, Pham TD, Jaafari A, Minh NQ, Hoa PV, Samui P (2019) A novel hybrid approach based on a swarm intelligence optimized extreme learning machine for flash flood susceptibility mapping. CATENA 179:184–196
Bui DT, Hoang N-D, Martínez-Álvarez F, Ngo P-TT, Hoa PV, Pham TD, Samui P, Costache R (2020) A novel deep learning neural network approach for predicting flash flood susceptibility: a case study at a high frequency tropical storm area. Sci Total Environ 701:134413
Cardenas-Martinez A, Rodriguez-Galiano V, Luque-Espinar JA, Mendes MP (2021) Predictive modelling benchmark of nitrate vulnerable zones at a regional scale based on Machine learning and remote sensing. J Hydrol 603:127092. https://doi.org/10.1016/j.jhydrol.2021.127092
Cawte T, Bazylak A (2022) A 3D convolutional neural network accurately predicts the permeability of gas diffusion layer materials directly from image data. Curr Opin Electrochem, 101101
Charlton R (2007) Fundamentals of fluvial geomorphology. Routledge
Costache R (2019) Flash-Flood potential assessment in the upper and middle sector of Prahova river catchment (Romania). A comparative approach between four hybrid models. Sci Total Environ 659:1115–1134
Costache R, Bui DT (2020) Identification of areas prone to flash-flood phenomena using multiple-criteria decision-making, bivariate statistics, machine learning and their ensembles. Sci Total Environ 712:136492
Costache R, Ngo PTT, Bui DT (2020) Novel ensembles of deep learning neural network and statistical learning for flash-flood susceptibility mapping. Water 12(6):1549
Coustau M, Bouvier C, Borrell-Estupina V, Jourde H (2012) Flood modelling with a distributed event-based parsimonious rainfall-runoff model: case of the karstic lez river catchment. Nat Hazards Earth Syst Sci 12(4):1119–1133
Dayan U, Lensky IM, Ziv B, Khain P (2021) Atmospheric conditions leading to an exceptional fatal flash flood in the Negev Desert, Israel. Nat Hazards Earth Syst Sci 21(5):1583–1597
De Veaux RD, Ungar LH (1994) Multicollinearity: a tale of two nonparametric regressions. Selecting models from data: artificial intelligence and statistics IV. Springer, pp 393–402
Defries RS, Townshend JRG (1994) NDVI-derived land cover classifications at a global scale. Int J Remote Sens 15(17):3567–3586
Devia GK, Ganasri BP, Dwarakish GS (2015) A review on hydrological models. Aquat Procedia 4:1001–1007
Dhillon A, Verma GK (2020) Convolutional neural network: a review of models, methodologies and applications to object detection. Progress Artif Intell 9(2):85–112
Douinot A, Roux H, Garambois P-A, Larnier K, Labat D, Dartus D (2016) Accounting for rainfall systematic spatial variability in flash flood forecasting. J Hydrol 541:359–370
Dürr O, Sick B, Murina E (2020) Probabilistic deep learning: with python, keras and tensorflow probability. Manning
Dutta M, Saha S, Saikh NI, Sarkar D, Mondal P (2023) Application of bivariate approaches for flood susceptibility mapping: a district level study in Eastern India. HydroResearch 6:108–121
Ekmekcioğlu Ö, Koc K, Özger M, Işık Z (2022) Exploring the additional value of class imbalance distributions on interpretable flash flood susceptibility prediction in the Black Warrior River basin, Alabama, United States. J Hydrol 610:127877
El-Rawy M, Elsadek WM, De Smedt F (2022) Flash flood susceptibility mapping in Sinai, Egypt using hydromorphic data, principal component analysis and logistic regression. Water 14(15):2434
Fayed HA, Atiya AF (2019) Speed up grid-search for parameter selection of support vector machines. Appl Soft Comput 80:202–210
Feng X, Jiang Y, Yang X, Du M, Li X (2019) Computer vision algorithms and hardware implementations: a survey. Integration 69:309–320
Goodfellow I, Bengio Y, Courville A (2016) Deep learning. MIT Press
Hapuarachchi H, Wang Q, Pagano T (2011) A review of advances in flash flood forecasting. Hydrol Process 25(18):2771–2784
Hinckley ELS, Ebel BA, Barnes RT, Anderson RS, Williams MW, Anderson SP (2014) Aspect control of water movement on hillslopes near the rain–snow transition of the Colorado Front Range. Hydrol Process 28(1):74–85
Hoyos CD, Ceballos LI, Pérez-Carrasquilla JS, Sepúlveda J, López-Zapata SM, Zuluaga MD, Velásquez N, Herrera-Mejía L, Hernández O, Guzmán-Echavarría G, Zapata M (2019) Meteorological conditions leading to the 2015 Salgar flash flood: lessons for vulnerable regions in tropical complex terrain. Nat Hazards Earth Syst Sci 19(11):2635–2665
Hu Y, Gui Z, Wang J, Li M (2022) Enriching the metadata of map images: a deep learning approach with GIS-based data augmentation. Int J Geogr Inf Sci 36(4):799–821
Huat BB, Ali FH, Low T (2006) Water infiltration characteristics of unsaturated soil slope and its effect on suction and stability. Geotech Geol Eng 24:1293–1306
Hussain M, Tayyab M, Zhang J, Shah AA, Ullah K, Mehmood U, Al-Shaibah B (2021) GIS-based multi-criteria approach for flood vulnerability assessment and mapping in district Shangla: Khyber Pakhtunkhwa, Pakistan. Sustainability 13(6):3126
Hussain M, Tayyab M, Ullah K, Ullah S, Rahman ZU, Zhang J, Al-Shaibah B (2023) Development of a new integrated flood resilience model using machine learning with GIS-based multi-criteria decision analysis. Urban Clim 50:101589
Ilia I, Tsangaratos P, Tzampoglou P, Chen W, Hong H (2022) Flash flood susceptibility mapping using stacking ensemble machine learning models. Geocarto Int 37(27):15010–15036
Jay-Allemand M, Demargne J, Garambois P-A, Javelle P, Gejadze I, Colleoni F, Organde D, Arnaud P, Fouchier C (2022) Spatially distributed calibration of a hydrological model with variational optimization constrained by physiographic maps for flash flood forecasting in France. Copernicus Meetings
Jodar-Abellan A, Valdes-Abellan J, Pla C, Gomariz-Castillo F (2019) Impact of land use changes on flash flood prediction using a sub-daily SWAT model in five Mediterranean ungauged watersheds (SE Spain). Sci Total Environ 657:1578–1591
Kabir S, Patidar S, Xia X, Liang Q, Neal J, Pender G (2020) A deep convolutional neural network model for rapid prediction of fluvial flood inundation. J Hydrol 590:125481
Kadiyala A, Kumar A (2017) Applications of Python to evaluate environmental data science problems. Environ Prog Sustain Energy 36(6):1580–1586
Kim J, Kang H, Kang P (2023) Time-series anomaly detection with stacked transformer representations and 1D convolutional network. Eng Appl Artif Intell 120:105964
Kingma D, Ba J (2015) Adam: a method for stochastic optimization. In: Proceedings of the 3rd International Conference on Learning Representations (ICLR’15). San Diego, 500
Kiranyaz S, Ince T, Hamila R, Gabbouj M (2015) Convolutional neural networks for patient-specific ECG classification. Proc., 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), IEEE, 2608–2611
Kourgialas N, Karatzas G (2014) A hydro-sedimentary modeling system for flash flood propagation and hazard estimation under different agricultural practices. Nat Hazards Earth Syst Sci 14(3):625–634
Kreibich H, Thaler T, Glade T, Molinari D (2019) Preface: damage of natural hazards: assessment and mitigation. Nat Hazards Earth Syst Sci 19(3):551–554
Lang C, Steinborn F, Steffens O, Lang EW (2020) Applying a 1D-CNN network to electricity load forecasting. Proc., theory and applications of time series analysis: selected contributions from ITISE 2019 6, Springer, 205–218
Lin Q-H, Niu Y-W, Sui J, Zhao W-D, Zhuo C, Calhoun VD (2022) SSPNet: an interpretable 3D-CNN for classification of schizophrenia using phase maps of resting-state complex-valued fMRI data. Med Image Anal 79:102430
Liu J, Engel BA, Wang Y, Wu Y, Zhang Z, Zhang M (2019) Runoff response to soil moisture and micro-topographic structure on the plot scale. Sci Rep 9(1):2532
Liu J, Wang J, Xiong J, Cheng W, Sun H, Yong Z, Wang N (2021) Hybrid models incorporating bivariate statistics and machine learning methods for flash flood susceptibility assessment based on remote sensing datasets. Remote Sens 13(23):4945
López V, Fernández A, García S, Palade V, Herrera F (2013) An insight into classification with imbalanced data: empirical results and current trends on using data intrinsic characteristics. Inf Sci 250:113–141
López-Pérez M, García L, Benítez C, Molina R (2020) A contribution to deep learning approaches for automatic classification of volcano-seismic events: deep Gaussian processes. IEEE Trans Geosci Remote Sens
Lorenzo-Lacruz J, Amengual A, Garcia C, Morán-Tejeda E, Homar V, Maimó-Far A, Hermoso A, Ramis C, Romero R (2019) Hydro-meteorological reconstruction and geomorphological impact assessment of the October 2018 catastrophic flash flood at Sant Llorenç, Mallorca (Spain). Nat Hazards Earth Syst Sci 19(11):2597–2617
Mahala A (2020) The significance of morphometric analysis to understand the hydrological and morphological characteristics in two different morpho-climatic settings. Appl Water Sci 10(1):1–16
Mahmoud SH, Gan TY (2018) Urbanization and climate change implications in flood risk management: developing an efficient decision support system for flood susceptibility mapping. Sci Total Environ 636:152–167
Manh TL (2017) Assessment of Sustainable Development Index for Thanh Hoa Province during period from 2010–2014. VNU J Science: Earth Environ Sci, 33(1S)
Mansfield ER, Helms BP (1982) Detecting multicollinearity. Am Stat 36(3a):158–160
Maqtan R, Othman F, Wan Jaafar WZ, Sherif M, El-Shafie A (2022) A scoping review of flash floods in Malaysia: current status and the way forward. Nat Hazards 114(3):2387–2416
Marchi L, Borga M, Preciso E, Gaume E (2010) Characterisation of selected extreme flash floods in Europe and implications for flood risk management. J Hydrol 394(1–2):118–133
Matsuda I (2004) River morphology and channel processes. Fresh Surf Water, 299–309
McHugh ML (2012) Interrater reliability: the kappa statistic. Biochem Med (Zagreb) 22(3):276–282
Menard S (2002) Applied logistic regression analysis. Sage
Miao Q, Yang D, Yang H, Li Z (2016) Establishing a rainfall threshold for flash flood warnings in China’s mountainous areas based on a distributed hydrological model. J Hydrol 541:371–386
Miles J (2014) Tolerance and variance inflation factor. Wiley statsref: statistics reference online
Mishra A, Mukherjee S, Merz B, Singh VP, Wright DB, Villarini G, Paul S, Kumar DN, Khedun CP, Niyogi D (2022) An overview of flood concepts, challenges, and future directions. J Hydrol Eng 27(6):03122001
Montgomery DR, Buffington JM (1997) Channel-reach morphology in mountain drainage basins. Geol Soc Am Bull 109(5):596–611
Munna GM, Alam MJB, Uddin MM, Islam N, Orthee AA, Hasan K (2021) Runoff prediction of Surma basin by curve number (CN) method using ARC-GIS and HEC-RAS. Environ Sustain Indic 11:100129
Naef F, Scherrer S, Weiler M (2002) A process based assessment of the potential to reduce flood runoff by land use change. J Hydrol 267(1–2):74–79
Ngo P-TT, Hoang N-D, Pradhan B, Nguyen QK, Tran XT, Nguyen QM, Nguyen VN, Samui P, Tien Bui D (2018) A novel hybrid swarm optimized multilayer neural network for spatial prediction of flash floods in tropical areas using sentinel-1 SAR imagery and geospatial data. Sensors 18(11):3704
Ngo P-TT, Pham TD, Nhu V-H, Le TT, Tran DA, Phan DC, Hoa PV, Amaro-Mellado JL, Bui DT (2021) A novel hybrid quantum-PSO and credal decision tree ensemble for tropical cyclone induced flash flood susceptibility mapping with geospatial data. J Hydrol 596:125682
Nguyen P, Thorstensen A, Sorooshian S, Hsu K, AghaKouchak A, Sanders B, Koren V, Cui Z, Smith M (2016) A high resolution coupled hydrologic–hydraulic model (HiResFlood-UCI) for flash flood modeling. J Hydrol 541:401–420
Nguyen G, Dlugolinsky S, Bobák M, Tran V, López García Á, Heredia I, Malík P, Hluchý L (2019) Machine learning and deep learning frameworks and libraries for large-scale data mining: a survey. Artif Intell Rev 52:77–124
Nguyen H-H, Nghia NH, Nguyen HTT, Le AT, Tran LTN, Duong LVK, Bohm S, Furniss MJ (2020a) Classification methods for mapping mangrove extents and drivers of change in Thanh Hoa province, Vietnam during 2005–2018. For Soc 4(1):225–242
Nguyen V-N, Yariyan P, Amiri M, Dang Tran A, Pham TD, Do MP, Thi Ngo PT, Nhu V-H, Long NQ, Tien Bui D (2020b) A new modeling approach for spatial prediction of flash flood with biogeography optimized CHAID tree ensemble and remote sensing data. Remote Sens 12(9):1373
Nguyen HTT, Hardy GE, Le TV, Nguyen HQ, Nguyen HH, Nguyen TV, Dell B (2021) Mangrove forest landcover changes in coastal Vietnam: a case study from 1973 to 2020 in Thanh Hoa and Nghe An provinces. Forests 12(5):637
Nhu V-H, Thi Ngo P-T, Pham TD, Dou J, Song X, Hoang N-D, Tran DA, Cao DP, Aydilek IB, Amiri M (2020) A new hybrid firefly–PSO optimized random subspace tree intelligence for torrential rainfall-induced flash flood susceptible mapping. Remote Sens 12(17):2688
Ozturk U, Wendi D, Crisologo I, Riemer A, Agarwal A, Vogel K, López-Tarazón JA, Korup O (2018) Rare flash floods and debris flows in southern Germany. Sci Total Environ 626:941–952
Panahi M, Jaafari A, Shirzadi A, Shahabi H, Rahmati O, Omidvar E, Lee S, Bui DT (2021) Deep learning neural networks for spatially explicit prediction of flash flood probability. Geosci Front 12(3):101076
Paoletti M, Haut J, Plaza J, Plaza A (2019) Deep learning classifiers for hyperspectral imaging: a review. ISPRS J Photogrammetry Remote Sens 158:279–317
Pham BT, Phong TV, Nguyen HD, Qi C, Al-Ansari N, Amini A, Ho LS, Tuyen TT, Yen HPH, Ly H-B (2020) A comparative study of kernel logistic regression, radial basis function classifier, multinomial naïve Bayes, and logistic model tree for flash flood susceptibility mapping. Water 12(1):239
Planche B, Andres E (2019) Hands-On Computer Vision with TensorFlow 2: leverage deep learning to create powerful image processing apps with TensorFlow 2.0 and Keras. Packt Publishing Ltd
Rasch MJ, Mackin C, Gallo ML, Chen A, Fasoli A, Odermatt F, Li N, Nandakumar S, Narayanan P, Tsai H (2023) Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators. Nat Commun 14(1):5282
Rawat KS, Singh SK (2018) Appraisal of soil conservation capacity using NDVI model-based C factor of RUSLE model for a semi arid ungauged watershed: a case study. Water Conserv Sci Eng 3:47–58
Reilly JA, Piechota TC (2005) Actual storm events outperform synthetic design storms: a review of SCS curve number applicability. Impacts Global Clim Change, 1–13
Reutermann P (2020) Python3 Wrapper for the Weka Machine Learning Workbench. Available online: https://pypi.org/project/python-weka-wrapper3/ (accessed on 16 August 2023)
Rosso R, Rulli MC (2002) An integrated simulation method for flash-flood risk assessment: 2. Effects of changes in land-use under a historical perspective. Hydrol Earth Syst Sci 6(2):285–294
Saharia M, Kirstetter P-E, Vergara H, Gourley JJ, Hong Y, Giroud M (2017) Mapping flash flood severity in the United States. J Hydrometeorol 18(2):397–411
Scorpio V, Crema S, Marra F, Righini M, Ciccarese G, Borga M, Cavalli M, Corsini A, Marchi L, Surian N, Comiti F (2018) Basin-scale analysis of the geomorphic effectiveness of flash floods: a study in the northern Apennines (Italy). Sci Total Environ 640–641:337–351
Shahabi H, Shirzadi A, Ronoud S, Asadi S, Pham BT, Mansouripour F, Geertsema M, Clague JJ, Bui DT (2021) Flash flood susceptibility mapping using a novel deep learning model based on deep belief network, back propagation and genetic algorithm. Geosci Front 12(3):101100
Tarpanelli A, Mondini AC, Camici S (2022) Effectiveness of Sentinel-1 and Sentinel-2 for flood detection assessment in Europe. Nat Hazards Earth Syst Sci 22(8):2473–2489
Thuy NB (2019) The risk of typhoon and storm surge along the coast of Vietnam. Vietnam J Mar Sci Technol 19(3):327–336
Tien Bui D, Hoang N-D (2017) A Bayesian framework based on a Gaussian mixture model and radial-basis-function Fisher discriminant analysis (BayGmmKda V1.1) for spatial prediction of floods. Geosci Model Dev 10(9):3391–3409
Tramblay Y, Bouvier C, Martin C, Didon-Lescot J-F, Todorovik D, Domergue J-M (2010) Assessment of initial soil moisture conditions for event-based rainfall–runoff modelling. J Hydrol 387(3–4):176–187
Trong NG, Quang PN, Cuong NV, Le HA, Nguyen HL, Tien Bui D (2023) Spatial prediction of fluvial flood in high-frequency tropical cyclone area using TensorFlow 1D-convolution neural networks and geospatial data. Remote Sens 15(22):5429
Tsangaratos P, Ilia I, Chrysafi A-A, Matiatos I, Chen W, Hong H (2023) Applying a 1D convolutional neural network in flood susceptibility assessments: the case of the island of Euboea, Greece. Remote Sens 15(14):3471
Tuan TA, Pha PD, Tam TT, Bui DT (2023) A new approach based on balancing composite motion optimization and deep neural networks for spatial prediction of landslides at tropical cyclone areas. IEEE Access
Ugli OEM, Lee K-H, Lee C-H (2023) Automatic Optimization of One-Dimensional CNN Architecture for Fault Diagnosis of a Hydraulic Piston Pump Using Genetic Algorithm. IEEE Access
Ullah K, Zhang J (2020) GIS-based flood hazard mapping using relative frequency ratio method: a case study of Panjkora River Basin, eastern Hindu Kush, Pakistan. PLoS ONE 15(3):e0229153
Ullah K, Wang Y, Fang Z, Wang L, Rahman M (2022) Multi-hazard susceptibility mapping based on convolutional neural networks. Geosci Front 13(5):101425
van Erkel AR, Pattynama PMT (1998) Receiver operating characteristic (ROC) analysis: basic principles and applications in radiology. Eur J Radiol 27(2):88–94
Vincendon B, Ducrocq V, Saulnier G-M, Bouilloud L, Chancibault K, Habets F, Noilhan J (2010) Benefit of coupling the ISBA land surface model with a TOPMODEL hydrological model version dedicated to Mediterranean flash-floods. J Hydrol 394(1–2):256–266
Vozinaki A-EK, Karatzas GP, Sibetheros IA, Varouchakis EA (2015) An agricultural flash flood loss estimation methodology: the case study of the Koiliaris basin (Greece), February 2003 flood. Nat Hazards 79:899–920
Wang H, Lei Z, Zhang X, Zhou B, Peng J (2019) A review of deep learning for renewable energy forecasting. Energy Conv Manag 198:111799
Wu J, Liu H, Wei G, Song T, Zhang C, Zhou H (2019) Flash flood forecasting using support vector regression model in a small mountainous catchment. Water 11(7):1327
Xu H (2006) Modification of normalised difference water index (NDWI) to enhance open water features in remotely sensed imagery. Int J Remote Sens 27(14):3025–3033
Yao J, Zhang X, Luo W, Liu C, Ren L (2022) Applications of Stacking/Blending ensemble learning approaches for evaluating flash flood susceptibility. Int J Appl Earth Obs Geoinf 112:102932
Youssef AM, Pradhan B, Sefry SA (2016) Flash flood susceptibility assessment in Jeddah city (Kingdom of Saudi Arabia) using bivariate and multivariate statistical models. Environ Earth Sci 75(1):12
Youssef AM, Pradhan B, Dikshit A, Mahdi AM (2022) Comparative study of convolutional neural network (CNN) and support vector machine (SVM) for flood susceptibility mapping: a case study at Ras Gharib, Red Sea, Egypt. Geocarto Int 37(26):11088–11115
Yuan F, Zhang Z, Fang Z (2023) An effective CNN and Transformer complementary network for medical image segmentation. Pattern Recogn 136:109228
Zahura FT, Goodall JL, Sadler JM, Shen Y, Morsy MM, Behl M (2020) Training machine learning surrogate models from a high-fidelity physics‐based model: application for real‐time street‐scale flood prediction in an urban coastal community. Water Resour Res 56(10):e2019WR027038
Zeiler M (1999) Modeling our world: the ESRI guide to geodatabase design. ESRI, Inc.
Zema DA, Labate A, Martino D, Zimbone SM (2017) Comparing different infiltration methods of the HEC-HMS model: the case study of the Mésima Torrent (Southern Italy). Land Degrad Dev 28(1):294–308
Zevenbergen LW, Thorne CR (1987) Quantitative analysis of land surface topography. Earth Surf Proc Land 12(1):47–56
Zhai X, Guo L, Liu R, Zhang Y (2018) Rainfall threshold determination for flash flood warning in mountainous catchments with consideration of antecedent soil moisture and rainfall pattern. Nat Hazards 94:605–625
Zhang Q, Zhang M, Chen T, Sun Z, Ma Y, Yu B (2019) Recent advances in convolutional neural network acceleration. Neurocomputing 323:37–51
Zhang Y, Wang Y, Zhang Y, Luan Q, Liu H (2021) Multi-scenario flash flood hazard assessment based on rainfall–runoff modeling and flood inundation modeling: a case study. Nat Hazards 105:967–981
Zhao G, Liu R, Yang M, Tu T, Ma M, Hong Y, Wang X (2022) Large-scale flash flood warning in China using deep learning. J Hydrol 604:127222

Funding

Open access funding provided by University of South-Eastern Norway. This research was funded by the Vietnam Academy of Science and Technology (VAST), Vietnam, under grant number VAST05.01/21–22.

Author information

Authors and Affiliations

Ho Chi Minh City Institute of Resources Geography, Vietnam Academy of Science and Technology, Mac Dinh Chi 1, Ben Nghe, 1 District, Ho Chi Minh City, 700000, Vietnam
Pham Viet Hoa, Nguyen An Binh, Nguyen Ngoc An, Giang Thi Phuong Thao & Nguyen Cao Hanh
Institute of Marine Geology and Geophysics, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet, Cau Giay District, Hanoi, 10000, Vietnam
Pham Viet Hong
Department of Geoinformatics, Faculty of Information Technology, Hanoi University of Mining and Geology, Duc Thang, Bac Tu Liem, Hanoi, 10000, Vietnam
Phuong Thao Thi Ngo
GIS Group, Department of Business and IT, University of South-Eastern Norway, Gullbringvegen 36, Bø i Telemark, N-3800, Norway
Dieu Tien Bui

Contributions

Conceptualization, PVH, NAB, PVH, NNA, GTPT, NCH, PTTN, DTB; methodology, PVH, NAB, PVH, DTB; software, PVH, NAB, PVH, NNA, DTB; validation, PVH, NAB, PVH; formal analysis, PVH, NAB, PVH; writing—original draft preparation, PVH, NAB, PVH, NNA, GTPT, NCH, PTTN, DTB; writing—review and editing, PVH, NAB, PVH, NNA, GTPT, NCH, PTTN, DTB; supervision, PVH, NAB, PVH, DTB.

Corresponding author

Correspondence to Dieu Tien Bui.

Ethics declarations

Competing interests

The authors declare no competing interests.

Conflict of interest

The authors declare no conflict of interest.

Additional information

Communicated by H. Babaie.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Hoa, P.V., Binh, N.A., Hong, P.V. et al. One-dimensional deep learning driven geospatial analysis for flash flood susceptibility mapping: a case study in North Central Vietnam. Earth Sci Inform (2024). https://doi.org/10.1007/s12145-024-01285-8

Received: 13 January 2024
Accepted: 16 March 2024
Published: 06 July 2024
DOI: https://doi.org/10.1007/s12145-024-01285-8
