Spatio temporal hydrological extreme forecasting framework using LSTM deep learning model

Anshuka, Anshuka; Chandra, Rohitash; Buzacott, Alexander J. V.; Sanderson, David; van Ogtrop, Floris F.

doi:10.1007/s00477-022-02204-3

Spatio temporal hydrological extreme forecasting framework using LSTM deep learning model

Original Paper
Open access
Published: 31 March 2022

Volume 36, pages 3467–3485, (2022)
Cite this article

Download PDF

You have full access to this open access article

Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Spatio temporal hydrological extreme forecasting framework using LSTM deep learning model

Download PDF

Anshuka Anshuka ORCID: orcid.org/0000-0002-2434-8701¹,
Rohitash Chandra²,
Alexander J. V. Buzacott³,
David Sanderson¹ &
…
Floris F. van Ogtrop³

3697 Accesses
12 Citations
Explore all metrics

Abstract

Hydrological extremes occupy a large spatial extent, with a temporal sequence, both of which can be influenced by a range of climatological and geographical phenomena. Understanding the key information in the spatial and temporal domain is essential to make accurate forecasts. The capabilities of deep learning methods can be applied in such instances due to their enhanced ability in learning complex relationships. Given its success in other domains, this study presents a framework that features a long short-term memory deep learning model for spatio temporal hydrological extreme forecasting in the South Pacific region. The data consists of satellite rainfall estimates and sea surface temperature (SST) anomalies. We use the satellite rainfall estimate to calculate the effective drought index (EDI), an indicator of hydrological extreme events. The framework is developed to forecast monthly EDI using three different approaches: (i) univariate (ii) multivariate with neighbouring spatial points (iii) multivariate with neighbouring spatial points and the eigenvector values of SST. Additionally, better identification of extreme wet events is noted with the inclusion of the eigenvector values of SST. By establishing the framework for the multivariate approach in two forms, it is evident that the model accuracy is contingent on understanding the dominant feature which influences precipitation regimes in the Pacific. The framework can be used to better understand linear and non-linear relationships within multi-dimensional data in other study regions, and provide long-term climate outlooks.

Improving flood forecasting using time-distributed CNN-LSTM model: a time-distributed spatiotemporal method

Article 10 June 2024

Going Beyond Traditional Methods: Using LSTM Networks to Predict Rainfall in Kerala

Analysis and Forecasting of Temporal Rainfall Variability Over Hundred Indian Cities Using Deep Learning Approaches

Article 20 April 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The number of hydro-meteorological extreme events such as floods and droughts are increasing in frequency and intensity globally (Thomas and López 2015; Grillakis 2019; Marsooli et al. 2019). The spatial distribution of these hazards is expected to increase throughout the South Pacific region (Shen et al. 2018). These hydrological hazards can be impacted by a range of climatic phenomena. For instance, the South Pacific Convergence Zone (SPCZ), a mass of convective cloud bands, is one of the leading climate drivers in the oceanic nations, in the South Pacific (SP) region (Vincent 1994). Minute changes in the sea surface temperature can displace the SPCZ, triggering anomalies in annual precipitation (Strauch et al. 2015). Fiji, one of the oceanic islands in the South-Western Pacific, is prone to hydro-meteorological hazards (Anshuka et al. 2020; Moishin et al. 2020; Varo et al. 2020). The movement of SPCZ northward or southward in Fiji results in extreme wet and dry conditions, respectively.

Risk management entails reducing harm to life, livelihoods, property and the surrounding environment (Zschau and Küppers 2013). A pivotal step to risk management necessitates establishing efficient early warning systems. Empirical studies in the Pacific have shown that informational and behavioural adaptive strategies such as responding to warnings are more practical in mitigating hydro-meteorological risks (Handmer and Iveson 2017; Anshuka et al. 2021; Pauli et al. 2021). Therefore, an early warning system providing climate outlooks on extremes is of utmost importance (Iese et al. 2021). An ideal hydrological early warning system would consist of a dense observation network that monitors key precipitation, temperature, and discharge parameters. However, insufficient gauge stations are often a deterrent to efficient hazard monitoring in Pacific nations. Despite the challenges, there are opportunities to devise an early warning system in these countries. Instead of investing in expensive observation networks, a cost-beneficial strategy would be to use data-driven methods such as machine learning models.

Among the many machine learning methods (Elman 1990), deep learning methods have shown innovation in the area of forecasting across a range of fields (Chandra et al. 2016; Lin et al. 2018; Shi et al. 2019). The method considers different neural network architectures and learning algorithms that are particularly suited for big data and multimedia problems (Xu and Mo 2020). Deep learning models can be loosely characterised into feedforward and recurrent neural network (RNN). Simple neural networks, also known as multilayer perceptron (Gardner and Dorling 1998) and convolutional neural networks (CNN) (Albawi et al. 2017), are distinguished examples of feedforward neural networks. RNN feature feedback (recurrent) connections for modelling temporal sequences and dynamical systems (Elman 1990; Medsker and Jain 1999). The long short term memory (LSTM) network model (Hochreiter and Schmidhuber 1997); is a prominent RNN used for temporal sequences (Shi and Kim 2017) and language modelling (Wilcox et al. 2018).

Deep learning methods have seen an application for hydrological extreme modelling (Anshuka et al. 2019; Poornima and Pushpalatha 2019; Kaur and Sood 2020). However, several studies have handled the problem from a point forecast perspective using a univariate approach (Poornima and Pushpalatha 2019; Liu et al. 2020; Moishin et al. 2021). Specifically for Fiji, previous studies established point-based hydrological index forecasting with time series models (Anshuka et al. 2018) and simple neural network (Anshuka et al. 2020). Point forecasts only consist of memory in a single variable and disregard the interactive effects of other variables or features (Reichstein et al. 2019). On the other hand, a spatio temporal forecast can look at the relationships across the entire spatial extent and make a more accurate forecast based on the information learnt from multiple features for a response variable. Spatio temporal predictions are useful in accounting for the local climate, topography, and orographic effects (Verma et al. 2021). The deep learning model architectures have been shown to understand complex spatial and temporal characteristics of hydrological data (Ren et al. 2019). As the climate becomes more erratic, there is a need to employ models that can better understand more extreme and complex relationships (Agana and Homaifar 2017). In Fiji, the effective drought index (EDI) is considered a good indicator of hydrological extremes (Anshuka et al. 2020); based on sensitivity to precipitation and vegetation. Therefore, the index can be used to investigate extreme wet events (Deo et al. 2019; Moishin et al. 2020) and extreme dry events (Malik et al. 2021).

Relatedly, ocean-atmospheric circulation, such as large-scale sea surface temperature (SST) gradients, can also contain meaningful information which affects precipitation patterns (Griffiths et al. 2003; McGree et al. 2014). Multivariate analysis, such as principal component analysis (PCA), can be used to yield information on multi-dimensional climate features (Barnett and Preisendorfer 1987; Salinger et al. 2001). Other advantages of PCA include the use of smaller memory, reduction in execution time and bandwidth requirement due to dimensionality reduction (Kaur and Sood 2020). While PCA is a powerful tool in extracting useful information from multi-dimensional datasets, it does not fully account for non-linear relationships in the data (Lever et al. 2017). However, PCA can be used in conjunction with deep learning models to better understand non-linear behaviour (Oja 1989; Mohsen et al. 2018; Sammen et al. 2021). Given their success in other domains, there is a potential in adapting these hybrid methods to aid in understanding and predicting of spatio temporal hydro-meteorological extremes.

This study presents a spatial–temporal hydro-meteorological extreme forecasting framework in the South Pacific region, using Fiji as a case study. The framework forecasts EDI with the LSTM model using three different approaches: (i) univariate (ii) multivariate with neighbouring spatial points (iii) multivariate with neighbouring spatial points together and the principal components of SST. The novelty of this study exists in demonstrating the effectiveness of advanced deep learning methods for spatio temporal hydrological extreme events. Notably, the result from the framework could be used to help in long term decision making and policy design.

2 Related works deep learning for hydrological extremes

Deep learning methods have shown innovation in hydrological extreme forecasting (Hu et al. 2018; Ham et al. 2019; Anshuka et al. 2019; Dikshit et al. 2021). The advanced learning algorithm and model architectures of deep learning models can provide automatic feature extraction (such as CNNs) and enhanced memory (such as LSTM) which facilities learning of complex relationships within large datasets (LeCun et al. 2015; Janiesch et al. 2021). Recently, Dikshit et al. (2021), used the LSTM model to forecast drought index and concluded that LSTM models show superiority compared to conventional machine learning models. Similarly, deep belief networks for streamflow prediction, showed high accuracy over the simple neural network; however, its performance compared to support vector machine was low (Agana and Homaifar 2017). Wu et al. (2021), coupled wavelet analysis with an autoregressive integrated moving average (ARIMA) based LSTM model, and reported better drought forecasts. In another study, low-flow hydrological time-series predictions were generated, and conclusions suggested better performance of the RNN-LSTM model rather than the RNN model on its own (Sahoo et al. 2019). Shen et al. (2019), used multi-source remote sensing data for drought monitoring using a combination of LSTM, support vector regression, and PCA to classify extreme events with high accuracy. Furthermore, the performance of deep learning models has also been compared to climate models. Climate models are sensitive and generally accurate to model primary phenomena or variables such as temperature and rainfall; however, climate models tend to be less sensitive for secondary and tertiary phenomena such as droughts and floods (Maity et al. 2021). This notion was used to develop a drought assessment model based on multiple hydro-meteorological precursors such as wind speed, air temperature, rainfall and geo-potential height using deep learning algorithms, which showed higher skills compared to climate models on its own for drought monitoring (Maity et al. 2021).

Fang et al. (2021), considered a local sequential LSTM model to predict flood susceptibility using a feature engineering method. This technique showed improved skill in capturing information on flood conditioning factors and increased ability to deal with sequential modelling capabilities within different spatial relationships. Hu et al. (2019), created a spatio temporal flood prediction model using LSTM and a reduced-order framework. The hybrid framework reduced the dimensionality of large spatial datasets and retained the important orthogonal features and singular value decomposition. Due to the reduced features, the model showed high computational efficiency while maintaining the accuracy for real-time flood forecasting (Hu et al. 2019). Gude et al. (2020) used deep learning models, which showed better performance for flood peak predictions than other statistical models (such as ARIMA) as well as physical models (hydrological and hydraulic), despite the latter’s ability in establishing understanding based on equations between the provided datasets. Therefore, in this paper, we exploit spatio temporal forecasting with a combination of linear and non-linear approaches, given the ability of deep learning models to understand complex relationships in data.

3 Materials and methods

3.1 Study region

The boundaries of Fiji’s islands span between 16°–9.5° S and 177–180.25° E, occupying an area of 18,270 square kilometres (Neall and Trewick 2008) in the South-West Pacific (SWP) region. Fiji has a tropical climate with a mean annual precipitation of 1676–3544 mm. The average annual precipitation is unequally distributed as most of it is concentrated in the summer season (wet season). Fiji is a low-lying island with mainly low elevation (Fig. 1); however, a mountain range on the main island results in orographic effect induced rainfall (Terry 2005). Consequently, the rainfall distribution among the island differs; as such, the windward of the island receives 13% more rainfall (Terry 2005). The most recent station-based temperature trend analysis indicated increases in mean and extreme temperature together with warming for countries located in the Western Pacific Ocean, including Fiji (McGree et al. 2019). The effects of global warming on SST influences the SPCZ, and induces El-Niño Southern Oscillation (ENSO) conditions (Lorrey et al. 2012), which is the leading climate driver affecting precipitation patterns in the SWP (McGree et al. 2014). The La Niña phase in Fiji is correlated with many flooding events (McAneney et al. 2017), while the El Niño is associated with dry spells.

3.2 Data

3.2.1 Precipitation data

Data-driven model development is contingent on reliable data records; however, lack of dense gauge networks and uneven distribution often makes the data records sparse, presenting challenges for modellers (Sun et al. 2018). In such circumstances, satellite data tends to be a better choice due to its equidistant spatial scale, reliability, and longer temporal scale. In this study, we use the NOAA Climate Prediction Centre Global Unified Gauge-Based Analysis of Daily Precipitation data (hereon referred to as CPC data). The dataset has a daily temporal resolution from January 1979 to the present, and the spatial resolution is 0.5°. The product has been validated using 30,000 rain gauge stations globally (Sun et al. 2018). The suitability of satellite rainfall estimates was previously undertaken for the Fiji region (Anshuka 2019; Anshuka et al. 2022) where the CPC satellite data showed the highest correlation (~ 0.8) with the on-the-ground data. Therefore, the CPC satellite data is one of the best available options as it also has the longest record (time frame). The data was extracted for the entire spatial region of Fiji from 1980 to 2020, resulting in 22 individual grid cells for analysis.

3.2.2 Sea surface temperature data

We retrieve the global analyses of monthly Kaplan Sea Surface temperature anomalies data (Kaplan et al. 1998), with a spatial resolution of 5.0° latitude by 5.0° longitude. The data comprises a large area, spanning from 100° E to 200° and 50° S to 10° N (Fig. 2), covering the Southwest Pacific region using the Arc GIS software. We use the monthly data from 1980 to 2020 to match the temporal scale and resolution of the satellite rainfall data.

3.3 Index calculation

In previous work, Anshuka et al. (2020), evaluated the suitability of different hydrological indices for Fiji. The study revealed EDI to be sensitive in the early detection of precipitation anomalies and is considered a suitable index for extreme hydrological events monitoring. The EDI, developed by Byun and Wilhite (1999), is calculated using daily precipitation which is based on the principle of harmonic sum, in such a way that water resources decline as time progresses from Day 1 to Day 365. While most of the hydrological indices operate on a monthly timescale, the EDI can operate on a daily scale, making it suitable also to investigate short-duration events such as floods. However, for this study, the daily EDI was aggregated to a monthly timescale for the entire area of Fiji. The monthly EDI values can be categorised based on a classification scale to give insight into the different intensities of hydrological extremes. In the results analysis and visualisation each EDI category has been assigned a ‘number’ value as a numerical category (Table 1). A Mann Kendall trend analysis was taken for the entire Fiji region on monthly EDI using the Python Mann–Kendall module (Hussain and Mahmud 2019). The Mann Kendall analysis is a rank-based non-parametric test applied to hydrological time series data to understand underlying trends (Wang et al. 2020). The trend analysis shows increasing EDI values for most months (April, May, June, July, August, September) indicating an increase in precipitation trend (Fig. 3); while for other months, an increase in EDI is noted initially which then reaches a plateau (January, February, March, October, November, December).

Table 1 Effective drought index classes from normal to extremely dry and wet conditions

Full size table

3.4 Framework for spatio temporal prediction

3.4.1 LSTM model

RNNs can be seen as deep feedforward neural networks with one-way flow of information during training. RNNs had major challenges for training sequences with long-term dependencies, in which the temporal information between the input and the output sequences extends for a long time span (Bengio et al. 1994). Consequently, this results in problems associated with the backpropagation of errors as the time span of the long-term dependencies increases. This leads to the problem of vanishing gradient in RNN training resulting in weight updates to become insignificant as the error is back-propagated (Hochreiter 1998). The LSTM model, in contrast to the canonical RNN, does not have such problems due to augmented memory components such as memory cells and gates, which retain lengthy sequences of information for longer durations (Mikolov et al. 2014; Bai et al. 2019). Hence, the LSTM serves to be more valuable than its other counterparts due to its ability to retain information over long time series and multiple-time steps. LSTM networks have recently been further improved with the emergence of bidirectional LSTM (Huang et al. 2015), encoder-decoder LSTM (Cho et al. 2014), attention LSTM (Wang et al. 2016) and transformer LSTM models (Sun et al. 2021). Although these derivatives are mainly suited for language modelling tasks, they can be used for time series prediction (Chandra et al. 2021). Hence, LSTM is the designated model for our forecasting framework, although other machine learning models or LSTM derivatives can also be implemented.

3.4.2 Data pre-processing

In this study, we use the 80:20 partition (train/test) for model development. While there is no fixed rule for the amount of data needed for model training and validation (Shirzadi et al. 2018; Umar Ibrahim et al. 2021); we use larger training set in order to ensure that we have specific extreme wet events and ENSO induced extreme dry events covered. The split also ensured that both the train and test sets reflected occurrence of significant extreme wet and dry events.

We process the data in a 3D tensor, the specific structure required for the LSTM model. This was based on Taken’s theorem, where the data is transformed by embedding or reconstruction (Noakes 1991). Therefore, in our case, we create the 3D tensor structure using a sliding window based on [input samples, window timesteps, features]. Hence, we implement this windowing technique in a way such that the prediction y*(t) is made using regressed Y_1, i.e., y(t−1), y(t−2), y(t−d) for a window size (WS), given by d which is user-defined (Fig. 4). Based on the WS = 3, the data series was split at every third step for each consecutive value in the entire train and test set, for which a y value was predicted.

3.4.3 Spatio temporal framework

In this study, the spatio temporal framework provides a multi-step ahead forecast, whereby the forecast is made for three-time steps (horizons). Forecasting refers to time-series prediction mapped from past data into future data at a longer time scale (Gaitán 2020). In our forecasting method, we consider the immediate history of the time series (window) to make a future prediction. For instance, monthly EDI data for the previous three months (January, February, and March), are used to predict an EDI value for the subsequent three months (April, May, and June). This also corresponds to the earlier selected WS = 3. Since the forecast is done iteratively using the windowing technique, the model is continually being updated as new data becomes available. The training was done to simulate this process, and this means that the model can learn and cope with any irregularities arising in the test data.

Below, the three different approaches to model development are described.

3.4.3.1 Univariate approach

For the univariate approach, the immediate history of the time series y*(t) is considered for three steps using y(t−1), y(t−2), y(t−d). We note that our framework considers multi-step ahead prediction i.e., y*(t + 2), y*(t + 1), y*(t). We represent y in the data with the EDI time series of a particular spatial location in (given by a grid) the region selected. Hence, using the univariate approach, our framework will provide three-step ahead predictions for the 7 × 8 grid (22 spatial locations) shown in Fig. 1. Note that each grid cell in Fig. 1 represents a different time-series data which is processed by our framework independently.

3.4.3.2 Multivariate approach

In the multivariate approach, we examine relationships between the different features given by the adjacent grid locations. Hence, this is the spatio temporal approach, where we take the spatial data into account. We implement a Pearson correlation feature selection approach (Pearson and Filon 1898) on the entire study region, using monthly EDI to obtain a correlation matrix. Based on this information, we select four features with the highest correlation for a given grid location. This ensures that the spatial patterns observed in the data are considered for model development.

Thereafter, the framework is similar to the univariate approach that concerns taking immediate past information (given by a window) and making a three-step ahead prediction. In the multivariate approach, the spatial data from the neighbouring grid cells are augmented to the univariate approach (except that the number of features used in this approach increases to four).

3.4.3.3 Multivariate approach with sea-surface temperature (SST-PCA)

For the multivariate approach with SST-PCA, our goal is to augment the previous multivariate approach with sea surface temperature data using PCA (SST-PCA). The first step entails PCA, which identifies new orthogonal feature vectors that extract the main information on the entire multi-dimensional dataset, giving us the principal component series. The PCA is implemented using R studio software (Wilks 2011). The first step requires SST data processing, by standardising and centring the data. Thereafter, we calculate the covariance matrix and perform eigenvalue decomposition. The loading patterns derived from the PCA indicate the extent to which the features correlate with each other, mainly in linear combinations. These can be used to spatially visualise the influence of large-scale processes such as ENSO. The eigenvalues and eigenvectors, which comprise the PCA series, represents the sea surface temperature anomalies.

In the second step, we undertake a correlation analysis to understand the relationship between the top ten selected principal components of SST and the entire gridded EDI data, similar to the plain multivariate approach. We select the PCA-SST series, which had the highest correlation with the gridded EDI. In this case, we augment the data in the earlier multivariate approach using the SST derived from the principal components.

3.4.4 Technical setup

The framework for LSTM model is implemented in Python using the Keras application from the TensorFlow module (Abadi et al. 2016) (Ketkar 2017). We fine-tuned the model's training parameters in trial experiments, such as identifying the optimal number of hidden layers, the number of hidden neurons in the respective layers, and the number of epochs (training iterations). The model architecture comprises one LSTM layer and 10 neurons in the hidden layer, and an output layer with three neurons representing the three predicting horizons.

We use the adaptive moment (Adam) optimiser (Kingma and Ba 2014), with a ‘relu’ activation function for dense LSTM layers. We utilise the root mean square error (RMSE) as the loss function to measure the model’s performance by quantifying the difference between the predicted and observed values. Overfitting the data causes errors in the testing stage due to the large starting weights of the network. To avoid this issue, we use a dropout regularisation of 0.2 in the model training phase. Additionally, we set the training to only 30 epochs to prevent overfitting. Each independent experiment ensures that the LSTM model begins with a different initial set of weights and biases (LSTM model parameters), allowing the model to deal with non-stationarity in the data. We summarise the model hyperparameters in Table 2.

Table 2 Summary of hyperparameters used for LSTM model training.

Full size table

3.5 Performance metrics

Both the training and the independent test sets are evaluated as follows:

Quantitative metrics We calculate the RMSE, MSE and R² for the three timesteps and compute the average for the 30 independent experimental runs. In addition, the overall RMSE in the train and test stages is also calculated for each grid cell.

Uncertainty quantification We calculate the higher and lower percentile on the mean predicted value (for 30 experimental runs) using the standard deviation for uncertainty quantification in the forecast.

Categorical forecast We evaluate the categorical forecast by obtaining a classification report using the sk learn package for the following scores: precision, recall, F-1 score and accuracy. The categorical forecast categories are cross tabulated in a confusion matrix for the approach with the best prediction. A confusion matrix table indicates the number of hits and misses for the predicted and the observed categories.

Spatio temporal maps We visualise spatio temporal maps for two time periods that correspond to the two seasons in Fiji. This shows the spatio temporal pattern comparison of the predicted and actual forecasts in the different seasons.

4 Results

4.1 PCA results

We applied PCA on the SST data and retained the top 10 principal components (PC) for further analysis. The percent of variation explained by the first 10 PCs is 99%. The loadings of PC 3 and PC 7 are shown in Fig. 5, highlights the dominance of SST anomalies in a given spatial location. We observe the strongest correlations for the third principal component loadings present North of Australia near the equator. The third PC loadings are also more apparent in the South Pacific region (~ 0.4), indicating short-term climate phenomena such as ENSO. The loadings of PC7 are more evident in the South-West Pacific region, just South-East of Fiji.

4.2 Feature correlations

In this section, we look at data visualisation and analysis using Pearson’s correlation feature matrix (Pearson and Filon 1898), as shown in Fig. 6 with reference to the ‘grid’ in Fig. 1, which denotes the features for the spatio temporal problem. We observe that the individual grid cell points are highly correlated with other spatial points. We select four other spatial points with high correlations with a corresponding grid cell for model development in the entire study area. For instance, Grid 1 is highly correlated with Grid 2, Grid 4, Grid 5, and Grid 9 with Pearson’s correlation value of ~ 0.97. The ten principal components also correlates with the spatial EDI grids. We observe that the principal components 3 and 7 have a reasonable correlation ~ 0.45 with the spatial EDI. Hence, PC 3 and PC 7 are used for model development in the multivariate approach using SST. The high correlation of PC3 and PC7 also corresponds with the PC loadings plot in Fig. 5 as these are spatially located and apparent in the South Pacific region close to Fiji.

4.3 LSTM model performance

For ease of understanding, the multivariate approach with neighbouring spatial points will be referred to as multivariate-standard. The multivariate approach with SST, will be referred to as multivariate-SST, hereafter.

In Table 3, the average RMSE, MSE, and R² values have been reported and summarised for the 30 independent experiments across 22 grid locations. The metrics are provided for the overall train and test sets where the LSTM model is used in all the approaches for three prediction horizons (three step-ahead predictions). The results show that the mean RMSE for the train set averaged across all the grid locations; i.e. 0.49 for multivariate SST, 0.54 for multivariate-standard, and 0.56 for univariate approach. Furthermore, the results also show the mean RMSE for the test set, averaged across the grid locations; i.e. 0.584 for the standard multivariate approach, 0.60 for the multivariate-SST, and 0.588 for the univariate approach. It can be noted that although the error value is low for multivariate-SST in the training stage, the error increases in the testing stage. This is also applicable to the mean square error values, which were 0.25 for the multivariate-SST approach during the training stage and then increased to 0.37 during the testing stage. The highest R² value recorded in the test set for the first-time horizon is 0.66 for the multivariate-standard. It is also evident that the R² value is the highest for the first-time horizon prediction and then decreases for the third time horizon for all three approaches. Overall, the multivariate-standard has the lowest error and highest R² values in the test stage.

Table 3 Performance metrics: RMSE, MSE and R² values, together with the standard deviation (STD) reported for the training set and testing set for the three different approaches (UV, MV, MV-SST) used for model development

Full size table

4.4 Prediction visualisation

We next visualise the forecast from trained LSTM models for the period covered by the test dataset. The predictions are only shown for the multivariate-standard and the first-time horizon as it rendered the best results (Fig. 7). The plot is shown only for 6 spatial locations in Fiji. We plot the observed and the predicted (mean). In addition, the lower and the upper percentile has been plotted using standard deviation within 95% confidence interval to show uncertainty quantification. From the plots, we observe that the predictions closely follow the observed patterns in the actual test set. There are instances of slight under-prediction and over-prediction for some of the grid-cell. The model performs well to predict the extremes, with the exception of a few where the forecast is within the 0 range. It is also worth pointing out that the uncertainty fluctuation between the lower and the upper percentile is low.

4.5 Verification of categorical forecast

For an operational early warning system, the EDI time series needs to be converted to a form that the public can interpret and use. Therefore, we convert the EDI series into the respective categories and quantify the scores for the predicted and the observed EDI categories (Table 4). The classification report is only shown for the multivariate-standard as it renders the best results. The average precision, recall, F1-score, and accuracy scores are 0.73, 0.75, 0.73, and 0.75, respectively. The report also shows that selected grid locations have an accuracy of approximately 85%, while the lowest accuracy is around 65%.

Table 4 Classification report for predicted and observed EDI categories for the multivariate-standard approach

Full size table

The confusion matrix plot has only been shown for the multivariate-standard which rendered better results (Fig. 8). In the confusion matrix, we observe that the most prevalent categories are Normal, Moderately Wet, Very Wet, Extremely Wet. An extremely wet event may coincide with a flooding episode in many cases. There are a few instances of under-prediction, Grid 16 and Grid 17, respectively, whereby category 6 (Extremely Wet) has been predicted as Very Wet. The misclassification of category 5 (Moderately Wet) is higher, with a few grid locations indicated as ‘Very Wet’. However, it is essential to note that all the events have been correctly predicted on a diverging range, where the midpoint is classified as the ‘normal’ category which then branches out to wet and dry events.

4.6 Spatial plots

The spatial plots have been generated for two time periods reflecting the wet and the dry season in Fiji to evaluate model performance in each season. The spatial plot contains the observed, predicted, and the residuals for the three different model development approaches (Fig. 9). The colour ramp is indicative of the EDI values. The spatial plot corresponds to Fiji’s peak dry season months. We observe that the multivariate-standard has a better prediction than the multivariate-SST and the univariate approach. The other observation is that even though it is peak dry season, most EDI values range between -0.5 and 0.75, which is still considered to be the ‘normal’ category.

Figure 10 shows a spatial plot for the second time period, which corresponds to the peak wet season in Fiji. Most of the actual EDI values for the peak wet season range between 0 and 2.5. These values are depictive of extreme wet conditions. Ideally, we aim to predict extreme events; therefore, we chose a window slice for visualisation which shows the extreme events. From the spatial plot, it appears that the multivariate-SST is better able to predict extreme wet events. The univariate approach does not show good performance in predicting extreme wet events. The residual plot shows the difference between the actual and the predicted values, where the lightest residual colour indicates low error values. All three approaches give predictions with high accuracy for the ‘normal’ category.

5 Discussion

According to our study, the performance metrics showed that the lowest MSE (~ 0.2) is achieved for the plain multivariate approach. While the RMSE and MSE values may vary for different studies due to the usage of different datasets; however, it can provide a baseline threshold for result comparison. The error values achieved in our study is consistent with other studies; for instance, a study used multi-source remote sensing data to develop a deep learning-based drought model and reported RMSE values from 0.295 to 0.357, coefficient of correlation values between 0.772 and 0.910, and an accuracy of 80% (Shen et al. 2019). Similarly, another study explored the internet of things (IoT) and PCA in real-time to classify extreme dry events, achieving an accuracy of 95% (Kaur and Sood 2020). This agrees with our study, whereby the accuracy ranged from 69 to 90%.

Furthermore, the results can be influenced in the data processing and reconstruction phase. For instance, we used a fixed sliding window of size three where the respective LSTM model used information from three data points in the past to generate forecasts for three-prediction horizons in future as the model iterated through the entire training and test set. Zan et al. (2020) and Baig et al. (2020) tested two approaches for identifying the optimum sliding window, that is using a fixed or an adaptive window size, and concluded that the latter produces better forecast. Similarly, Chandra et al. (2018), presented a neural network method based on multi-task learning that dynamically selected the window size and applied it for cyclone wind-intensity forecasting with the motivation to make a prediction given limited information. The window size determines the amount of data available from which the model learns and uses for forecasting. Although not within the scope of our study, this warrants further investigation to understand the effect of the sliding window size on model performance. This can also provide critical information on climate features influencing the precipitation patterns; for instance, a 20-month window will show effects of features such as ENSO, while a 10-year window will show effects of elements that occur at decadal scales.

The results show that a better prediction of extreme wet events is obtained when using the principal components of SST data. Using linear combinations of patterns in the data can be successfully paired with deep learning models to generate better forecasts. The principal components may also indicate certain climate phenomena like ENSO or Madden Julian oscillation. However, it is essential to consider that these climatic phenomena have a certain irregularity attached and are not prevalent every year or every season. In such cases, other phenomena may influence or trigger the precipitation activity in the Pacific (Landman and Mason 1999). For instance, precipitation can also be affected by other topographical features such as orography (Ntale et al. 2003; Lal et al. 2008). Our study showed that the multivariate-SST model performed better during the summer months. This is because sea surface temperature induced SPCZ movement is the dominant climate feature during the summer season. In contrast, the multivariate-standard approach showed better performance in the drier months. This may be because the orographic effects are more predominant in drier months than summer, and can generate localised weather events instead of ocean-atmospheric influenced weather events prevalent in winter. A plausible reason for this is that in the dry season the SST fluctuations results in reduced SPCZ activity, which results in the Westward movement of SPCZ near the equatorial region (Griffiths et al. 2003; McGree et al. 2014).

Additionally, we note from this study that the extreme dry events were not well captured based on the effective drought index classification scale. It is essential to note the importance of class classification, which forms an integral component in long term early warning systems. Empirical studies on Fiji and the Pacific have shown more severe dry periods in Fiji (McGree et al. 2019; Anshuka et al. 2021); however, this is not captured well by the EDI, where the lower range which falls within 0 to − 1 still indicates a ‘normal’ category. With the current classification, some crucial aspects of drought, such as duration, intensity, and initiation, may be overlooked. To ensure that the drought conditions are correct, we recommend adjusting the scale by a value of 0.5. At the same time, it is essential to note that values larger than 2 will correspond to flooding events.

A number of studies have shown that as the forecast horizon increases, so does the corresponding error (Aghelpour et al. 2019; Ashrafzadeh et al. 2020). The extreme events are rare occurrences, which can also results in a class imbalance problem in the case of classification problem using deep learning methods (Johnson and Khoshgoftaar 2019). This becomes challenging for the model to learn and predict extreme events. The challenge is even more significant with the unpredictable climate, which introduces more noise in the data making it harder for the model to learn patterns of these low-frequency extreme events. It has been noted that “the only practical indication of the likely margin of future error is provided by the past forecast errors” (Knüppel 2018). Although not within the scope of this study, future studies can compute uncertainty analysis with more advanced methods such as Monte Carlo simulation and Bayesian inference (Chandra et al. 2019), particularly for longer-term time horizons. Additionally, uncertainty quantification for a predicted outcome can allow authorities to make more informed decision (Wang et al. 2019).

6 Conclusion

In this study, we presented a spatiotemporal deep learning-based framework for hydrological extreme forecasting. We found that the hybrid approach in the framework that employs a multivariate approach with neighbouring spatial points and principal components of the SST, generates better forecasts when compared to their counterparts. Notably, the framework incorporates deep learning-based approach to understand meaningful information in different spatial and temporal domains. By establishing the framework for the multivariate approach in two forms, it is evident that the model accuracy is contingent on understanding the dominant feature which influences precipitation regimes. In extreme dry events, the plain multivariate approach has good performance that features neighbouring spatial points. However, the multivariate approach with SST is a better predictor for extreme wet events. As part of future studies, other variables such as sea level pressure, relative humidity and geopotential height can be used as predictors for hydrological extremes. The current system can be coupled with the local forecasting system and be fully automated to provide climate outlooks. Finally, the framework can be used with other LSTM network and machine learning based models and applied to other regions.

Data availability

https://github.com/sydney-machine-learning/climateextremes_deeplearning

References

Abadi M et al (2016) Tensorflow: A system for large-scale machine learning. In: 12th {USENIX} symposium on operating systems design and implementation ({OSDI} 16), pp 265–283
Agana NA, Homaifar A (2017) A deep learning based approach for long-term drought prediction. In: SoutheastCon 2017. IEEE, pp 1–8
Aghelpour P, Mohammadi B, Biazar SM (2019) Long-term monthly average temperature forecasting in some climate types of Iran, using the models SARIMA. SVR, and SVR-FA Theor Appl Climatol 138:1471–1480. https://doi.org/10.1007/s00704-019-02905-w
Article Google Scholar
Albawi S, Mohammed TA, Al-Zawi S (2017) Understanding of a convolutional neural network. In: 2017 International Conference on Engineering and Technology (ICET). IEEE, pp 1–6
Anshuka (2019) Forecasting drought indices using machine learning and spatial modelling approaches: an application in Fiji University of Sydney eScholarship. http://hdl.handle.net/2123/20582
Anshuka A, Alexander JVB, Floris van O (2022) Application of Multivariate Techniques for Spatial Drought Modelling using Satellite Rainfall Estimate in Fiji Research Square. https://doi.org/10.21203/rs.3.rs-1255720/v1
Anshuka A, Buzacott AJV, Vervoort RW, van Ogtrop FF (2020) Developing drought index–based forecasts for tropical climates using wavelet neural network: an application in Fiji Theoretical and Applied Climatology 143:557–569 doi:https://doi.org/10.1007/s00704-020-03446-3
Anshuka A, Ogtrop F, Vervoort W (2018) Drought modelling in small island developing states: a case study in Fiji. EGU General Assembly: https://meetingorganizer.copernicus.org/EGU2018/EGU2018-3251.pdf
Anshuka A, van Ogtrop FF, Sanderson D, Thomas E, Neef A (2021) Vulnerabilities shape risk perception and influence adaptive strategies to hydro-meteorological hazards: a case study of Indo-Fijian farming communities. Int J Disaster Risk Reduct 62:102401. https://doi.org/10.1016/j.ijdrr.2021.102401
Article Google Scholar
Anshuka A, van Ogtrop FF, Willem Vervoort R (2019) Drought forecasting through statistical models using standardised precipitation index: a systematic review and meta-regression analysis. Nat Hazards 97:955–977. https://doi.org/10.1007/s11069-019-03665-6
Article Google Scholar
Ashrafzadeh A, Kişi O, Aghelpour P, Biazar SM, Masouleh MA (2020) Comparative study of time series models, support vector machines, and GMDH in forecasting long-term evapotranspiration rates in northern Iran. J Irrigat Drainage Eng 146:04020010
Google Scholar
Bai Y, Zeng B, Li C, Zhang J (2019) An ensemble long short-term memory neural network for hourly PM2.5 concentration forecasting. Chemosphere 222:286. https://doi.org/10.1016/j.chemosphere.2019.01.121
CAS Google Scholar
Baig S-U-R, Iqbal W, Berral JL, Carrera D (2020) Adaptive sliding windows for improved estimation of data center resource utilization. Future Generat Comput Syst 104:212–224
Article Google Scholar
Barnett T, Preisendorfer R (1987) Origins and levels of monthly and seasonal forecast skill for United States surface air temperatures determined by canonical correlation analysis. Mon Weather Rev 115:1825–1850
Google Scholar
Bengio Y, Simard P, Frasconi P (1994) Learning long-term dependencies with gradient descent is difficult. IEEE Trans Neural Netw 5:157–166. https://doi.org/10.1109/72.279181
Article CAS Google Scholar
Byun H-R, Wilhite DA (1999) Objective quantification of drought severity and duration. J Clim 12:2747–2756
Google Scholar
Chandra R, Deo R, Omlin CW (2016) An architecture for encoding two-dimensional cyclone track prediction problem in coevolutionary recurrent neural networks. In: 2016 International Joint Conference on Neural Networks (IJCNN), 24–29, pp 4865–4872. https://doi.org/10.1109/IJCNN.2016.7727839
Chandra R, Goyal S, Gupta R (2021) Evaluation of deep learning models for multi-step ahead time series prediction IEEE. Access 9:83105–83123. https://doi.org/10.1109/ACCESS.2021.3085085
Article Google Scholar
Chandra R, Jain K, Deo RV, Cripps S (2019) Langevin-gradient parallel tempering for Bayesian neural learning. Neurocomputing 359:315–326
Google Scholar
Chandra R, Ong Y-S, Goh C-K (2018) Co-evolutionary multi-task learning for dynamic time series prediction. Appl Soft Comput 70:576–589
Google Scholar
Cho K, Van Merriënboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y (2014) Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:14061078
Deo RC, Adamowski JF, Begum K, Salcedo-Sanz S, Kim D-W, Dayal KS, Byun H-R (2019) Quantifying flood events in Bangladesh with a daily-step flood monitoring index based on the concept of daily effective precipitation. Theoret Appl Climatol 137:1201–1215. https://doi.org/10.1007/s00704-018-2657-4
Article Google Scholar
Dikshit A, Pradhan B, Huete A (2021) An improved SPEI drought forecasting approach using the long short-term memory neural network. J Environ Manag 283:111979
Google Scholar
Elman JL (1990) Finding structure in time. Cognit Sci 14:179–211. https://doi.org/10.1207/s15516709cog1402_1
Google Scholar
Fang Z, Wang Y, Peng L, Hong H (2021) Predicting flood susceptibility using LSTM neural networks. J Hydrol 594:125734. https://doi.org/10.1016/j.jhydrol.2020.125734
Article Google Scholar
Gaitán CF (2020) Machine learning applications for agricultural impacts under extreme events. In: Climate extremes and their implications for impact and risk assessment. Elsevier, pp 119–138
Gardner MW, Dorling S (1998) Artificial neural networks (the multilayer perceptron)—a review of applications in the atmospheric sciences. Atmos Environ 32:2627–2636. https://doi.org/10.1016/S1352-2310(97)00447-0
CAS Google Scholar
Griffiths G, Salinger M, Leleu I (2003) Trends in extreme daily rainfall across the south pacific and relationship to the south pacific convergence zone. Int J Climatol 23:847–869. https://doi.org/10.1002/joc.923
Google Scholar
Grillakis MG (2019) Increase in severe and extreme soil moisture droughts for Europe under climate change. Sci Total Environ 660:1245–1255. https://doi.org/10.1016/j.scitotenv.2019.01.001
CAS Google Scholar
Gude V, Corns S, Long S (2020) Flood prediction and uncertainty estimation using deep learning. Water 12:884
Google Scholar
Ham Y-G, Kim J-H, Luo J-J (2019) Deep learning for multi-year ENSO forecasts. Nature 573:568–572. https://doi.org/10.1038/s41586-019-1559-7
CAS Google Scholar
Handmer J, Iveson H (2017) Cyclone Pam in Vanuatu: learning from the low death toll. Australian J Emerg Manag 32(2):60–65. https://doi.org/10.3316/ielapa.816029420012804
Google Scholar
Hochreiter S (1998) The vanishing gradient problem during learning recurrent neural nets and problem solutions. Int J Uncertain Fuzziness Knowledge-Based Syst 6:107–116. https://doi.org/10.1142/s0218488598000094
Google Scholar
Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9:1735–1780
CAS Google Scholar
Hu C, Wu Q, Li H, Jian S, Li N, Lou Z (2018) Deep learning with a long short-term memory networks approach for rainfall-runoff simulation. Water 10:1543
Google Scholar
Hu R, Fang F, Pain CC, Navon IM (2019) Rapid spatio-temporal flood prediction and uncertainty quantification using a deep learning method. J Hydrol 575:911–920. https://doi.org/10.1016/j.jhydrol.2019.05.087
Article Google Scholar
Huang Z, Xu W, Yu K (2015) Bidirectional LSTM-CRF models for sequence tagging. arXiv preprint arXiv:150801991
Hussain MM, Mahmud I (2019) pyMannKendall: a python package for non parametric Mann Kendall family of trend tests. J Open Sour Softw 4:1556
Google Scholar
Iese V et al (2021) Historical and future drought impacts in the Pacific islands and atolls. Clim Change 166:19. https://doi.org/10.1007/s10584-021-03112-1
Article Google Scholar
Janiesch C, Zschech P, Heinrich K (2021) Machine learning and deep learning. Electron Markets. https://doi.org/10.1007/s12525-021-00475-2
Article Google Scholar
Johnson JM, Khoshgoftaar TM (2019) Survey on deep learning with class imbalance. J Big Data 6:27. https://doi.org/10.1186/s40537-019-0192-5
Google Scholar
Kaplan A, Cane MA, Kushnir Y, Clement AC, Blumenthal MB, Rajagopalan B (1998) Analyses of global sea surface temperature 1856–1991. J Geophys Res: Oceans 103:18567–18589. https://doi.org/10.1029/97JC01736
Google Scholar
Kaur A, Sood SK (2020) Deep learning based drought assessment and prediction framework. Ecol Inform 57:101067. https://doi.org/10.1016/j.ecoinf.2020.101067
Google Scholar
Keijzer M, Babovic V (2000) Genetic programming, ensemble methods and the bias/variance tradeoff–introductory investigations. European Conference on Genetic Programming. Springer, pp 76–90
Google Scholar
Ketkar N (2017) Introduction to keras. In: Deep learning with Python. Springer, pp 97–111
Kingma DP, Ba J (2014) Adam: A method for stochastic optimization. arXiv preprint arXiv:14126980
Knüppel M (2018) Forecast-error-based estimation of forecast uncertainty when the horizon is increased. Int J Forecast 34:105–116. https://doi.org/10.1016/j.ijforecast.2017.08.006
Article Google Scholar
Lal M, McGregor JL, Nguyen KC (2008) Very high-resolution climate simulation over Fiji using a global variable-resolution model. Clim Dyn 30:293–305. https://doi.org/10.1007/s00382-007-0287-0
Google Scholar
Landman WA, Mason SJ (1999) Operational long-lead prediction of South African rainfall using canonical correlation analysis. Int J Climatol 19:1073–1090. https://doi.org/10.1002/(SICI)1097-0088(199908)19:10
Google Scholar
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521:436–444. https://doi.org/10.1038/nature14539
CAS Google Scholar
Lever J, Krzywinski M, Altman N (2017) Points of significance: principal component analysis. Nature Methods 14:641–643
CAS Google Scholar
Lin Q, Yang W, Zheng C, Lu K, Zheng Z, Wang J, Zhu J (2018) Deep-learning based approach for forecast of water quality in intensive shrimp ponds. Indian J Fish 65:75–80
Google Scholar
Liu D, Jiang W, Mu L, Wang S (2020) Streamflow prediction using deep learning neural network: case study of yangtze river. IEEE Access 8:90069–90086. https://doi.org/10.1109/ACCESS.2020.2993874
Article Google Scholar
Lorrey A, Dalu G, Renwick J, Diamond H, Gaetani M (2012) Reconstructing the South Pacific Convergence Zone position during the presatellite era: a La Niña case study. Mon Weather Rev 140:3653–3668
Google Scholar
Maity R, Khan MI, Sarkar S, Dutta R, Maity SS, Pal M, Chanda K (2021) Potential of deep learning in drought assessment by extracting information from hydrometeorological precursors. J Water Clim Change. https://doi.org/10.2166/wcc.2021.062
Article Google Scholar
Malik A, Kumar A, Kisi O, Khan N, Salih SQ, Yaseen ZM (2021) Analysis of dry and wet climate characteristics at Uttarakhand (India) using effective drought index. Nat Hazards 105:1643–1662
Google Scholar
Marsooli R, Lin N, Emanuel K, Feng K (2019) Climate change exacerbates hurricane flood hazards along US atlantic and gulf coasts in spatially varying patterns. Nat Commun 10:1–9
CAS Google Scholar
McAneney J, van den Honert R, Yeo SJIJoC (2017) Stationarity of major flood frequencies and heights on the Ba River, Fiji, over a 122‐year record 37:171–178
McGree S et al (2019) Recent changes in mean and extreme temperature and precipitation in the Western Pacific Islands 32:4919-4941
McGree S et al (2014) An updated assessment of trends and variability in total and extreme rainfall in the western Pacific. Int J Climatol 34:2775–2791
Google Scholar
Medsker L, Jain LC (1999) Recurrent neural networks: design and applications. CRC Press, Inc.
Google Scholar
Mikolov T, Joulin A, Chopra S, Mathieu M, Ranzato MA (2014) Learning longer memory in recurrent neural networks. arXiv preprint arXiv:14127753
Mohsen H, El-Dahshan E-SA, El-Horbaty E-SM, Salem A-BM (2018) Classification using deep learning neural networks for brain tumors. Future Comput Informat J 3:68–71
Google Scholar
Moishin M, Deo RC, Prasad R, Raj N, Abdulla S (2020) Development of flood monitoring index for daily flood risk evaluation: case studies in Fiji Stochastic. Environ Res Risk Assess 1–16
Moishin M, Deo RC, Prasad R, Raj N, Abdulla S (2021) Designing deep-based learning flood forecast model with ConvLSTM hybrid algorithm. IEEE Access 9:50982–50993. https://doi.org/10.1109/ACCESS.2021.3065939
Article Google Scholar
Neal B, Mittal S, Baratin A, Tantia V, Scicluna M, Lacoste-Julien S, Mitliagkas I (2018) A modern take on the bias-variance tradeoff in neural networks arXiv preprint arXiv:181008591
Neall VE, Trewick SA (2008) The age and origin of the Pacific islands: a geological overview. Philos Trans Royal Soc b: Biol Sci 363:3293–3308
Google Scholar
Noakes L (1991) The Takens embedding theorem. Int J Bifurcat Chaos 1:867–872
Google Scholar
Ntale HK, Gan TY, Mwale D (2003) Prediction of East African seasonal rainfall using simplex canonical correlation analysis. J Clim 16:2105–2112
Google Scholar
Oja E (1989) Neural networks, principal components, and subspaces. Int J Neural Syst 1:61–68
Google Scholar
Pauli N et al (2021) “Listening to the sounds of the water”: bringing together local knowledge and biophysical data to understand climate-related hazard dynamics. Int J Disaster Risk Sci 12:326–340. https://doi.org/10.1007/s13753-021-00336-8
Article Google Scholar
Pearson K, Filon LNG (1898) Mathematical contributions to the theory of evolution. IV. On the probable errors of frequency constants and on the influence of random selection on variation and correlation. Proc Royal Soc London 62:173–176. https://doi.org/10.1098/rspl.1897.0091
Article Google Scholar
Poornima S, Pushpalatha M (2019) Drought prediction based on SPI and SPEI with varying timescales using LSTM recurrent neural network. Soft Comput 23:8399–8412. https://doi.org/10.1007/s00500-019-04120-1
Article Google Scholar
Reichstein M, Camps-Valls G, Stevens B, Jung M, Denzler J, Carvalhais N, Prabhat (2019) Deep learning and process understanding for data-driven earth system. Sci Nat 566:195–204. https://doi.org/10.1038/s41586-019-0912-1
Article CAS Google Scholar
Ren H, Cromwell E, Kravitz B, Chen X (2019) Using deep learning to fill spatio-temporal data gaps in hydrological monitoring networks. Hydrol Earth Syst Sci Discussions 1–20
Roshan S, Srivathsan G, Deepak K, Chandrakala S (2020) Violence detection in automated video surveillance: recent trends and comparative studies the cognitive approach in cloud computing and internet of things technologies for surveillance tracking systems:157–171
Sahoo BB, Jha R, Singh A, Kumar D (2019) Long short-term memory (LSTM) recurrent neural network for low-flow hydrological time series forecasting. Acta Geophys 67:1471–1481. https://doi.org/10.1007/s11600-019-00330-1
Article Google Scholar
Salinger M, Renwick J, Mullan A (2001) Interdecadal Pacific oscillation and south Pacific climate. Int J Climatol 21:1705–1721
Google Scholar
Sammen SS, Ehteram M, Abba S, Abdulkadir R, Ahmed AN, El-Shafie A (2021) A new soft computing model for daily streamflow forecasting Stochastic. Environ Res Risk Assess 1–13
Shen R, Huang A, Li B, Guo J (2019) Construction of a drought monitoring model using deep learning based on multi-source remote sensing data. Int J Appl Earth Obs Geoinf 79:48–57
Google Scholar
Shen S et al (2018) Spatial distribution patterns of global natural disasters based on biclustering. Nat Hazards 92:1809–1820. https://doi.org/10.1007/s11069-018-3279-y
Article Google Scholar
Shi X, Huang S, Huang Q, Lei X, Li J, Li P, Yang M (2019) Deep-learning-based wind speed forecasting considering spatial–temporal correlations with adjacent wind turbines. J Coast Res 93:623–632
Google Scholar
Shi Z, Kim T-K (2017) Learning and refining of privileged information-based RNNs for action recognition from depth sequences. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 3461–3470
Shirzadi A et al (2018) Novel GIS based machine learning algorithms for shallow landslide susceptibility mapping. Sensors 18(11):3777. https://doi.org/10.3390/s18113777
Article Google Scholar
Strauch AM, MacKenzie RA, Giardina CP, Bruland GL (2015) Climate driven changes to rainfall and streamflow patterns in a model tropical island hydrological system. J Hydrol 523:160–169
Google Scholar
Sun G, Zhang C, Woodland PC (2021) Transformer language models with lstm-based cross-utterance information representation. In: ICASSP 2021–2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, pp 7363–7367
Sun Q, Miao C, Duan Q, Ashouri H, Sorooshian S, Hsu KL (2018) A review of global precipitation data sets: data sources, estimation, and intercomparisons. Rev Geophys 56:79–107
Google Scholar
Terry J (2005) Hazard warning! Hydrological responses in the Fiji Islands to climate variability and severe meteorological events. In: Proceedings of 7th IAHS scientific assembly. IAHS Publication, vol 296, pp 33–40
Thomas V, López RJADBEWPS (2015) Global increase in climate-related disasters. ADB economics working paper series 466. https://www.adb.org/sites/default/files/publication/176899/ewp-466.pdf
Umar Ibrahim A, Ozsoz M, Serte S, Al-Turjman F, Habeeb Kolapo S (2021) Convolutional neural network for diagnosis of viral pneumonia and COVID-19 alike diseases. Expert Systems. https://doi.org/10.1111/exsy.12705
Article Google Scholar
Varo J, Sekac T, Jana S (2020) Flood Hazard Micro Zonation from a Geomatic Perspective on Vitilevu Island, Fiji. Int J Geoinform 16
Verma S, Bhatla R, Ghosh S, Sinha P, Kumar Mall R, Pant M (2021) Spatio‐temporal variability of summer monsoon surface air temperature over India and its regions using Regional Climate Model. Int J Climatol
Vincent DG (1994) The South Pacific convergence zone (SPCZ): a review. Monthly Weather Rev 122:1949–1970
Google Scholar
Wang B, Lu J, Yan Z, Luo H, Li T, Zheng Y, Zhang G (2019) Deep uncertainty quantification: a machine learning approach for weather forecasting. In: Paper presented at the proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining, Anchorage, AK, USA
Wang F et al (2020) Re-evaluation of the power of the mann-kendall test for detecting monotonic trends in hydrometeorological time series. Front Earth Sci. https://doi.org/10.3389/feart.2020.00014
Article Google Scholar
Wang Y, Huang M, Zhu X, Zhao L (2016) Attention-based LSTM for aspect-level sentiment classification. In: Proceedings of the 2016 conference on empirical methods in natural language processing, pp 606–615
Wilcox E, Levy R, Morita T, Futrell R (2018) What do RNN Language models learn about filler-gap dependencies? arXiv preprint. arXiv:180900042
Wilks DS (2011) Statistical methods in the atmospheric sciences, vol 100. Academic press
Wu X et al (2021) The development of a hybrid wavelet-ARIMA-LSTM model for precipitation amounts and drought analysis. Atmosphere 12:74
Google Scholar
Xu L, Mo KC (2020) A preliminary study of deep learning based drought forecast climate prediction S&T digest. In: Science and technology infusion climate bulletin NOAA’s national weather service. 44th NOAA annual climate diagnostics and prediction workshop
Zan T, Su Z, Liu Z, Chen D, Wang M, Gao X (2020) Pattern recognition of different window size control charts based on convolutional neural network and information fusion. Symmetry 12:1472
Google Scholar
Zschau J, Küppers AN (2013) Early warning systems for natural disaster reduction. Springer Science & Business Media

Download references

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions. No funding was received to conduct this study.

Author information

Authors and Affiliations

School of Built Environment, Faculty of Arts, Design and Architecture, University of New South Wales, Sydney, Australia
Anshuka Anshuka & David Sanderson
Transitional Artificial Intelligence Research Group, School of Mathematics and Statistics, University of New South Wales, Sydney, Australia
Rohitash Chandra
School of Life and Environmental Science, Faculty of Science, University of Sydney, Sydney, Australia
Alexander J. V. Buzacott & Floris F. van Ogtrop

Authors

Anshuka Anshuka
View author publications
You can also search for this author in PubMed Google Scholar
Rohitash Chandra
View author publications
You can also search for this author in PubMed Google Scholar
Alexander J. V. Buzacott
View author publications
You can also search for this author in PubMed Google Scholar
David Sanderson
View author publications
You can also search for this author in PubMed Google Scholar
Floris F. van Ogtrop
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anshuka Anshuka.

Ethics declarations

Conflict of interest

The authors declare no known competing financial interests, professional relationships or personal relationships that could have appeared to influence the work reported in this paper.

Ethical approval

We consciously assure that the manuscript is the authors' own original work, which has not been previously published elsewhere. The paper reflects the authors' own research and analysis in a truthful and complete manner. We would also like to assure that no animal experimentation or data involving human subject was used in this research.

Informed consent

All authors have consented to participate and publish the manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Anshuka, A., Chandra, R., Buzacott, A.J.V. et al. Spatio temporal hydrological extreme forecasting framework using LSTM deep learning model. Stoch Environ Res Risk Assess 36, 3467–3485 (2022). https://doi.org/10.1007/s00477-022-02204-3

Download citation

Accepted: 03 March 2022
Published: 31 March 2022
Issue Date: October 2022
DOI: https://doi.org/10.1007/s00477-022-02204-3

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Spatio temporal hydrological extreme forecasting framework using LSTM deep learning model

Abstract

Similar content being viewed by others

Improving flood forecasting using time-distributed CNN-LSTM model: a time-distributed spatiotemporal method

Going Beyond Traditional Methods: Using LSTM Networks to Predict Rainfall in Kerala

Analysis and Forecasting of Temporal Rainfall Variability Over Hundred Indian Cities Using Deep Learning Approaches

1 Introduction

2 Related works deep learning for hydrological extremes

3 Materials and methods

3.1 Study region

3.2 Data

3.2.1 Precipitation data

3.2.2 Sea surface temperature data

3.3 Index calculation

3.4 Framework for spatio temporal prediction

3.4.1 LSTM model

3.4.2 Data pre-processing

3.4.3 Spatio temporal framework

3.4.3.1 Univariate approach

3.4.3.2 Multivariate approach

3.4.3.3 Multivariate approach with sea-surface temperature (SST-PCA)

3.4.4 Technical setup

3.5 Performance metrics

4 Results

4.1 PCA results

4.2 Feature correlations

4.3 LSTM model performance

4.4 Prediction visualisation

4.5 Verification of categorical forecast

4.6 Spatial plots

5 Discussion

6 Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation