Introduction

Water, renewable yet finite, is vital for all life on Earth. Sustained, rapid population and economic growth add to the challenge of maintaining water availability, raising concerns about drawing on reservoir water at unsustainable rates. Assessing and anticipating changes in water availability has therefore become increasingly important in recent years, driven by population growth, industrialization, and urbanization, particularly under extreme conditions (Osman et al. 2022). Water shortage is among the most pressing problems in predominantly semiarid regions, and it is influenced by several factors, one of which is accessibility (Kordani et al. 2022). Streamflow (SF), a key component of water transport from the land to waterbodies, is central to water resource management. Accurate and reliable SF forecasting supports timely and efficient water resource management (Fathian et al. 2019), irrigation management decisions (Tongal and Booij 2018), flood risk assessment (Shafizadeh-Moghadam et al. 2018), and the planning of reservoir releases (Sapitang et al. 2020). Knowing the expected quantity and timing of streamflow in advance is therefore essential.

Nonetheless, streamflow characteristics such as nonlinearity, stochasticity, and non-stationarity impair the effectiveness of contemporary hydrological models (Chabokpour et al. 2020). Given the complexity of modeling SF in both the spatial and temporal domains, a physically based model may oversimplify or misrepresent the actual SF process when the underlying pattern is not thoroughly understood (Chong et al. 2020; Martina et al. 2011). Such a problem was highlighted by Staudinger et al. (2011), whose hydrological model simulated minimum SF poorly. Pellicer‐Martínez et al. (2015) noted that hydrological modeling is often conducted without considering groundwater and that models are frequently designed for maximum SF conditions. Their application under low-streamflow conditions therefore depends on how well the represented hydrological processes remain relevant (Nicolle et al. 2014). That said, with proper calibration, a physically based model can yield reliable predictions (Samarasinghe et al. 2022). Another technical difficulty is real-time forecasting and the resources required for continuous monitoring: although most reservoirs are equipped to monitor river streamflow, the collected data are rarely assembled into a consistent database. Machine learning techniques, which can perform the same task with fewer computational resources, have therefore become a focus of research.

Data-driven modeling has gained popularity in hydrology and water resources over the past 20 years, principally owing to its capacity to extract nonlinearity, stochasticity, and patterns from historical river flow data. For instance, Azamathulla et al. (2010) adopted the support vector machine (SVM) to predict sediment transport, and other studies have applied machine learning to related water problems such as rainfall prediction and side weir discharge prediction (Azamathulla et al. 2016; Chaplot 2021). Rashki Ghaleh Nou et al. (2019) showed that traditional ANN and SVM are inadequate for estimating scour depth near submerged weirs and instead used a self-adaptive extreme learning machine (SAELM), which outperformed the classical machine learning models. Emadi et al. (2022) demonstrated that machine learning models with an integrated wavelet component can perform efficiently under water scarcity, with climatic class, river morphology, hydrological factors, and land-use characteristics all playing a significant role in model accuracy. Although these studies have garnered widespread attention in the hydrological community, the limitations of AI-based models remain universal and an active topic of debate; they include over-fitting, poor generalization, and slow or premature local convergence (Nawi et al. 2017).

ANN, for example, is a traditional AI approach that relies on gradient-based optimizers and is subject to these drawbacks; the complexity of the time series and the limited search capability of the training method are particular concerns. Researchers have instead turned to meta-heuristic algorithms, nature-inspired optimization strategies, to solve such combinatorial optimization problems. Meta-heuristics are primarily employed to generate near-optimal solutions: they cannot guarantee global optimality, but their properties allow them to be used independently or in conjunction with conventional approaches across a wide range of problems (Chong et al. 2021; Pei et al. 2019). Despite its mathematical shortcomings and high computational cost, the genetic algorithm (GA), for example, can be readily adapted to tackle complicated problems (Beg and Islam 2016). The cuckoo optimization algorithm (COA) converges well globally, balances local and global search, and employs Lévy flights for global exploration (He et al. 2018), but it can suffer from a low convergence rate and a tendency to become trapped in local optima (Qu and He 2015).

Yang (2010) developed the bat algorithm (BA), a fast, adaptable, and user-friendly optimizer well suited to real-time challenges such as reservoir systems, as it can move quickly from the exploration to the exploitation phase. BA combines the advantages of modern optimization algorithms with a more realistic formulation to deliver better outcomes. To date, only a few researchers have used BA in forecasting or prediction tasks. Hybrid neural network and BA models have been used to forecast stock prices (Golmaryami et al. 2015; Shahvaroughi Farahani and Razavi Hajiagha 2021). Banadkooki et al. (2020) used BA to optimize the weights and structure of an ANN for forecasting suspended sediment load time series. In a separate study, Xing et al. (2016) used BA to determine the parameters of an SVM for monthly SF prediction at the Yichang station on the Yangtze River in China and reported that the BA-based SVM outperformed the ANN and cross-validation-based SVM models in accuracy. Nevertheless, applications of BA to river SF forecasting remain relatively limited.

The main contribution of this work is a hybrid approach for forecasting upstream river SF in Malaysia. Various datasets were obtained from credible Malaysian authorities, and comprehensive statistical analysis and data pre-processing were performed throughout model development. The performance of the ANN method and its hybridization with a bio-inspired optimizer, introduced to supplement the standard ANN, was validated using five selected study areas. Additional model validation employed advanced analyses such as the Taylor diagram, violin plot, relative error, and scatter plot to examine the models' errors. A multi-time scale streamflow forecasting analysis was also carried out to forecast future SF in all study areas from 2021 to 2025, a period of five years (60 months).

The proposed hybrid hydrological model

Artificial neural network (ANN)

As the name implies, an ANN is built on the structural components of the neural networks inherent in biological organisms, translated into an analogous computing system. Its training is a heuristic search: a calculus-based (gradient) procedure that takes numerous approximations to reach a solution that is adequate rather than provably optimal. The basic architecture of an ANN comprises three layers: the input layer, the hidden layer, and the output layer. The capacity of ANN to predict both short-term and long-term behavior that is often too complicated for standard modeling approaches has led to its widespread application. Furthermore, ANN tolerates noisy data, performs quickly, and generalizes well when learning from historical data. Backpropagation (BP) was employed to train the model. Figure 1 depicts the flowchart of the adopted ANN model.

Fig. 1
figure 1

The flow of the standard ANN model

The input and hidden layers are composed of nodes, each associated with its own weights. The data are routed through the hidden layers, where each node applies an activation function (the number of such components depends on the number of hidden nodes). Forward propagation through the network is computed using Eqs. (1) and (2).

$${Y}_{j}=\sum_{i}^{n}{X}_{i}{W}_{\mathrm{ij}}+b$$
(1)
$${O}_{j}=f\left({Y}_{j}\right)$$
(2)

where \({Y}_{j}\) = the hidden/final value derived from the summation of the input data and its associated weights plus the bias \(b\), \({X}_{i}\) = the input (or previous-layer) value, and \({W}_{\mathrm{ij}}\) = the weight connecting the input/previous layer to node \(j\). \({O}_{j}\) is the output of the activation function \(f\), a vital component that introduces nonlinearity into the ANN model. Once the information reaches the output layer, the error between the predicted and observed values is computed and propagated backward using Eqs. (3)–(5).

$$\frac{\delta E}{\delta {O}_{j}}=-\left({Y}_{\mathrm{obs}}-{O}_{j}\right)$$
(3)
$$\frac{\delta E}{\delta {Y}_{j}}=\frac{\delta E}{\delta {O}_{j}}\,{{O}_{j}}^{\prime}$$
(4)
$$\frac{\delta E}{\delta {W}_{\mathrm{ij}}}=\frac{\delta E}{\delta {Y}_{j}}\,{X}_{i}$$
(5)

where \(E=\tfrac{1}{2}\left({Y}_{\mathrm{obs}}-{O}_{j}\right)^{2}\) = the computed error between the observed value \({Y}_{\mathrm{obs}}\) and the predicted output \({O}_{j}\), and \({{O}_{j}}^{\prime}\) = the derivative of the activation function. The weights are then updated by gradient descent:

$${W}_{\mathrm{ij}}={W}_{\mathrm{ij}}-\eta \frac{\delta E}{\delta {W}_{\mathrm{ij}}}$$
(6)

where \(\eta\) = the learning rate.
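For readers who prefer code, the following minimal NumPy sketch illustrates Eqs. (1)–(6) for a single hidden layer. The layer sizes, the tanh activation, the mean-squared-error loss, and the learning rate are illustrative assumptions rather than the exact configuration used in this study.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_params(n_in, n_hidden, n_out=1):
    """Random initial weights and zero biases for one hidden layer."""
    return {
        "W1": rng.normal(0.0, 0.1, (n_in, n_hidden)), "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.1, (n_hidden, n_out)), "b2": np.zeros(n_out),
    }

def forward(p, X):
    """Forward pass: Eq. (1) weighted sums, Eq. (2) activation."""
    Y1 = X @ p["W1"] + p["b1"]      # Eq. (1): weighted sum of inputs plus bias
    O1 = np.tanh(Y1)                # Eq. (2): activation output of the hidden layer
    Y2 = O1 @ p["W2"] + p["b2"]     # linear output node
    return Y1, O1, Y2

def backprop_step(p, X, y_obs, lr=0.01):
    """One gradient-descent update of all weights (Eqs. (3)-(6)).
    X has shape (m, n_in); y_obs has shape (m, 1)."""
    Y1, O1, Y2 = forward(p, X)
    m = X.shape[0]
    dE_dY2 = (Y2 - y_obs) / m               # Eq. (3): error gradient at the (linear) output
    dE_dO1 = dE_dY2 @ p["W2"].T             # propagate the error back to the hidden outputs
    dE_dY1 = dE_dO1 * (1.0 - O1 ** 2)       # Eq. (4): multiply by the tanh derivative O'
    # Eq. (5): gradients w.r.t. the weights; Eq. (6): gradient-descent update
    p["W2"] -= lr * (O1.T @ dE_dY2); p["b2"] -= lr * dE_dY2.sum(axis=0)
    p["W1"] -= lr * (X.T @ dE_dY1);  p["b1"] -= lr * dE_dY1.sum(axis=0)
    return float(np.mean((Y2 - y_obs) ** 2))   # mean-squared training error
```

Training then consists of calling backprop_step repeatedly over the training epochs until the error stabilizes.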

Evolutionary artificial intelligence model (BA-ANN)

Yang (2010) developed BA using the echolocation behavior that bats employ to locate prey as the foundation of the algorithm. BA mimics microbat echolocation, which regulates pulse emission rate and loudness while recognizing prey and avoiding obstacles in the absence of light; information is extracted from the sound waves reflected by prey and obstacles. In BA, only the essential component of echolocation is retained, while the time delays and loudness fluctuations of real echolocation have no direct bearing on the optimization. Figure 2 depicts the flow of the hybrid BA-ANN model used in this investigation. BA adjusts the signal bandwidth randomly via harmonics during the search. Theoretically, the algorithm assumes that bats fly with velocity (\(p_i\)) from an initial position (\(s_i\)) at a minimum frequency (\(f_{\min}\)), with fluctuating wavelength (\(\lambda\)) and loudness (\(L_0\)), to seek out prey; they are able to regulate both the frequency (\(f_i\)) and the pulse emission rate [\(r \in (0,1)\)].
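The sketch below gives a simplified illustration of how BA could search the flattened ANN weight vector: each bat position is a candidate weight set, and the objective (assumed here to be the training RMSE of the ANN given those weights) is minimized. The parameter values, bounds, local-search step size, and the choice to hold loudness and pulse rate fixed are assumptions for illustration, not the exact BA-ANN implementation of this study.

```python
import numpy as np

rng = np.random.default_rng(1)

def bat_optimize(objective, dim, pop=20, iters=200,
                 f_min=0.0, f_max=2.0, loudness=0.5, pulse=0.5,
                 lower=-1.0, upper=1.0):
    """Simplified bat algorithm (loudness and pulse rate held fixed) that
    minimizes `objective` over a dim-dimensional vector, e.g. the
    flattened ANN weights and biases."""
    x = rng.uniform(lower, upper, (pop, dim))   # bat positions = candidate solutions
    v = np.zeros((pop, dim))                    # bat velocities
    fit = np.array([objective(b) for b in x])
    best_i = int(fit.argmin())
    best, best_fit = x[best_i].copy(), float(fit[best_i])
    for _ in range(iters):
        for i in range(pop):
            freq = f_min + (f_max - f_min) * rng.random()   # frequency tuning
            v[i] += (x[i] - best) * freq                    # move toward the current best
            cand = np.clip(x[i] + v[i], lower, upper)
            if rng.random() > pulse:                        # occasional local walk near the best
                cand = np.clip(best + 0.01 * rng.normal(size=dim), lower, upper)
            f_cand = objective(cand)
            if f_cand < fit[i] and rng.random() < loudness: # accept only improved solutions
                x[i], fit[i] = cand, f_cand
                if f_cand < best_fit:
                    best, best_fit = cand.copy(), float(f_cand)
    return best, best_fit
```

In the hybrid model, the objective function would wrap the ANN of the previous sketch: unpack the candidate vector into the weight matrices, run the forward pass on the training set, and return the training RMSE.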

Fig. 2
figure 2

The flowchart of the proposed hybrid BA-ANN model

Case study

Study area

As illustrated in Fig. 3, the five major selected study areas in Malaysia are designated A to E for simplicity. These areas are important because they account for the majority of energy-producing divisions in Malaysia and also provide flood control, irrigation support, and clean drinking water. Table 1 shows the details of the streamflow and rainfall records for each study area.

Fig. 3
figure 3

Location of the five selected study areas in Malaysia

Table 1 Rainfall and streamflow data details of the five selected study areas in Malaysia

Data sources and pre-processing

According to previous research, the primary variables influencing river flow in the water resources field are rainfall, flow rate, temperature, water level, streamflow, and water consumption, as they strongly affect the state and condition of the water source. However, the literature review showed that only rainfall and streamflow were consistently significant for the output, whereas the effects of temperature, water level, and water consumption remained unclear. Therefore, 41 years (492 months) of historical rainfall (RF) and streamflow (SF) data from 1980 to 2020, retrieved from DID and DOE, were used for the analysis in this study.

Data segmentation is critical in the ANN context for performance and to minimize data contamination. The empirical analysis by Gholamy et al. (2018) indicated that the best results are attained when 70–80% of the data are allocated for training and the remaining 20–30% for testing. Thus, for both the standalone ANN and hybrid BA-ANN models of this study, 70% of the historical data were used for training (January 1980 to August 2008) and the remaining 30% for testing (September 2008 to December 2020). Prior to training, the raw data were also normalized to the range 0 to 1, because both forecasting models use a bounded activation function that is sensitive to undefined input ranges. The statistical tests were then applied to the denormalized data, obtained by reversing the normalization to recover the actual streamflow magnitudes.
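A minimal sketch of this chronological split and min-max normalization is shown below; fitting the scaling bounds on the training portion only is an assumption, as the text does not state how the bounds were chosen.

```python
import numpy as np

def split_and_scale(series, train_frac=0.70):
    """Chronological 70/30 split followed by min-max scaling to [0, 1].
    Scaling bounds come from the training portion only (an assumption),
    and `inverse` reverts forecasts to the actual streamflow magnitude."""
    series = np.asarray(series, dtype=float)
    n_train = int(len(series) * train_frac)
    train, test = series[:n_train], series[n_train:]
    lo, hi = float(train.min()), float(train.max())

    def scale(x):
        return (x - lo) / (hi - lo)

    def inverse(x):
        return x * (hi - lo) + lo

    return scale(train), scale(test), inverse
```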

Model assessment and benchmark model comparison

In this work, the standalone ANN served as a benchmark against the BA-ANN for assessing the accuracy of the forecast SF flowing from the upstream river into the reservoirs of the five selected study areas. Furthermore, several statistical tests were conducted on the historical and forecasted data to assess the validity of the forecasting models for all study areas. The statistical tests comprise the root-mean-square error (RMSE), the coefficient of determination, commonly known as R-squared (R2), and the mean absolute error (MAE). In general, lower error values indicate greater predictive skill, whereas a higher R2 value is preferable. RMSE measures the average squared difference between estimated and actual values, MAE gives the mean of the absolute residuals, and R2 quantifies how well a fitted regression line explains the forecasts. The employed statistical indices are defined below:

$${\text{RMSE}} = \sqrt {\frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {x_{i} - x_{f} } \right)^{2} }$$
(7)
$$R^{2} = 1 - \frac{{\sum \left( {x_{i} - x_{f} } \right)^{2} }}{{\sum \left( {x_{i} - \overline{x}_{i} } \right)^{2} }}$$
(8)
$${\text{MAE}} = \frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left| {x_{i} - x_{f} } \right|$$
(9)
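The three indices in Eqs. (7)–(9) can be computed directly; the sketch below follows the standard definitions, with the R2 denominator taken about the mean of the observations.

```python
import numpy as np

def rmse(obs, pred):
    """Eq. (7): root-mean-square error."""
    obs, pred = np.asarray(obs, float), np.asarray(pred, float)
    return float(np.sqrt(np.mean((obs - pred) ** 2)))

def r_squared(obs, pred):
    """Eq. (8): coefficient of determination (residual sum of squares
    relative to the variance of the observations)."""
    obs, pred = np.asarray(obs, float), np.asarray(pred, float)
    ss_res = np.sum((obs - pred) ** 2)
    ss_tot = np.sum((obs - obs.mean()) ** 2)
    return float(1.0 - ss_res / ss_tot)

def mae(obs, pred):
    """Eq. (9): mean absolute error."""
    obs, pred = np.asarray(obs, float), np.asarray(pred, float)
    return float(np.mean(np.abs(obs - pred)))
```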

In Eqs. (7)–(9), \(x_i\) and \(x_f\) denote the observed and forecasted values, respectively, and \(n\) is the number of data points. In addition to these three criteria, an uncertainty analysis using the Sequential Uncertainty Fitting algorithm version 2 (SUFI-2) was performed to assess the model's dependability and consistency, and the results were further examined using a Taylor diagram, violin plot, relative error, and scatter plot.

Results and discussion

Solution representations (architecture and input selection)

Input selection is one of the most critical aspects of constructing a forecasting model, as it determines how strongly the model's inputs correlate with its output. A model-free technique was adopted in this study, leveraging the existing data for training, testing, and validation of the forecasting model. Table 2 displays the four streamflow vector input scenarios considered as inputs for the AI-based model. Aside from the input/output vectors, several parameters were considered when designing the architecture of the standalone ANN model. The different input scenarios and the optimal network architecture for the proposed model are explored and discussed below.

Table 2 Basis of the equations for all four scenarios of the forecasting models

Scenario 1 is the baseline analysis, using only Rt (the current historical RF value) as input. In Scenario 2, the model investigates the relationship between Rt−1 (the historical RF input with lag-time 1) and St−1 (the historical SF input with lag-time 1). Scenario 3 extends Scenario 2 by adding St−2 and Rt−2 to the Scenario 2 variables. Finally, Scenario 4 takes all possible input combinations into account. This preliminary step was conducted using only one study area.
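As a sketch of how the four input scenarios translate into lagged design matrices, the helper below assembles the columns described above; since Table 2 is not reproduced here, the exact column sets for scenarios 1, 2, and 4 in particular are assumptions inferred from the text.

```python
import numpy as np

def scenario_inputs(rain, flow, scenario=3):
    """Builds the lagged input matrix and the target S_t for one of the
    four input scenarios."""
    R, S = np.asarray(rain, float), np.asarray(flow, float)
    t = np.arange(2, len(S))                 # start where two lags are available
    target = S[t]                            # S_t, the streamflow to be forecasted
    columns = {
        1: [R[t]],                                          # Scenario 1: R_t only
        2: [R[t - 1], S[t - 1]],                            # Scenario 2: one-month lags
        3: [R[t - 1], R[t - 2], S[t - 1], S[t - 2]],        # Scenario 3 (adopted)
        4: [R[t], R[t - 1], R[t - 2], S[t - 1], S[t - 2]],  # Scenario 4: all inputs
    }
    return np.column_stack(columns[scenario]), target
```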

The hyperbolic tangent activation function was selected through trial-and-error analysis to help handle the large, complicated network and to reduce the time required for learning; it delivers output ranging from −1 to 1. A momentum (α) value of 0.3 and a learning rate (β) value of −0.3 were utilized during development. The scale of the network architecture, including the number of hidden nodes, was also examined. Table 3 shows the simulation results of the standalone ANN forecasting model for the four scenarios. Based on the overall assessment, an ANN architecture with inputs based on scenario 3 was utilized for one-step-ahead monthly streamflow forecasting.
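The architecture search can be expressed as a simple trial-and-error loop. The sketch below uses scikit-learn's MLPRegressor as a stand-in, keeping the tanh activation and the reported momentum of 0.3, while the candidate hidden-node counts, learning rate, and training budget are illustrative assumptions rather than the study's exact protocol.

```python
import numpy as np
from sklearn.metrics import mean_squared_error
from sklearn.neural_network import MLPRegressor

def select_hidden_nodes(X_train, y_train, X_test, y_test,
                        candidates=(2, 4, 6, 8, 10, 12)):
    """Trial-and-error over the hidden-layer size, keeping the candidate
    with the lowest testing RMSE."""
    results = {}
    for n in candidates:
        model = MLPRegressor(hidden_layer_sizes=(n,), activation="tanh",
                             solver="sgd", momentum=0.3, learning_rate_init=0.01,
                             max_iter=2000, random_state=0)
        model.fit(X_train, y_train)
        pred = model.predict(X_test)
        results[n] = float(np.sqrt(mean_squared_error(y_test, pred)))
    best = min(results, key=results.get)
    return best, results
```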

Table 3 Simulations of ANN standalone forecasting model in four scenarios

The BA parameters used to improve the ANN were obtained similarly. In addition to the parameters required by the ANN model, several BA parameters must be considered to regulate the echolocation behavior that allows the bats to find their prey. The optimal population size, number of iterations, pulse rate, and loudness were identified by trial and error. Table 4 shows the simulation results of the hybrid BA-ANN forecasting model for the four scenarios. Based on the overall assessment, a population size of 5, a pulse rate of 0.35, and a loudness of 0.2 were obtained, and the best architecture for the proposed work again uses the scenario 3 inputs.
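Plugged into the earlier bat_optimize sketch, these tuned values would look roughly as follows; rmse_objective and n_weights are hypothetical names for the training-RMSE wrapper and the length of the flattened ANN weight vector, and the 200 iterations follow the value mentioned in the next section.

```python
# Hypothetical call wiring the reported values (population 5, pulse rate 0.35,
# loudness 0.2) into the bat_optimize sketch above.
best_weights, best_rmse = bat_optimize(rmse_objective, dim=n_weights,
                                       pop=5, iters=200,
                                       pulse=0.35, loudness=0.2)
```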

Table 4 Simulations of hybrid BA-ANN forecasting model in four scenarios

Statistical evaluation of proposed model at five selected study areas

Tables 5 and 6 summarize the performance indicators for the reservoirs based on the best input scenario, scenario 3. Notably, the hybrid BA-ANN models outperform the standard ANN model across all metrics. One aspect of the comparison is computational time: the shorter time needed by the proposed approach indicates that BA improves the ANN's search toward the global optimum. This accelerated convergence is essential in reservoir streamflow forecasting, particularly in real time, as it gives the operator more time to make crucial decisions when an impending shortage or overflow occurs. The results also indicate that the standalone ANN has several limitations, in particular a tendency toward local convergence that slows its overall convergence. These results are consistent with the findings of Zaini et al. (2018), who, however, forecasted streamflow using additional meteorological inputs that were not included here; those extra parameters could have affected model accuracy. The current study demonstrates that the proposed approach can still perform effectively without the additional meteorological factors.

Table 5 Statistical test performance of the ANN standalone model at five selected study areas
Table 6 Statistical test performance of the hybrid BA-ANN model at five selected study areas

The proposed model achieves higher streamflow forecasting accuracy than the standard ANN while maintaining a faster convergence rate. For instance, the hybrid model simulation in Table 6 shows that, at 200 iterations, the average testing RMSE across the five study areas is 0.116 m3/s, compared with 0.103 m3/s for the standard ANN model. Under the proposed model, study area D has the lowest training RMSE, at 0.085 m3/s, while study area B has the highest, at 0.096 m3/s. Study area D also has the lowest testing RMSE, at 0.113 m3/s, whereas study area C has the highest, at 0.119 m3/s, followed by study area B. While the proposed hybrid approach decreases RMSE by 20%, R2 shows only a marginal increase, suggesting that R2 alone may not be sufficient to assess effectiveness. As presented in Tables 5 and 6, the superior exploration and exploitation of BA improve the search capability of the ANN by reducing the risk of being trapped in local optima, resulting in increased streamflow forecasting accuracy.

To visualize the capability of the ANN, which works without explicit knowledge of the underlying physical process, the observed and forecasted streamflow for all five study areas are plotted in Fig. 4. The standard ANN model cannot reliably estimate low streamflow at any of the five locations, which worsens the situation during water scarcity. It also regularly over- or underestimates peak flow, which can cause the reservoir to overflow, especially during periods of severe rain. With the addition of BA, the proposed model can accurately estimate both peak and minimum flows.

Fig. 4
figure 4

Streamflow simulation and forecast for all study areas

Supplementary analysis (all scenarios in each study area)

The preceding section emphasized a single scenario, scenario 3, as it proved to be the best input scenario for the ANN and hybrid models. However, as stated in Sect. 4.1, that analysis was applied only to study area D as a preliminary stage for testing the various derived scenarios and model architectures. The models were then developed for the other study areas using the optimized parameters reported in Sect. 4.1, but since only the third scenario had been adopted there, the question arises of whether the proposed paradigm applies to other cases. It is therefore crucial to validate the findings further by applying all scenarios to the remaining study areas.

The findings tabulated in Table 7 and Fig. 5 demonstrate that scenario 3 remains the best scenario even when applied to the other four study areas, as evidenced by its lowest RMSE value compared with the other scenarios. Given that all of the analyses point to scenario 3, it is plausible to assume that the elements to be considered in the SF forecasting equation should include: (1) Rt−1 (initial input value of historical RF data with lag-time 1), (2) Rt−2 (initial input value of historical RF data with lag-time 2), (3) St−1 (initial input value of historical SF data with lag-time 1), (4) St−2 (initial input value of historical SF data with lag-time 2), and (5) S0 (initial output value of forecasted SF data). In conclusion, identifying the best input scenario for the model is critical, since insufficient input frequently underperforms (scenario 1) while too much input is detrimental (scenario 4).

Table 7 Results of trial and error for all scenarios in each study area
Fig. 5
figure 5

Summary of sensitivity analysis (SA) results for all scenarios in each study area

Multi-step ahead forecasting analysis

In real-time settings, being able to anticipate future conditions through time series forecasting is a great benefit, and obtaining precise, accurate forecasts is especially valuable in engineering. To assess the developed model's capability in multi-step-ahead forecasting, an iterative forecasting approach was used to forecast future SF in all study areas from 2021 to 2025, a period of five years (60 months). It is worth noting that in the iterative approach the error of each prediction is carried into the subsequent forecast, so the accumulated error in streamflow accuracy grows with the horizon. The results of the multi-step-ahead forecasting are best presented graphically, as shown in Fig. 6.
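The iterative (recursive) strategy can be sketched as follows: each one-step forecast is fed back as a lagged input for the next step, which is also why errors accumulate over the 60-month horizon. The one-step model and the future rainfall series are placeholders; how rainfall inputs beyond 2020 were supplied is not stated in the text and is therefore an assumption here.

```python
import numpy as np

def iterative_forecast(predict, last_rain_lags, last_flow_lags, future_rain, horizon=60):
    """Recursive multi-step forecast. `predict` is a hypothetical one-step
    scenario-3 model taking [R_{t-1}, R_{t-2}, S_{t-1}, S_{t-2}];
    `future_rain` is an assumed rainfall series for the forecast period."""
    r1, r2 = last_rain_lags      # most recent and second most recent rainfall
    s1, s2 = last_flow_lags      # most recent and second most recent streamflow
    forecasts = []
    for t in range(horizon):
        s_next = float(predict(np.array([[r1, r2, s1, s2]])))
        forecasts.append(s_next)
        s1, s2 = s_next, s1      # feed the forecast back as the new streamflow lag
        r1, r2 = future_rain[t], r1
    return np.array(forecasts)
```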

Fig. 6
figure 6

Forecasted streamflow data of all five study areas for 2021–2025

Figure 7 depicts the forecasted maximum and minimum streamflow, clarifying the results shown in Fig. 6. The SF for study area D, which served as the preliminary case, ranges from 10.00 to 70.00 m3/s; the highest value, 77.87 m3/s, is forecasted for May 2022, and the lowest, 0.88 m3/s, for January 2021. In study area A, the SF ranges from 2.00 to 6.00 m3/s, with the highest value of 7.68 m3/s anticipated in May 2023 and the lowest of 1.60 m3/s in July 2023. In study area B, the SF varies from 5.00 to 45.00 m3/s, with a maximum of 53.10 m3/s anticipated in August 2021 and a minimum of 3.68 m3/s in April 2022. In study area C, the SF fluctuates from as low as 20.00 m3/s to as high as 65.00 m3/s; the estimated peak of 65.81 m3/s in May 2023 is expected to fall to the minimum of 2.15 m3/s in February 2024, around nine months later. Finally, in study area E, the SF fluctuates from 2.00 to 5.00 m3/s; the forecasted SF reaches its highest value of 7.68 m3/s in May 2023 and its lowest of 1.57 m3/s in February 2025.

Fig. 7
figure 7

Minimum and maximum forecasted data of all five study areas from 2021 till 2025

Uncertainty assessment

A 95% confidence interval (95 CI) was established to examine the uncertainty in the developed model's performance. The p-factor and r-factor, two essential statistical parameters, were computed to show the degree of uncertainty (Fig. 8). The two have distinct interpretations: a better-trained model has a p-factor close to one and an r-factor close to zero. Numerically, study area A scores the highest p-factor, at 0.996 (99.6%), followed by study area C at 0.992 (99.2%); study area E has the lowest, at 0.986 (98.6%), while study areas D and B both score 0.990 (99.0%). The r-factors for the five reservoir study areas are 0.01, 0.017, and 0.013, with study area E being the best, study area D the worst, and study areas A, B, and C comparable. In general, p-factors larger than 0.7 and r-factors smaller than 1.5 are considered acceptable; even the worst p-factor of 0.986 and r-factor of 0.017 comfortably satisfy these criteria. Goodness-of-fit indices such as the average distance (dx), which is ideally close or equal to zero, can also be used to quantify the uncertainty in streamflow forecasting. The lowest dx values, in study areas A and E, suggest that the model captured most of the underlying information in those streamflow series, whereas study areas B, C, and D have dx values greater than 0.15. A notable observation is that the p-factor and r-factor differ only modestly between study areas, so the dx, whose variation is considerably larger and more apparent, is needed to discriminate among them.
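The two SUFI-2 statistics can be computed from an ensemble of model realizations as sketched below, following the usual definitions (p-factor: share of observations inside the 95% prediction band; r-factor: mean band width relative to the standard deviation of the observations); the construction of the ensemble itself is not shown and is assumed to come from the SUFI-2 sampling.

```python
import numpy as np

def p_and_r_factor(obs, ensemble):
    """p-factor = fraction of observations inside the 95% prediction band
    (2.5th-97.5th percentile of the simulation ensemble); r-factor = mean
    band width divided by the standard deviation of the observations.
    `ensemble` has shape (n_simulations, n_timesteps)."""
    obs = np.asarray(obs, float)
    lower = np.percentile(ensemble, 2.5, axis=0)
    upper = np.percentile(ensemble, 97.5, axis=0)
    p_factor = float(np.mean((obs >= lower) & (obs <= upper)))
    r_factor = float(np.mean(upper - lower) / np.std(obs))
    return p_factor, r_factor
```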

Fig. 8
figure 8

Summary of SUFI-2 results for hybrid BA-ANN model in all five study areas

Graphical representation of the proposed model proficiency

The inevitable concern is whether the same level of performance will be maintained in subsequent years. To compare model performance further, several graphical representations, including relative error, scatter plots, and Taylor and violin diagrams, were used to present the findings obtained with the best input combination.

Taylor diagrams based model performance

The Taylor diagram is a diagnostic tool that provides a simple visual summary of model performance against the observed data. The Taylor diagram for each of the five study areas is shown in Fig. 9. The correlation coefficient (R), RMSE, and standard deviation (SD) together indicate how closely the prediction models match the observation records.
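The three statistics a Taylor diagram summarizes can be computed as follows; this sketch only returns the numbers and does not draw the polar diagram itself.

```python
import numpy as np

def taylor_statistics(obs, pred):
    """Correlation R, standard deviations, and the centred RMSE between
    a simulated series and the observations."""
    obs, pred = np.asarray(obs, float), np.asarray(pred, float)
    r = float(np.corrcoef(obs, pred)[0, 1])
    crmse = float(np.sqrt(np.mean(((pred - pred.mean()) - (obs - obs.mean())) ** 2)))
    return {"R": r, "SD_obs": float(obs.std()), "SD_sim": float(pred.std()),
            "centred_RMSE": crmse}
```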

Fig. 9
figure 9

Taylor diagram of hybrid BA-ANN at all five study areas: a training phase, b testing phase

First, the placement of the developed models in the Taylor diagram reveals an important finding: as every model falls inside the black dashed line, all tend to underestimate the variability of streamflow at their respective study areas, although study areas B and E lie relatively closer to their measured streamflow than the others. The general underestimation of variability may stem from the models' difficulty in capturing streamflow during peak flows. Study area E typically performs the poorest on all three statistics throughout the validation period, while study area D performs best, with an RMSE of 0.113 m3/s and the highest R of 0.974; the simulated pattern closest to the reference point fits the data well. The remaining study areas perform moderately, with approximately identical R values; among them, study area B is noticeably closer to the observations and has a comparable standard deviation, while study area A is less susceptible to spatial variability than study area B.

Violin diagrams based model performance

A violin plot, as explained in the previous section, displays numeric data so that the percentile ranking of accuracies, including the maximum and minimum values, the median, and the interquartile range, can be examined together with the entire distribution of the simulated streamflow. It is another way to compare the distribution of the model output with reality: it resembles a box plot but gives a more visible display and a more detailed description of the differences between the underlying distributions. The denser section of each violin indicates the range of values where the streamflow distribution is most concentrated, while the tips show the full range of values without filtering out outliers or background noise.
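A basic violin comparison of observed and simulated streamflow can be produced with matplotlib as sketched below; the axis labels and figure size are illustrative choices.

```python
import matplotlib.pyplot as plt

def violin_comparison(observed, simulated, title="Observed vs. simulated streamflow"):
    """Side-by-side violins of the observed and simulated distributions,
    with medians and extrema marked."""
    fig, ax = plt.subplots(figsize=(5, 4))
    ax.violinplot([observed, simulated], showmedians=True, showextrema=True)
    ax.set_xticks([1, 2])
    ax.set_xticklabels(["Observed", "Simulated"])
    ax.set_ylabel("Streamflow (m$^3$/s)")
    ax.set_title(title)
    return fig
```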

The findings are recorded in Table 8 and presented in Fig. 10. The shape of the density mass function is highly comparable to the observations in all five study areas. Nevertheless, a closer look reveals greater dispersion and skewness in the study area D streamflow, as indicated by the highest generated value of 23.86 m3/s. The study area B model shows the highest density of streamflow values at the upstream discharge, with an error of 0.42 m3/s compared with 0.03 m3/s in study area A. Although the average error is 0.15 m3/s, the differences among the study areas are minor: study areas D, A, and E have tails comparable to the recorded values at the upstream discharge, whereas B and C have longer tails. Except for the study area C model, which underestimated streamflow at the 25th percentile, all study area models overestimated it; such variances are nevertheless small, ranging from 0.41 m3/s at study area B to 0.12 m3/s at study area D. The 75th percentile shows a similar picture in all study areas, albeit with minor differences: study areas D, A, and B follow the same trend, though with larger errors than at the 25th percentile, while study area C performs better at the 75th percentile than at the 25th, indicating that its model forecasts peak flow relatively better than low flow.

Table 8 Summary of violin plot at all five study areas
Fig. 10
figure 10

Violin plot for all study areas: a study area D, b study area A, c study area B, d study area C, and e study area E

Scatter plots and relative errors based model performance

Scatter plots of the estimated against the observed streamflow were generated for this section. As seen in Fig. 11, the points increase together along both axes, and all five sites show a strong positive association between historical and simulated data, with R2 values above 0.9. Furthermore, the computed relative error (RE) of the simulated data against the historical data ranges from 1.39 to 2.51% across the five study areas. Study area A has the highest RE, at 2.51%, followed by E at 2.49%; the RE is 1.57% at study area D and 1.44% at study area B; and study area C has the lowest, and hence the best, RE of 1.39%. The findings are shown in Fig. 12.
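The scatter plot and the relative error can be reproduced as sketched below; since the text does not give the RE formula, the aggregate ratio of absolute error to absolute observed flow is assumed.

```python
import numpy as np
import matplotlib.pyplot as plt

def relative_error_percent(obs, pred):
    """Aggregate relative error in percent (assumed formulation: sum of
    absolute errors over sum of absolute observed flows)."""
    obs, pred = np.asarray(obs, float), np.asarray(pred, float)
    return float(100.0 * np.sum(np.abs(pred - obs)) / np.sum(np.abs(obs)))

def scatter_with_reference(obs, pred):
    """Scatter of simulated against observed streamflow with a 1:1 reference line."""
    fig, ax = plt.subplots(figsize=(4, 4))
    ax.scatter(obs, pred, s=10, alpha=0.6)
    lim = [min(np.min(obs), np.min(pred)), max(np.max(obs), np.max(pred))]
    ax.plot(lim, lim, "k--", linewidth=1)
    ax.set_xlabel("Observed streamflow (m$^3$/s)")
    ax.set_ylabel("Simulated streamflow (m$^3$/s)")
    return fig
```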

Fig. 11
figure 11

Scatter plot for all study areas: a study area D, b study area A, c study area B, d study area C, and e study area E

Fig. 12
figure 12

Results of relative error (RE) of hybrid BA-ANN for all five study areas

Conclusion and recommendation

In this study, an evolutionary AI-based model was developed and assessed for multi-time scale streamflow forecasting, with the bat algorithm (BA) adopted as the optimization approach to search for the optimal set of weights and biases of the artificial neural network (ANN). The superior exploration and exploitation of BA can significantly improve the performance of the ANN and yield better forecasting accuracy. Historical streamflow (SF) and rainfall (RF) data were assumed to be the significant contributors to the river's upstream inflow into the reservoir.

Experimental verification on five selected study areas offered substantial evidence of the proposed work's applicability and dependability across all adopted performance indicators. The SUFI-2 uncertainty analysis confirmed that the p-factor of the proposed hybrid model exceeded 95% and that the average distance (dx) values for all study areas were near zero. A further implication of the research is that BA, as an optimization tool, is well suited to ANN for addressing nonlinear and complex time series. BA excels at pure optimization, although this differs somewhat from how the ANN model learns; this research suggests that BA can nonetheless train an ANN with adequate generalizability, as indicated by the reduced RMSE on the unseen testing dataset. As this study has shown, the performance of the proposed work differs from case to case. It could provide valuable insight to other researchers studying the regulation of hydroelectric water discharge under different conditions using an effective streamflow forecasting technique. Finally, the proposed work has the potential to be extended to deep learning models, which are potentially more complex and introduce additional parameters to be optimized.

The work conducted for this research nonetheless calls for additional investigation in the future. The outcomes of this research suggest the following directions:

(a) The association between existing data on climate, hydrology, and land use should be examined in more depth to investigate the components of SF forecasting.

(b) The proposed forecasting model could be extended to additional study areas in diverse geographical locations, climate zones, and land-use settings.

(c) The hybrid BA-ANN model can be integrated with other deterministic models (e.g., InfoWorks RS) to enhance the management of SF.

(d) The hybrid BA-ANN model can be combined with other expert systems (for example, fuzzy approaches) to develop a more comprehensive short-term and long-term SF monitoring strategy.