Application of novel binary optimized machine learning models for monthly streamflow prediction

Adnan, Rana Muhammad; Dai, Hong-Liang; Mostafa, Reham R.; Islam, Abu Reza Md. Towfiqul; Kisi, Ozgur; Elbeltagi, Ahmed; Zounemat-Kermani, Mohammad

doi:10.1007/s13201-023-01913-6

Application of novel binary optimized machine learning models for monthly streamflow prediction

Original Article
Open access
Published: 08 April 2023

Volume 13, article number 110, (2023)
Cite this article

Download PDF

You have full access to this open access article

Applied Water Science Aims and scope Submit manuscript

Application of novel binary optimized machine learning models for monthly streamflow prediction

Download PDF

Rana Muhammad Adnan¹,
Hong-Liang Dai¹,
Reham R. Mostafa²,
Abu Reza Md. Towfiqul Islam^3,8,
Ozgur Kisi ORCID: orcid.org/0000-0001-7847-5872^4,5,
Ahmed Elbeltagi⁶ &
…
Mohammad Zounemat-Kermani⁷

2502 Accesses
22 Citations
1 Altmetric
Explore all metrics

Abstract

Accurate measurements of available water resources play a key role in achieving a sustainable environment of a society. Precise river flow estimation is an essential task for optimal use of hydropower generation, flood forecasting, and best utilization of water resources in river engineering. The current paper presents the development and verification of the prediction abilities of new hybrid extreme learning machine (ELM)-based models coupling with metaheuristic methods, e.g., Particle swarm optimization (PSO), Mayfly optimization algorithm (MOA), Grey wolf optimization (GWO), and simulated annealing (SA) for monthly streamflow prediction. Prediction precision of standalone ELM model was compared with two-phase optimized state-of-the-arts models, e.g., ELM–PSO, ELM–MOA, ELM–PSOGWO, and ELM–SAMOA, respectively. Hydro-meteorological data acquired from Gorai and Padma Hardinge Bridge stations at Padma River Basin, northwestern Bangladesh, were utilized as inputs in this study to employ models in the form of seven different input combinations. The model’s performances are appraised using Nash–Sutcliffe efficiency, root-mean-square-error (RMSE), mean absolute error, mean absolute percentage error and determination coefficient. The tested results of both stations reported that the ELM–SAMOA and ELM–PSOGWO models offered the best accuracy in the prediction of monthly streamflows compared to ELM–PSO, ELM–MOA, and ELM models. Based on the local data, the ELM–SAMOA reduced the RMSE of ELM, ELM–PSO, ELM–MOA, and ELM–PSOGWO by 31%, 27%, 19%, and 14% for the Gorai station and by 29%, 27%, 19%, and 14% for Padma Hardinge bridge station, in the testing stage, respectively. In contrast, based on external data, ELM–PSOGWO improves in RMSE of ELM, ELM–PSO, ELM–MOA, and ELM–SAMOA by 20%, 5.1%, 6.2%, and 4.6% in the testing stage, respectively. The results confirmed the superiority of two-phase optimized ELM–SAMOA and ELM–PSOGWO models over a single ELM model. The overall results suggest that ELM–SAMOA and ELM–PSOGWO models can be successfully applied in modeling monthly streamflow prediction with either local or external hydro-meteorological datasets.

Enhancing accuracy of extreme learning machine in predicting river flow using improved reptile search algorithm

Article 01 June 2023

Integrated metaheuristic algorithms with extreme learning machine models for river streamflow prediction

Article Open access 12 June 2024

Urmia lake water depth modeling using extreme learning machine-improved grey wolf optimizer hybrid algorithm

Article 06 September 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Accurate and reliable streamflow prediction is of great importance in designing, developing, and proper operation of water resources management, hydropower generation, and ecohydrological studies and plans. In this sense, various and vigorous modeling strategies have been introduced and implemented in the literature, such as numerical-based (Suiju and Minquan 2015), autoregressive moving average models (Liu et al. 2015), and conceptual models (Boulariah et al. 2019).

During the last decades, with the advent of high-speed processors and the development of novel Soft Computing (SC) techniques, a new era of data mining methods, e.g., Machine Learning (ML) models for modeling and simulating different phenomena in hydrology, has been emerged (Najafzadeh and Niazmardi 2021; Najafzadeh et al. 2021; Zounemat-Kermani et al. 2020a, b; Najafzadeh et al. 2021; Barzkar et al. 2022; Granata et al. 2022; Ikram et al. 2023; Mostafa et al. 2023). Accordingly, streamflow prediction using SC techniques and data-driven models has been one of the main targets for hydrologists and researchers in recent years (Adnan et al. 2020a). These models use ML techniques to map the complicated relationship between the predictors (like precipitation and/or previous observed discharge values) and the response variable (short-term or long-term streamflow) (Adnan et al. 2020b). Poul et al. (2019) challenged four different types of data-driven models, including Multi-Linear Regression (MLR), Artificial Neural Network (ANN), K-nearest neighbors, and Adaptive Neuro-Fuzzy Inference System (ANFIS) in monthly streamflow prediction based upon six input combinations. In general, the ANFIS and ANN models were slightly superior to the other methods. Fu et al. (2020) applied deep learning ML models for the prediction of daily streamflow in a tropical environment. They compared the results of the Long Short-Term Memory (LSTM), as the deep network, to the conventional backpropagation ANN. The findings of the study revealed the obvious advantages of the LSTM over the classic ANN.

It should be noted that among the successful applications of various types of ML models, the Extreme Learning Machine (ELM), which presents a simple structure yet fast learning speed single-layer ANN, has proven its accuracy and efficiency in simulating complex and nonlinear hydrological problems (Rezaie-Balf and Kisi 2018; Yaseen et al. 2020; Saberi-Movahed et al. 2020; Ikram et al. 2022).

In a study, Niu et al. (2020) utilized the ELM for the annual prediction of streamflow using data series of three hydropower reservoirs in China. It was shown that the developed ELM model outperformed several traditional methods. According to the previous researches, the majority of ML applications for streamflow prediction confirm the proper potential and accuracy of ML in comparison with the conventional statistical or conceptual models in addressing this hydrological issue (Riahi-Madvar et al. 2019). Subsequently, new ideas for upgrading and enriching ML models, especially using integrative (hybrid) ML-heuristic methods, have been being developed and tested recently (Fadaee et al. 2020).

In the field of hydrology, several researchers have reported the successful application of heuristic algorithms embedded with ML models. Sudheer et al. (2014) developed a hybrid (integrative) ML model using Support Vector Regression (SVR) and Particle Swarm Optimization (PSO) for forecasting monthly streamflow in the USA. Analyzing the forecasted results indicated that the SVR–PSO model gave a better performance than the ANN and statistical model, namely ARMA (autoregressive moving average model). Malik et al. (2020) employed several integrative models based on the SVR and six heuristic algorithms in predicting daily streamflow in India. Comparing the results showed that SVR–HHO (SVR and Harris Hawks Optimization algorithm) during calibration/validation periods had superior performance to the other developed integrative models. Jiang et al. (2020) developed an integrative (hybrid) ELM–PSO model for monthly streamflow forecasting. The findings of the study demonstrated the superiority of the ELM–PSO over standard ELM, SVR, and ANFIS models.

Considering conventional heuristic algorithms, the PSO and Simulated Annealing (SA) are among the most widely used optimization algorithms in hydrology, and numerous applications have been applied using these algorithms (Bazargan and Norouzi 2018; Hosseini et al. 2020). Recently, some new heuristic algorithms have been introduced and claimed with high capability in optimizing sophisticated problems, such as Mayfly Optimization Algorithm (MOA) (Zervoudakis and Tsafarakis 2020), Grey Wolf Optimization (GWO) (Mirjalili et al. 2014), and hybrid PSOGWO (Şenel et al. 2019). However, there has not been a report on the comprehensive assessment of the capabilities of these algorithms in dealing with hydrological issues compared to the conventional ones (here, the PSO and SA). In this regard, this study utilizes the extreme learning machine (ELM) as the main model embedded with conventional (PSO and SA) and novel (GWO, hybrid PSOGWO, MOA, and hybrid SAMOA) as predictive models to realize streamflow in subhumid region of Bangladesh.

The main contribution and innovation of this study lie in (1) developing four integrative ELM models and analyzing their performance on simulation of a complicated engineering problem, namely streamflow prediction, and (2) appraisal of all developed models based on seven different input combinations scenarios including lagged streamflow values, precipitation, and temperature. To have a clear comprehension, these new approaches include simple (ELM–MOA and ELM–GWO) and hybrid integrative (ELM–PSOGWO and ELM–SAMOA) ML models. Continuously, the results of the new mention models will be compared with the standard ELM and integrative ELM–PSO and ELM–SA to achieve an inclusive perception for their usefulness and accuracy.

Case study

For the current research, the Padma river, a large river basin, is the major downstream stretch of the Ganges River, which runs more than 2561 km² derived from the Gangotri glacier of the Himalayan system which was chosen as a case study. This river basin is regarded as one of the highly populated residents on the globe. The Padma River acts a crucial role in the socio-economic conditions of the country. The stretch of the Padma river flows for 108 km² before the confluence with Meghna River at Chandpur point. Gorai River is one of the tributaries of the Padma River. The accumulated discharge of the Padma and Brahmaputra River is 30,000 m³/s⁻¹, and sometimes, it reaches 75,000 m³/s⁻¹ during the bank full phase (Dewan et al. 2017). Geographically, the study basin is positioned between 23° 48′ and 25° 18′ north latitudes and 88° 27′ and 89° 48′ east longitudes. Annually, 900 metric tons of sediment load passes through the river, out of which, 60% of sediment is either silt or clay, while the rest is bed load (Islam 2016). Islam et al. (2021) stated the floodplain of the river as a ‘wandering’ form. Padma River basin is vital for sustaining livelihood through agricultural activities, navigation, nourishment, aquaculture, and environmental sustainability perspective. For instance, the freshwater distributed by this basin is truly imperative to sustaining a riparian ecosystem of the south-western region of Bangladesh, mostly in the world’s largest mangrove forest, the Sundarbans, by holding the salinity anterior downstream side into the Bay of Bengal (BOB) (Mirza 2004). Apart from this, this basin reports extreme variability of flow regime (water and sediment), triggering from monsoonal precipitation and the melting of the Himalayan ice, which causes frequent large floods of high magnitude in Bangladesh. Further, bank erosion and river shifting are common phenomena in this basin, which led to environmental degradation and population migration. This is anticipated to enhance in forthcoming years (CDMP 2014) since elevated precipitation triggered by climate change will raise the runoff into the Ganges–Brahmaputra–Meghna River systems (Moors et al. 2011). Four seasons, such as summer (March–May), monsoon (June–September), post-monsoon (October–November), and winter (December–February), have predominated in this region with significant temperature and precipitation differences (Akhter et al. 2019). Runoff in the Padma river was mostly distributed in June–October, accounting for 72.5% of the accumulated annual runoff. The dry season is from December to April, accounting for l2.1% of the accumulated annual runoff. From December to February, it was the lowest, accounting for solely 5.87% of the annual total (Islam et al. 2016). Runoff in this river basin mostly originates from heavy rainfall. Therefore, in the current study, two hydro-meteorological sites on the Padma River basin, e.g., Padma Hardinge bridge and Gorai stations were employed for predicting monthly streamflow (see Fig. 1). The data adopted in this work were collected from Bangladesh Water Development Board (BWDB).

Methods

Extreme learning machine (ELM)

ELM is a form of the single-layer feed-forwarding network (FFN) that aims to intertwine the traditional neural networks and biological learning mechanism. Due to its special structure based on the random hidden neurons mechanism—in which hidden neurons do not need to be tuned similar to the conventional ANNs—it can provide precise modeling results having a lower computational cost. Moreover, ELM offers other advantages such as ease of implementation, better generalization ability, and minimal human intervention. As a result of the mentioned arguments and reasons, ELM has been chosen as the core machine learning tools in this study. The main mathematical methodology of ELM is described in the following.

As can be seen in Fig. 2, a single hidden layer resides in the ELM network. The input weights between the input and hidden layers are only initialized once and do not need to be conditioned (Zhu et al. 2019; Prasad et al. 2018; Yadav et al. 2017; Zhang et al. 2020). Iterative testing is used to prepare the outcome weights between the un-seen and outcome layers. The training and process time of an ELM is much quicker than that of the comparable single-layer FFN since the input weights remain in their initial state and only the output weights are trained. Huang et al. (2006) suggested the ELM, a basic three-layer structure algorithm, to address the shortcomings of conventional soft computing techniques. The input weight and the bias values are generated at random in the ELM structure. ELM uses a basic simplified inverse operation of the hidden layer output matrix to measure the output weight matrix between hidden and output layers analytically (Fan et al. 2018; Zhu et al. 2020). The ELM is a promising time series prediction method because of its interpolation and uniform approximation capabilities. ELM can be represented mathematically as a function with L hidden nodes and N training data, as seen in Huang et al., (2006):

$$\mathop \sum \limits_{i = 1}^{L} w_{i} g\left( {W_{{{\text{in}}\left( i \right)}} ,b_{i} ,x_{j} } \right) = \mathop \sum \limits_{i = 1}^{L} w_{i} g\left( {W_{{{\text{in}}\left( i \right)}} ,x_{j} + b_{i} } \right) = Y_{j} ,\;j = 1, \ldots ,N.$$

(1)

where ${x}_{j}$ represent the input vector, ${W}_{\mathrm{in}\left(i\right)}$ denotes the weight vtor of the input, ${W}_{\mathrm{in}\left(i\right)}$.${x}_{j}$ corresponds the inner product of ${W}_{\mathrm{in}\left(i\right)}$ and ${x}_{j}$, b_i denotes the bias of the ith hidden node, g(∙) denotes the sigmoid function, w_i refers to the weight matrix of the output, and y_j is the modeled output of the ELM. At the start of the ELM algorithm, the input weight and bias are selected at random.

Particle swarm optimization (PSO)

The PSO algorithm is a type of optimization algorithm to simulate the behavior of swarm intelligence (Qi et al. 2018; Xi et al. 2021). It is an efficient globally optimized search algorithm through the system intelligence guidelines that comes from the collaboration and rivalry between parts of the group. In the research space of N-dimensional, it can specifically be represented randomly as a particle group (the quantity is m) and each implemented a special structure of ANN (Chen et al. 2020; Wang et al. 2018). The primary architectural theory of PSO is closely related to two studies: the first is the evolutionary algorithm; the PSO swarm mode where the mostly optimized objective solution space can be searched simultaneously. The other is synthetic life and in particular the examination of artificial systems with life-characteristics. The efficiency of the particles location was measured by the statistical error in the training phase. More specifically, at generating the lowest values of MSE, the ANN structure is defined by a particle with a superlative performance (Elsheikh and Abd Elaziz 2019). The next swarm was produced based on updating the particles’ locations, which takes into account the best position of swarm and particle. Particle swarms gradually shifted to the optimal location up to the maximum number of iterations. The velocity and direction of the particle can be modified as follows:

$$V_{i}^{k + 1} = \omega \times V_{i}^{k} + c_{1} \times {\text{rand}} () \times \left( {P_{i}^{k} - X_{i}^{k} } \right) + c_{2} \times {\text{rand}}() \times \left( {G_{i}^{k} - X_{i}^{k} } \right)$$

(2)

$$X_{i}^{k + 1} = X_{i}^{k} + V_{i}^{k + 1}$$

(3)

where ${V}_{i}^{k+1}$ and ${V}_{i}^{k}$ denote the particle ($i$) velocity at iteration ($k$) and ($k+1$), rand () represents a random value between 0 and 1. c₁, c₂ are called a learning or acceleration factor and equaled 2 for both in this work. $\omega$ refers to the coefficient of inertia. P and G represent the best place for a particle and a swarm, respectively. In the present work, number of iterations, population, and runs were 100, 25 and 7, respectively. $\omega$ ranged from 0.2 to 0.9.

Mayfly optimization algorithm (MOA)

The suggested process of optimization considers as a PSO adjustment and the combination of main PSO, genetic and firefly algorithms benefits (Gao et al. 2020; Zervoudakis and Tsafarakis 2020). In reality, it offers for researchers who seek to advance the efficiency of the PSO algorithm with certain techniques and local search, a powerful hybrid algorithm, dependent upon the mayflies’ actions, as PSO has proved that modifications are required to ensure optimum performance in high-dimensional areas. The algorithm functions like this. Initially, the random generation of two groups of mayflies is male and female, respectively (Mansouri et al. 2019). The individual fly can be put on a random basis as a d-dimensional vector solution x = (x₁,…,x_d) in problem space, and its output is assessed on the previous definition of objective function f (x). The velocity of the mayfly v = (v₁,…,v_d) is known as the shift of its path, and each mayfly’s flying path is a complex relationship between individual and social experience. In specific, they will change their direction to their best (P_best) location and the best position achieved by any swarm flying (G_best) (Haddad et al. 2006). In Swarms, male’s mayflies will continue to investigate or manipulate during iterations. The speed will be modified to their present fitness data and historical optimal values of fitness (Chen and Shi 2019; Mansouri et al. 2019; Zhou et al. 2018). The females will update their speeds in a particular way. In biological terms, the female only live with wings in one day to at most seven days, so the female mayflies need to search the male mayflies to intermarriage and spread. They will then update their speeds on the basis of their masculine flies. The top best male and female mayflies are regarded as the first mate and the second best female/male mayflies as the second mates and so on. In the present study, the parameters used for this algorithm are presented in Table 1.

Table 1 Parameters setting of each optimization algorithms

Full size table

Simulated annealing algorithm (SA)

The SA is an algorithm that belongs to meta-heuristic methods and is known also as Monte Carlo Annealing (Shao and Zuo 2020; Tufano et al. 2020; Turhan and Bilgen 2020). The SA algorithm is a physical annealing method based on simulation that has successfully extended to multiple dynamic problem optimization. The concept behind the SA stems from thermodynamics and illustrates how to mirror the solidifying mechanism of the fluid into a crystalline solid (Abdel-Basset et al. 2021; Redi et al. 2020; Silva et al. 2020; Tang et al. 2020). Annealing is a physical mechanism utilized to tend to harden metals beginning at maximum temperature and cool off slowly. At first, SA parameters including the initial temperature T₀, final temperature T_final and cooling rate are initialized. The original temperature is the maximum and will be gradually refreshed using the rate of cooling before the end temperature is reached (Ben Messaoud 2020; Meng et al. 2020). A sharp decline in temperature causes the molecules to select the right location, normally not the best possible case, therefore quench the object. The algorithm starts with a randomly created solution. The approach is based on progressive improvement of the present solution. During iterations process, a new neighboring solution for the current solution is chosen. If the latest neighboring solution is stronger, the existing solution is modified. In addition, when the adjacent solution is stronger, the optimal solution is refreshed. When the end temperature is hit, the model stops (Liu et al. 2020; Zhao et al. 2020). For each iteration (), the actual temperature T is modified by:

$$T = T * \tau , \, 0 < \tau < {1}$$

(4)

Boltzmann distribution gives the probability according to mathematical thermodynamics that a molecule is at a certain energy degree as follows:

$$P\left( {\varepsilon_{i} } \right) = \frac{{{\text{exp}}^{{ - \varepsilon_{i/kT} }} }}{{\mathop \sum \nolimits_{j = 1}^{m} {\text{exp}}^{{ - \varepsilon_{j/kT} }} }}$$

(5)

where ${\varepsilon }_{i}$ is the energy at the state ($i$), k represents the constant of Boltzmann, T equals the thermodynamic temperature and m denotes the overall states number. In this study, the initial and final temperatures were 100 and 0.01, respectively, and the rate of cooling was 0.7 (Table 1).

Grey wolf optimizer (GWO)

GWO is an advanced successful introduced swarm intelligence algorithm suggested by Mirjalili et al. (2014) in last decade. The kind of algorithms is based on the imitation of the social order and hunting activities of the Grey Wolf Herd. GW is purely hierarchical social species comprising $\propto$, β, δ, and $\omega$. In this respect, $\propto$ is the leader who distributes several assignments to individuals at various levels to achieve global optimization. Due to its simple structure, insignificant parameter change required and high accuracy, the GWO algorithm has always been used for functional optimization (Dehghani et al. 2019; Himanshu et al. 2020; Wang et al. 2018). The location of the ith wolf is defined as X_i = X^I₁, X^I₂…, X^I_d. X^I_d stands for the position of the ith wolf is d-dimensional space, for a population containing N grey wolves (X = X₁, X₂,…, X_N). The specific role of hunting is as follows:

$$D = \left| {C* X_{i } \left( t \right) - X\left( t \right)} \right|$$

(6)

$$X_{i} \left( {t + 1} \right) = X \left( t \right) - A*D$$

(7)

where A and C stand for the coefficient vectors, t is number of iterations, X(t) is the position vector of GW, X_i (t) is target position vector of the GW, D corresponds to the distance between the prey and the grey wolf.

Defining the coefficient vector as follows:

$$A = 2*a*r_{1} - a$$

(8)

$$C = 2*r_{2}$$

(9)

$$a = 2 - i*\left( {\frac{2}{Maximum \; iteration}} \right)$$

(10)

where r₁ and r₂ correspond to the random vectors with values (0, 1) and a represents the iteration factor. In this study, the population, number of iterations, and number of runs for all algorithms were 25, 100, and 7, respectively, as demonstrated in Table 1. In addition, a decreased from 2 to 0. GW has strong food quest ability. ∝ is the boss who will serve in all activities and sometimes β and δ can participate. β and δ can also provide ∝ with successful goal information in the GWO as an ideal solution (Mohammadi et al. 2020; Tikhamarine et al. 2020, 2019; Yu and Lu 2018). Therefore, α, β, and δ are the three ideal alternatives in fact and their adjusted positions:

$$D_{ \propto } \;= \left| {C_{1} * X_{ \propto }\; \left( t \right) - X\left( t \right)} \right|$$

(11)

$$D_{{\upbeta }} = \left| {C_{2} * X_{{\upbeta }} \left( t \right) - X\left( t \right)} \right|$$

(12)

$$D_{{\updelta }} = \left| {C_{3} * X_{{\updelta }} \left( t \right) - X\left( t \right)} \right|$$

(13)

$$X_{1} = X_{ \propto } - A_{1} *D_{ \propto }$$

(14)

$$X_{2} = X_{{\upbeta }} - A_{2} *D_{{\upbeta }}$$

(15)

$$X_{3} = X_{{\updelta }} - A_{3} *D_{{\updelta }}$$

(16)

$$X \left( {t + 1} \right) = \frac{{X_{1} + X_{2} + X_{3} }}{3}$$

(17)

where X_α, X_β, and X_δ correspond to the current positions of the three optimal solutions α, β, and δ, respectively; X(t) stands for the target position; D_α, D_β, and D_δ denote the distances from the prey to the three solutions, respectively; X(t + 1) is the position vector with updated searching factor; C and A represent the random vectors.

Hybrid PSO–GWO algorithm

Without modifying the overall operation of PSO and GWO algorithms, our hybrid PSO–GWO algorithm has been built. In nearly all real-world issues, the PSO can produce good results. Nevertheless, the PSO algorithm has to be reduced to minimal in the local solution. The GWO algorithm in our approach suggested follows the PSO algorithm in order to minimize the risk of slipping through a minimum local level (Şenel et al. 2019). The PSO algorithm leads those particles into the random spot, as discussed in PSO description section, with no potential to escape local minima. These directions can be risky for going away from the global minimum. The GWO algorithm’s scanning ability can be used to avoid these threats by directing those particles into locations which are partly enhanced by the GWO algorithm rather than randomly guided. The runtime therefore is expanded, since besides the PSO algorithm, the GWO algorithm is still used. The longer time will be considered tolerable, based on the optimization problem solved, when the results are effective and the additional time required is considered. The improved success will accommodate extra time in the sector like leather, in which losses are much more important as long as the solution is accomplished in a proper time. The flowchart of this algorithm is presented in Fig. 3.

Proposed hybrid MOA–SA algorithm

For on-going optimization issues, the MOA algorithm was designed above. The new hybrid algorithm called MOA–SA is for fitness function problems of each particle in the current population and selecting the best particle and swam. Our hybrid MOA–SA algorithm was created without altering the overall operation of MOA and SA algorithms. Each MOA solution vector shall be transformed, i.e., only 0 s and 1 s and evaluated in binary form. The S-shaped transition mechanism is used for this conversion. The option of selecting a specific feature in a solution vector is given by this function. The initial temperature is the highest and will be steadily refreshed using the cooling rate to reach the final temperature, as discussed in the SA algorithm description. The algorithm begins with a solution generated at random. The method is focused on gradual changes to the existing approach. A new neighboring solution for the current solution is selected during the iteration process. The final steps are checking the iteration process and terminate the computational process and store the optimal solution. The flowchart for the new hybrid algorithm (MOA–SA) is shown in Fig. 4.

Application and results

The ability of two-phase optimized ELM models, ELM–PSOGWO and ELM–SAMOA, is investigated in monthly streamflow prediction. Various combinations of streamflow, precipitation and temperature data obtained from two stations, Pakistan, are used as model inputs. The outcomes acquired from the ELM–PSOGWO and ELM–SAMOA are compared with two single-phase optimized models, ELM–PSO, ELM–MOA, and standalone ELM. The following criteria are utilized in assessment of the employed models:

$${\text{RMSE}}:{\text{Root}}\; {\text{Mean}}\; {\text{Square}}\; {\text{Error}} = \sqrt {\frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left[ {(S_{{\text{o}}} )_{i} - (S_{{\text{c}}} )_{i} } \right]^{2} }$$

(18)

$${\text{MAE}}:{\text{Mean}}\; {\text{Absolute}}\; {\text{Error}}\; = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} |\left( {S_{{\text{o}}} } \right)_{i} - \left( {S_{{\text{c}}} } \right)_{i} |$$

(19)

$${\text{MAPE}}:{\text{Mean}} \;{\text{Abs}}.\; {\text{Percentage}} \;{\text{Error}} = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \frac{{|(S_{{\text{o}}} )_{i} - (S_{{\text{c}}} )_{i} |*100}}{{(S_{{\text{c}}} )_{i} }}$$

(20)

$${\text{NSE}}:{\text{Nash}} - {\text{Sutcliffe}} \;{\text{efficiency}} = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{N} \left[ {(S_{{\text{o}}} )_{i} - (S_{{\text{c}}} )_{i} } \right]^{2} }}{{\mathop \sum \nolimits_{i = 1}^{N} \left[ {(S_{{\text{o}}} )_{i} - \overline{S}_{{\text{o}}} } \right]^{2} }}, - \infty < {\text{NSE}} \le 1$$

(21)

where ${S}_{\mathrm{c}}, {S}_{\mathrm{o}}, {\overline{S}}_{\mathrm{o}}$ are calculated, observed and mean of the observed streamflows, respectively, and N is the quantity of the data. Several control parameter values were considered in model development phase, and the optimal ones were decided with respect to lowest square error (Najafzadeh and Niazmardi 2021; Najafzadeh et al. 2021). These values are listed in Table 1 for each algorithm. The table also shows the population and iteration numbers and the number of runs which are necessary to get more robust results from the meta-heuristic algorithms. Input combinations listed in Table 2 were decided taking into account the correlation analysis (autocorrelation and/or partial auto-correlation functions). As seen from Table 2, first three involve lagged streamflow values, in the inputs iv and v, precipitation data are imported and after temperature data are included in the input combinations.

Table 2 The input combinations used for model development

Full size table

Table 3 represents training and testing statistics of single-phase optimized, two-phase optimized and standalone ELM models in predicting streamflows of Padma Station. As it is clear from the table, two-phase optimized models are superior to the other models, while the standalone model has the worst results in streamflow prediction. It is also clear that the two-phase optimized ELM–SAMOA has the lowest RMSE, MAE and MAPE and the highest NSE and R² in both training (RMSE = 3448 m³/s, MAE = 2282 m³/s, NSE = 0.928, R² = 0.928, MAPE = 10.78%) and testing (RMSE = 4042 m³/s, MAE = 2413 m³/s, NSE = 0.896, R² = 0.897, MAPE = 15.24%) stages; an increase in RMSE of ELM, ELM–PSO, ELM–MOA, ELM–PSOGWO is by 31%, 27%, 19% and 14% applying the ELM-SAMOA in the test stage, respectively. All models’ outcomes reveal that importing precipitation information deteriorates the accuracy while lagged temperature inputs improve the efficiency of the single-phase optimized ELM–MOA and the two-phase optimized ELM–SAMOA models (compare the inputs iii and iv/v). The RMSE and MAE of the ELM–SAMOA decrease from 4488 and 2419 m³/s to 4042 and 2413 m³/s by 11% and 0.2%; the NSE and R² increase from 0.872 and 0.878 to 0.896 and 0.897 by 3% and 2%.

Table 3 The results of the single-phase, two-phase optimized and standalone ELM models in prediction streamflows of Padma Station

Full size table

Training and testing results of the ELM-based models in predicting streamflows of Gorai Station are compared in Table 4. Here, also the superiority of the two-phase optimized ELM models over other models and the standalone model has the last rank in accuracy. The ELM-SAMOA with third input combination produced lower RMSE (413.1 m³/s), MAE (240.4 m³/s), MAPE (18.94%) and higher NSE (0.855) and R² (0.857) than those of the best ELM with input iii (RMSE = 531.3 m³/s, MAE = 289.6 m³/s, NSE = 0.760, R² = 0.762, MAPE = 27.29%), ELM–PSO with input vii (RMSE = 523.38 m³/s, MAE = 281.17 m³/s, NSE = 0.767, R² = 0.788, MAPE = 27.15%), ELM–MOA with input iv (RMSE = 492.8 m³/s, MAE = 276.2 m³/s, NSE = 0.793, R² = 0.810, MAPE = 23.68%) and ELM–PSOGWO with input iv (RMSE = 470.5 m³/s, MAE = 275.1 m³/s, NSE = 0.811, R² = 0.819, MAPE = 23.17%) in the testing stage. By applying the two-phase optimized ELM–SAMOA, the RMSE was improved by 29%, 27%, 19% and 14% compared to ELM, ELM–PSO, ELM–MOA and ELM–PSOGWO, respectively. On the contrary to Padma Station, here adding precipitation information into the inputs improves the accuracy of the ELM–MOA (RMSE from 521.5 to 482.8 m³/s, MAE from 281.17 to 276.2 m³/s, NSE from 0.767 to 0.793 and R² from 0.788 to 0.810) and ELM–PSOGWO (RMSE from 600.6 to 470.5 m³/s, MAE from 324.7 to 275.1 m³/s, NSE from 0.693 to 0.811 and R² from 0.812 to 0.819). Importing temperature input slightly improves efficiency of ELM–PSO; from input combination iii to vii, the RMSE, MAE, NSE and R² were improved by 0.6%, 6.9%, 0.4% and 3% in the testing stage, respectively.

Table 4 The results of the single-phase, two-phase optimized and standalone ELM models in prediction streamflows of Gorai Station

Full size table

Figures 5 and 6 illustrate the time variation diagrams of the observed and predicted streamflows by the best ELM-based models for the Padma and Gorai stations, respectively. From the figures, the two-phase optimized ELM-SAMOA model appears to be better simulate the streamflows compared to other models while the standalone ELM cannot adequately catch the observed values in both stations. The scatter diagrams of the streamflow predictions are compared in Figs. 7 and 8 for two stations. From these graphs, it is also apparent that the ELM-SAMOA offers better accuracy with less scattered estimates compared to other alternatives. The implemented models are further compared on Taylor and violin diagrams in Figs. 9 and 10 for the Padma and Gorai stations. It is apparent from the Taylor graphs that the ELM–SAMOA has the closest standard deviation to observed one with the highest correlation and the lowest square error and it is followed by the other two-phase optimized ELM–PSOGW, single-phase optimized ELM–MOA, ELM–PSO and standard ELM models, respectively. It is clear from the violin diagrams that the distribution of the ELM–SAMOA predictions is closer to the observed one while the standard ELM model has the most different distribution. All the graphs justify the testing statistics provided in Tables 3 and 4 that the ELM–SAMOA acted better than the other models in prediction of monthly streamflows.

Table 5 compares the single-phase, two-phase optimized and standalone ELM models with respect to t-test for both stations. In the table, the statistics were calculated according to the significance level of 5% (two-tailed test). Higher t-statistics (t-stat) than the critical one shows that there is no significant difference in mean between the computed and observed data. The model having higher t-stat indicates better robustness. It is clearly seen from Table 5 that the two-phase optimized ELM–SAMOA has higher statistics compared to the other models in Padma and Gorai stations.

Table 5 t-test of single-phase, two-phase optimized and standalone ELM models for both stations using best input combination results

Full size table

Two-phase optimized ELM models were also compared with the single-phase optimized and standalone ELM models in estimating streamflows of Gorai Station (downstream) using data of Padma Station (upstream). Same input combinations were taken into account, and the model outcomes are listed in Table 6 with respect to RMSE, MAE, NSE, R² and MAPE. In this application, also two-phase optimized ELM models perform superior to the single-phase optimized and standalone ELM models; however, the difference between the ELMSAMOA and ELM–PSO is marginal. The best two-phase optimized ELM–PSOGWO model with input ii has lower RMSE (511.9 m³/s) and higher NSE (0.777) than those of the best ELM with input iv (RMSE = 612.1 m³/s, NSE = 0.681), ELM–PSO with input v (RMSE = 538 m³/s, NSE = 0.753), ELM–MOA with input ii (RMSE = 543.4 m³/s, NSE = 0.749) and ELM-SAMOA with input iv (RMSE = 535.7 m³/s, NSE = 0.756) in the testing stage. Implementing the ELM–PSOGWO improved the RMSE of the best ELM, ELM–PSO, ELM–MOA and ELM–SAMOA by 20%, 5.1%, 6.2% and 4.6% in the testing stage, respectively. By including precipitation inputs, the accuracy of standalone ELM (input iv), (ELM–PSO (input v) and ELM–SAMOA (input iv) was improved, whereas the temperature data did not increase the efficiency of the models in estimating Gorai’s streamflows utilizing data of upstream station. Time variation and scatter diagrams of the estimated streamflows by different ELM-based models are illustrated in Figs. 11 and 12. It is clear from the visual comparisons; two-phase optimized ELM models have closer streamflow estimates to the observed values, and their scatters are less compared to single-phase optimized and standalone ELM models. This part of study is very useful especially for the basins having missing streamflow data. In such basins, streamflow can be easily estimated using external (e.g., upstream) data.

Table 6 The results of the single-phase, two-phase optimized and standalone ELM models in estimation of Gorai’s streamflow using data of Padma Station (Upstream)

Full size table

Overall, two-phase optimized ELM models, ELM–PSOGWO and ELM–SAMOA models, perform superior to the single-phase and standalone ELM models in streamflow prediction. Among the two-phase optimized ELM models, the ELM–SAMOA acted better than the other. In the second application, the ELM–PSOGWO offered better efficiency compared to other ELM-based models. The main advantages of the two-phase optimization approaches are improvement in exploration and exploitation abilities of single-phase metaheuristic algorithms. Because for a robust and generalizable optimization algorithm is to balance the ability of exploitation and exploration efficiently in order to find best solution/parameters of a machine learning model. Results of this study also endorsed the effectiveness of PSOGWO- and SAMOA-based ELM models by improving exploration and exploitation capabilities.

Concluding remarks

The ability of two-phase optimized ELM models was investigated in monthly streamflow prediction using lagged streamflow, precipitation and temperature data as inputs. The outcomes were compared with the single-phase optimized and standalone ELM models. In the first application, each station’s streamflows were predicted using local data while in the second application, streamflows of one station were estimated using other station data. The following conclusions were reached from the benchmark outcomes.

Based on the RMSE, MAE, NSE, R² and MAPE criteria and graphical methods (e.g., time variation, scatter, Taylor and violin diagrams), the two-phase optimized ELM–SAMOA offered the best accuracy in prediction of monthly streamflows using local data; improvement in RMSE of ELM, ELM–PSO, ELM–MOA and ELM–PSOGWO is by 31%, 27%, 19% and 14% for one station and 29%, 27%, 19% and 14% for other station, in the testing stage, respectively.
In the second application, the two-phase optimized ELM–PSOGWO acted as the best model in streamflow estimation with external data; improvement in RMSE of ELM, ELM–PSO, ELM–MOA and ELM–SAMOA is by 20%, 5.1%, 6.2% and 4.6% in the testing stage, respectively.
The outcomes suggested the use of two-phase optimization compared to single-phase one while the standalone ELM model provided the worst efficiency.

References

Abdel-Basset M, Ding W, El-Shahat D (2021) A hybrid Harris Hawks optimization algorithm with simulated annealing for feature selection. Artif Intell Rev. https://doi.org/10.1007/s10462-020-09860-3
Article Google Scholar
Adnan RM, Liang Z, Heddam S, Zounemat-Kermani M, Kisi O, Li B (2020a) Least square support vector machine and multivariate adaptive regression splines for streamflow prediction in mountainous basin using hydro-meteorological data as inputs. J Hydrol 586:124371
Article Google Scholar
Adnan RM, Zounemat-Kermani M, Kuriqi A, Kisi O (2020) Machine learning method in prediction streamflow considering periodicity component. Intelligent data analytics for decision-support systems in hazard mitigation. Springer, Sipngapore, pp 383–403
Google Scholar
Akhter S, Eibek KU, Islam S, Islam ARMT, Shen S, Chu R (2019) Predicting spatiotemporal changes of channel morphology in the reach of Teesta River, Bangladesh using GIS and ARIMA modeling. Quat Int 513:80–94. https://doi.org/10.1016/j.quaint.2019.01.022
Article Google Scholar
Barzkar A, Najafzadeh M, Homaei F (2022) Evaluation of drought events in various climatic conditions using data-driven models and a reliability-based probabilistic model. Nat Hazards 110(3):1931–1952
Article Google Scholar
Bazargan J, Norouzi H (2018) Investigation the effect of using variable values for the parameters of the linear Muskingum method using the particle swarm algorithm (PSO). Water Resour Manag 32:4763–4777
Article Google Scholar
Ben Messaoud R (2020) Extraction of uncertain parameters of single-diode model of a photovoltaic panel using simulated annealing optimization. Energy Rep 6:350–357. https://doi.org/10.1016/j.egyr.2020.01.016
Article Google Scholar
Boulariah O, Meddi M, Longobardi A (2019) Assessment of prediction performances of stochastic and conceptual hydrological models: monthly stream flow prediction in northwestern Algeria. Arab J Geosci 12:1–14
Article Google Scholar
CDMP (Comprehensive Disaster Management Programme) (2014) Trend and impact analysis of internal displacement due to the impacts of disaster and climate change. Study Report, Ministry of Disaster Management and Relief, Dhaka
Chen J, Shi J (2019) A multi-compartment vehicle routing problem with time windows for urban distribution—a comparison study on particle swarm optimization algorithms. Comput Ind Eng 133:95–106. https://doi.org/10.1016/j.cie.2019.05.008
Article Google Scholar
Chen H, Fan DL, Fang L, Huang W, Huang J, Cao C, Yang L, He Y, Zeng L (2020) Particle swarm optimization algorithm with mutation operator for particle filter noise reduction in mechanical fault diagnosis. Int J Pattern Recognit Artif Intell. https://doi.org/10.1142/S0218001420580124
Article Google Scholar
Dehghani M, Riahi-Madvar H, Hooshyaripor F, Mosavi A, Shamshirband S, Zavadskas EK, Chau K-w (2019) Prediction of hydropower generation using Grey wolf optimization adaptive neuro-fuzzy inference system. Energies 12:1–20. https://doi.org/10.3390/en12020289
Article Google Scholar
Dewan A, Corner R, Saleem A, Rahman MM, Haider MR, Rahman MM, Sarker MH (2017) Assessing channel changes of the Ganges–Padma River system in Bangladesh using Landsat and hydrological data. Geomorphology 276:257–279. https://doi.org/10.1016/j.geomorph.2016.10.017
Article Google Scholar
Elsheikh AH, Abd Elaziz M (2019) Review on applications of particle swarm optimization in solar energy systems. Int J Environ Sci Technol 16:1159–1170. https://doi.org/10.1007/s13762-018-1970-x
Article Google Scholar
Fadaee M, Mahdavi-Meymand A, Zounemat-Kermani M (2020) Suspended sediment prediction using integrative soft computing models: on the analogy between the butterfly optimization and genetic algorithms. Geocarto Int 37:961–977
Article Google Scholar
Fan J, Yue W, Wu L, Zhang F, Cai H, Wang X, Lu X, Xiang Y (2018) Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily reference evapotranspiration using limited meteorological data in different climates of China. Agric for Meteorol 263:225–241. https://doi.org/10.1016/j.agrformet.2018.08.019
Article Google Scholar
Fu M, Fan T, Za Ding, Salih SQ, Al-Ansari N, Yaseen ZM (2020) Deep learning data-intelligence model based on adjusted forecasting window scale: application in daily streamflow simulation. IEEE Access 8:32632–32651
Article Google Scholar
Gao ZM, Zhao J, Li SR, Hu YR (2020) The improved mayfly optimization algorithm. J Phys Conf Ser. https://doi.org/10.1088/1742-6596/1684/1/012077
Article Google Scholar
Granata F, Di Nunno F, Najafzadeh M, Demir I (2022) A stacked machine learning algorithm for multi-step ahead prediction of soil moisture. Hydrology 10(1):1
Article Google Scholar
Haddad OB, Afshar A, Mariño MA (2006) Honey-bees mating optimization (HBMO) algorithm: a new heuristic approach for water resources optimization. Water Resour Manag 20:661–680. https://doi.org/10.1007/s11269-005-9001-3
Article Google Scholar
Himanshu N, Kumar V, Burman A, Maity D, Gordan B (2020) Grey wolf optimization approach for searching critical failure surface in soil slopes. Eng Comput. https://doi.org/10.1007/s00366-019-00927-6
Article Google Scholar
Hosseini FS, Choubin B, Mosavi A, Nabipour N, Shamshirband S, Darabi H, Haghighi AT (2020) Flash-flood hazard assessment using ensembles and Bayesian-based machine learning models: application of the simulated annealing feature selection method. Sci Tot Environ 711:135161
Article Google Scholar
Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and applications. Neurocomputing 70:489–501. https://doi.org/10.1016/j.neucom.2005.12.126
Article Google Scholar
Ikram RMA, Mostafa RR, Chen Z, Parmar KS, Kisi O, Zounemat-Kermani M (2023) Water temperature prediction using improved deep learning methods through reptile search algorithm and weighted mean of vectors optimizer. J Mar Sci Eng 11(2):259
Article Google Scholar
Ikram RMA, Hazarika BB, Gupta D, Heddam S, Kisi O (2022) Streamflow prediction in mountainous region using new machine learning and data preprocessing methods: a case study. Neural Comput Appl 1–18
Islam ARMT (2016) Assessment of fluvial channel dynamics of padma river in northwestern Bangladesh. Univ J Geosci 4:41–49. https://doi.org/10.13189/ujg.2016.040204
Article Google Scholar
Islam ARMT, Sein ZMM, Ongoma V et al (2016) Geomorphological and land use mapping: A case study of Ishwardi under Pabna district, Bangladesh. Adv Res 4(6):378–387. https://doi.org/10.9734/AIR/2015/14149
Article Google Scholar
Islam ARMT, Talukdar S, Mahato S et al (2021) Machine learning algorithm-based risk assessment of riparian wetlands in Padma River Basin of Northwest Bangladesh. Environ Sci Pollut Res. https://doi.org/10.1007/s11356-021-12806-z
Article Google Scholar
Jiang Y, Bao X, Hao S, Zhao H, Li X, Wu X (2020) Monthly streamflow forecasting using ELM-IPSO based on phase space reconstruction. Water Resour Manag 34:3515–3531
Article Google Scholar
Liu Y et al (2015) Analyzing effects of climate change on streamflow in a glacier mountain catchment using an ARMA model. Quatern Int 358:137–145
Article Google Scholar
Liu B, Wang R, Zhao G, Guo X, Wang Y, Li J, Wang S (2020) Prediction of rock mass parameters in the TBM tunnel based on BP neural network integrated simulated annealing algorithm. Tunn Undergr Space Technol 95:1–12. https://doi.org/10.1016/j.tust.2019.103103
Article Google Scholar
Malik A, Tikhamarine Y, Souag-Gamane D, Kisi O, Pham QB (2020) Support vector regression optimized by meta-heuristic algorithms for daily streamflow prediction. Stoch Env Res Risk Assess 34:1755–1773
Article Google Scholar
Mansouri N, Mohammad HZB, Javidi MM (2019) Hybrid task scheduling strategy for cloud computing by modified particle swarm optimization and fuzzy theory. Comput Ind Eng 130:597–633. https://doi.org/10.1016/j.cie.2019.03.006
Article Google Scholar
Meng X, Fu Y, Yuan J (2020) Estimating solubilities of ternary water-salt systems using simulated annealing algorithm based generalized regression neural network. Fluid Phase Equilib 505:112357. https://doi.org/10.1016/j.fluid.2019.112357
Article Google Scholar
Mirjalili S, Mirjalili SM, Lewis A (2014) Grey wolf optimizer. Adv Eng Softw 69:46–61
Article Google Scholar
Mirza MMQ (2004) The Ganges water diversion: environmental effects andimplications-an introduction. In: Mirza MMQ (ed) The Ganges waterdiversion: environmental effects and implications. Kluwer Academic Publishers, Dordrecht, The Netherlands, pp 1–12
Chapter Google Scholar
Mohammadi B, Guan Y, Aghelpour P, Emamgholizadeh S, Zolá RP, Zhang D (2020) Simulation of titicaca lake water level fluctuations using hybrid machine learning technique integrated with grey wolf optimizer algorithm. Water (Switzerland) 12:1–18. https://doi.org/10.3390/w12113015
Article Google Scholar
Moors EJ, Groot A, Biemans H et al (2011) Adaptation to changing water resources in the Ganges basin, northern India. Environ Sci Policy 14:758–769
Article Google Scholar
Mostafa RR, Kisi O, Adnan RM, Sadeghifar T, Kuriqi A (2023) Modeling potential evapotranspiration by improved machine learning methods using limited climatic data. Water 15(3):486
Article Google Scholar
Najafzadeh M, Niazmardi S (2021) A novel multiple-kernel support vector regression algorithm for estimation of water quality parameters. Nat Resour Res 30(5):3761–3775
Article Google Scholar
Najafzadeh M, Homaei F, Farhadi H (2021) Reliability assessment of water quality index based on guidelines of national sanitation foundation in natural streams: Integration of remote sensing and data-driven models. Artif Intell Rev 54(6):4619–4651
Article Google Scholar
Niu W-j, Feng Z-k, Chen Y-b, Zhang H-r, Cheng C-t (2020) Annual streamflow time series prediction using extreme learning machine based on gravitational search algorithm and variational mode decomposition. J Hydrol Eng 25:04020008
Article Google Scholar
Poul AK, Shourian M, Ebrahimi H (2019) A comparative study of MLR, KNN, ANN and ANFIS models with wavelet transform in monthly stream flow prediction. Water Resour Manag 33:2907–2923
Article Google Scholar
Prasad R, Deo RC, Li Y, Maraseni T (2018) Soil moisture forecasting by a hybrid machine learning technique: ELM integrated with ensemble empirical mode decomposition. Geoderma 330:136–161. https://doi.org/10.1016/j.geoderma.2018.05.035
Article Google Scholar
Qi C, Fourie A, Chen Q (2018) Neural network and particle swarm optimization for predicting the unconfined compressive strength of cemented paste backfill. Constr Build Mater 159:473–478. https://doi.org/10.1016/j.conbuildmat.2017.11.006
Article Google Scholar
Redi AANP, Jewpanya P, Kurniawan AC, Persada SF, Nadlifatin R, Dewi OAC (2020) A simulated annealing algorithm for solving two-echelon vehicle routing problem with locker facilities. Algorithms 13:1–14. https://doi.org/10.3390/a13090218
Article Google Scholar
Rezaie-Balf M, Kisi O (2018) New formulation for forecasting streamflow: evolutionary polynomial regression vs. extreme learning machine. Hydrol Res 49:939–953
Article Google Scholar
Riahi-Madvar H, Dehghani M, Seifi A, Salwana E, Shamshirband S, Mosavi A, Chau K-w (2019) Comparative analysis of soft computing techniques RBF MLP, and ANFIS with MLR and MNLR for predicting grade-control scour hole geometry. Eng Appl Comput Fluid Mech 13:529–550
Google Scholar
Saberi-Movahed F, Najafzadeh M, Mehrpooya A (2020) Receiving more accurate predictions for longitudinal dispersion coefficients in water pipelines: training group method of data handling using extreme learning machine conceptions. Water Resour Manage 34(2):529–561
Article Google Scholar
Şenel FA, Gökçe F, Yüksel AS, Yiğit T (2019) A novel hybrid PSO–GWO algorithm for optimization problems. Eng Comput 35:1359–1373. https://doi.org/10.1007/s00366-018-0668-5
Article Google Scholar
Shao W, Zuo Y (2020) Computing the halfspace depth with multiple try algorithm and simulated annealing algorithm. Comput Stat 35:203–226. https://doi.org/10.1007/s00180-019-00906-x
Article Google Scholar
Silva BH, Machado IM, Pereira FM, Pagot PR, França FHR (2020) Application of the simulated annealing algorithm to the correlated WMP radiation model for flames. Inverse Probl Sci Eng 28:1345–1360. https://doi.org/10.1080/17415977.2020.1732956
Article Google Scholar
Sudheer C, Maheswaran R, Panigrahi BK, Mathur S (2014) A hybrid SVM–PSO model for forecasting monthly streamflow. Neural Comput Appl 24:1381–1389
Article Google Scholar
Suiju L, Minquan F (2015) Three-dimensional numerical simulation of flow in Daliushu reach of the Yellow River. Int J Heat Technol 33:107–114
Article Google Scholar
Tang S, Peng M, Xia G, Wang G, Zhou C (2020) Optimization design for supercritical carbon dioxide compressor based on simulated annealing algorithm. Ann Nucl Energy 140:107107. https://doi.org/10.1016/j.anucene.2019.107107
Article Google Scholar
Tikhamarine Y, Souag-Gamane D, Kisi O (2019) A new intelligent method for monthly streamflow prediction: hybrid wavelet support vector regression based on grey wolf optimizer (WSVR–GWO). Arab J Geosci. https://doi.org/10.1007/s12517-019-4697-1
Article Google Scholar
Tikhamarine Y, Souag-Gamane D, Najah Ahmed A, Kisi O, El-Shafie A (2020) Improving artificial intelligence models accuracy for monthly streamflow forecasting using grey Wolf optimization (GWO) algorithm. J Hydrol 582:124435. https://doi.org/10.1016/j.jhydrol.2019.124435
Article Google Scholar
Tufano A, Accorsi R, Manzini R (2020) A simulated annealing algorithm for the allocation of production resources in the food catering industry. Br Food J 122:2139–2158. https://doi.org/10.1108/BFJ-08-2019-0642
Article Google Scholar
Turhan AM, Bilgen B (2020) A hybrid fix-and-optimize and simulated annealing approaches for nurse rostering problem. Comput Ind Eng 145:106531. https://doi.org/10.1016/j.cie.2020.106531
Article Google Scholar
Wang D, Tan D, Liu L (2018) Particle swarm optimization algorithm: an overview. Soft Comput 22:387–408. https://doi.org/10.1007/s00500-016-2474-6
Article Google Scholar
Xi H, Liao P, Wu X (2021) Simultaneous parametric optimization for design and operation of solvent-based post-combustion carbon capture using particle swarm optimization. Appl Therm Eng 184:116287. https://doi.org/10.1016/j.applthermaleng.2020.116287
Article Google Scholar
Yadav B, Ch S, Mathur S, Adamowski J (2017) Assessing the suitability of extreme learning machines (ELM) for groundwater level prediction. J Water l Dev 32:103–112. https://doi.org/10.1515/jwld-2017-0012
Article Google Scholar
Yaseen ZM, Faris H, Al-Ansari N (2020) Hybridized extreme learning machine model with salp swarm algorithm: a novel predictive model for hydrological application. Complexity 2020:1–14
Article Google Scholar
Yu S, Lu H (2018) An integrated model of water resources optimization allocation based on projection pursuit model—Grey wolf optimization method in a transboundary river basin. J Hydrol 559:156–165. https://doi.org/10.1016/j.jhydrol.2018.02.033
Article Google Scholar
Zervoudakis K, Tsafarakis S (2020) A mayfly optimization algorithm. Comput Ind Eng. https://doi.org/10.1016/j.cie.2020.106559
Article Google Scholar
Zhang L, Chen X, Zhang Y, Wu F, Chen F, Wang W, Guo F (2020) Application of GWO–ELM model to prediction of caojiatuo landslide displacement in the three gorge reservoir area. Water (Switz). https://doi.org/10.3390/w12071860
Article Google Scholar
Zhao S, Dong J, Monte C, Sun X, Zhang W (2020) New phase function development and complete spectral radiative properties measurements of aerogel infused fibrous blanket based on simulated annealing algorithm. Int J Therm Sci 154:106407. https://doi.org/10.1016/j.ijthermalsci.2020.106407
Article Google Scholar
Zhou H, Pang J, Chen PK, Chou FD (2018) A modified particle swarm optimization algorithm for a batch-processing machine scheduling problem with arbitrary release times and non-identical job sizes. Comput Ind Eng 123:67–81. https://doi.org/10.1016/j.cie.2018.06.018
Article Google Scholar
Zhu S, Heddam S, Wu S, Dai J, Jia B (2019) Extreme learning machine-based prediction of daily water temperature for rivers. Environ Earth Sci 78:1–17. https://doi.org/10.1007/s12665-019-8202-7
Article Google Scholar
Zhu B, Feng Y, Gong D, Jiang S, Zhao L, Cui N (2020) Hybrid particle swarm optimization with extreme learning machine for daily reference evapotranspiration prediction from limited climatic data. Comput Electron Agric 173:105430. https://doi.org/10.1016/j.compag.2020.105430
Article Google Scholar
Zounemat-Kermani M, Matta E, Cominola A, Xia X, Zhang Q, Liang Q, Hinkelmann R (2020) Neurocomputing in surface water hydrology and hydraulics: a review of two decades retrospective, current status and future prospects. J Hydrol 588:125085
Article Google Scholar
Zounemat-Kermani M, Alizamir M, Fadaee M, Sankaran Namboothiri A, Shiri J (2020) Online sequential extreme learning machine in river water quality (turbidity) prediction: a comparative study on different data mining approaches. Water Environ J 35:355
Google Scholar

Download references

Funding

The authors would also like to express their sincere appreciation to the associate editor and the anonymous reviewers for their comments and suggestions. This work was supported by the National Social Science Foundation of China (18BTJ029), Key Projects of National Statistical Science Research Projects (2020LZ10), and Tertiary Education Scientific Research Project of Guangzhou Municipal Education Bureau (202235324). The APC was funded by the Postdoctoral Start-up Research Fund of Guangzhou University.

Author information

Authors and Affiliations

School of Economics and Statistics, Guangzhou University, Guangzhou, 510006, China
Rana Muhammad Adnan & Hong-Liang Dai
Information Systems Department, Faculty of Computers and Information Sciences, Mansoura University, Mansoura, 35516, Egypt
Reham R. Mostafa
Department of Disaster Management, Begum Rokeya University, Rangpur, 5400, Bangladesh
Abu Reza Md. Towfiqul Islam
Department of Civil Engineering, Lübeck University of Applied Science, 23562, Lübeck, Germany
Ozgur Kisi
Civil Engineering Department, Ilia State University, 0162, Tbilisi, Georgia
Ozgur Kisi
Agricultural Engineering Department, Faculty of Agriculture, Mansoura University, Mansoura, 35516, Egypt
Ahmed Elbeltagi
Department of Water Engineering, Shahid Bahonar University of Kerman, Kerman, Iran
Mohammad Zounemat-Kermani
Department of Development Studies, Daffodil International University, Dhaka 1216, Bangladesh
Abu Reza Md. Towfiqul Islam

Authors

Rana Muhammad Adnan
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Liang Dai
View author publications
You can also search for this author in PubMed Google Scholar
Reham R. Mostafa
View author publications
You can also search for this author in PubMed Google Scholar
Abu Reza Md. Towfiqul Islam
View author publications
You can also search for this author in PubMed Google Scholar
Ozgur Kisi
View author publications
You can also search for this author in PubMed Google Scholar
Ahmed Elbeltagi
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Zounemat-Kermani
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors have equally made contributions. All authors read and approved the original manuscript.

Corresponding authors

Correspondence to Hong-Liang Dai or Ozgur Kisi.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Adnan, R.M., Dai, HL., Mostafa, R.R. et al. Application of novel binary optimized machine learning models for monthly streamflow prediction. Appl Water Sci 13, 110 (2023). https://doi.org/10.1007/s13201-023-01913-6

Download citation

Received: 01 August 2022
Accepted: 20 March 2023
Published: 08 April 2023
DOI: https://doi.org/10.1007/s13201-023-01913-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Application of novel binary optimized machine learning models for monthly streamflow prediction

Abstract

Similar content being viewed by others

Enhancing accuracy of extreme learning machine in predicting river flow using improved reptile search algorithm

Integrated metaheuristic algorithms with extreme learning machine models for river streamflow prediction

Urmia lake water depth modeling using extreme learning machine-improved grey wolf optimizer hybrid algorithm

Introduction

Case study