1 Introduction

Sediment dynamics (transport and deposition) can cause environmental issues and concerns such as damage to aquatic ecosystems, declining quality of surface water and groundwater, and variations in reservoir recharge and river morphology (Afan et al. 2016; Shojaeezadeh et al. 2018; Gholami et al. 2016; Guo et al. 2020; Ren et al. 2020). Suspended sediment load (SSL) in watersheds is one of the most important hydraulic and hydrological parameters, as it can impact the performance of hydraulic structures and water transfer projects. Additionally, sediments transported to reservoirs can reduce reservoir capacity and affect operational policy, e.g., water supply, energy generation, and irrigation. Therefore, the estimation and prediction of SSL in rivers are vital tasks in water resources management, and accurate results would support decision-making on river engineering, reservoir operation, watershed management, and sustainable water resources (Yang et al. 2009; Downs et al. 2009; Akrami et al. 2013; Himanshu et al. 2017; Haghighi et al. 2019). Prediction of daily SSL can support optimal decisions on dam outlet operation during floods and on conveying part of the suspended sediment load to downstream areas. Addressing short-term and long-term sediment dynamics is challenging owing to the heterogeneity of basins, the uncertainty in hydrological parameters, and the stochastic nature of flow and of sediment transport and deposition processes (Malmon et al. 2002; Pirnia et al. 2019b; Pizzuto 2020). Imprecise SSL modeling and prediction can reduce the amount of water stored by dam reservoirs, which can have an enormous negative impact on domestic and agricultural water supply, and also on dam structures (Lafdani et al. 2013; McCarney-Castle et al. 2017; Zhang et al. 2020; Zhao et al. 2020).

During recent decades, various approaches to improve the accuracy of SSL predictions have been introduced, including numerical and hydraulic models, distributed and lumped models, statistical and empirical models, and machine learning models (Bezak et al. 2014; Merkhali et al. 2015; Kumar et al. 2016; Shamaei and Kaedi 2016; Choubin et al. 2018). Some studies have predicted SSL at the daily scale using data-driven methods such as machine learning algorithms and soft computing models (Nourani and Andalib 2015; Choubin et al. 2018; Kaveh et al. 2020). Other studies worldwide seeking to enhance the precision of SSL estimation have used machine learning techniques such as the adaptive neuro-fuzzy system (ANFIS) (Rajaee et al. 2009; Cobaner et al. 2009; Kisi et al. 2012; Azamathulla et al. 2012; Vafakhah 2013; Choubin et al. 2018), artificial neural network (ANN) (Rajaee et al. 2009; Melesse et al. 2011; Kisi et al. 2012; Vafakhah 2013; Nourani and Andalib 2015; Wang et al. 2018; Halecki et al. 2018; Liu et al. 2019), support vector machine (SVM) (Kisi et al. 2012; Pektaş and Doğan 2015; Choubin et al. 2018), multilayer perceptron (MLP) (Cigizoglu 2004; Gholami et al. 2016; Romano et al. 2018), and radial basis function neural network (RBFNN) (Erol et al. 2008; Ahmad and Kumar 2016; Ibrahim et al. 2019). Soft computing models have been widely applied for predicting SSL: Adib and Mahmoodi (2017) applied a genetic algorithm (GA) to optimize the structure of an ANN model predicting SSL, Talebi et al. (2017) estimated SSL using regression trees and ANN models, Salih et al. (2020) showed that an attribute-selected classifier performed better than tree models in SSL prediction, Ehteram et al. (2020) employed an ANN and a multiobjective genetic algorithm to predict SSL, and Samantary and Ghose (2020) estimated SSL using SVM, feed-forward neural network (FFN), and RBFNN models and showed that SVM had the highest performance.

Although the MLP, ANFIS, RBFNN, and SVM models have a high capability for estimating SSL, optimization of these algorithms is required to obtain more accurate results (Fiyadh et al. 2019). Classical model training algorithms, such as backpropagation and the gradient descent algorithm, may become trapped in local optima, so researchers have begun to develop new optimization algorithms (Ehteram et al. 2017). One recent example is the sine–cosine algorithm (SCA), inspired by the mathematical sine and cosine functions, which has high search accuracy, fast convergence, and stability (Mirjalili 2016). Optimization algorithms can be utilized as training algorithms to set the internal parameters of the MLP, ANFIS, and RBFNN models.

In the present study, data-based approaches and soft computing models (stand-alone and hybridized with optimization algorithms) were used for predicting SSL in the Talar river basin in northern Iran, where sediment is mostly generated during high-severity, erosive precipitation events and where complex processes govern suspended sediment and precipitation in river systems at the watershed scale. The innovation of the present study lies in new soft computing hybrid models that have previously been employed only for predicting other hydrological variables. Furthermore, the study uses soft computing models and optimization algorithms that can be linked to hydraulic and hydrological modeling, and it addresses the uncertainty of model parameters and its effect on the outcomes. ANFIS and ANN models are widely used for predicting hydrological variables given their high potential, high accuracy, and ease of learning for modelers, and their proven capability in other engineering fields motivated their selection here. The broader motivation of the study is to provide solutions for identifying the parameters needed to estimate SSL in different areas. The optimization algorithms, as described below, were selected for their high search capability, fast convergence speed, and low computational complexity. Hybridization makes the model results more accurate and better able to simulate variables in more complex problems. Since hybridized and optimized models generally perform better than individual models, soft computing models are frequently optimized or hybridized to overcome the weaknesses of stand-alone models. Nevertheless, preparing the structure of the soft computing models and selecting the best input scenario remain the main challenges of the current study.

The current study develops a low-cost estimation approach for accurately predicting SSL in developing regions where sediment loads in rivers are the main environmental concern. Specific objectives were to: (1) develop and implement optimization algorithms (SCA, particle swarm optimization (PSO), bat algorithm (BA) and firefly algorithm (FFA)) to improve model prediction of SSL; (2) investigate the capability of the four optimization methods in SSL prediction by applying widely used performance indices; and (3) compare outputs achieved using the stand-alone and hybrid ANFIS, MLP, and RBFNN models.

2 Material and methods

2.1 Study areas

This study was carried out in two case study basins, the Talar River basin in Iran and the Eagle Creek Basin in the USA, which have different climatic and environmental conditions. In the following sections, some characteristics of the two basins are presented.

2.1.1 Talar river basin

The Talar river watershed (2100 km2) is situated in Mazandaran province, northern Iran (52° 35′ 18′′–53° 23′ 35′′ E; 35° 44′ 19′′–36° 19′ 13′′ N) (Fig. 1). Based on its aridity index of 0.69 (Sahin 2012; Pirnia et al. 2019a), the regional climate is semi-humid, with 552.7 mm annual precipitation and mean annual minimum and maximum temperatures of 7.7 and 21.1 °C (Kavian et al. 2018). The smaller Kasilian river also runs through the watershed, discharging into the Caspian Sea to the north (Fig. 1). Landslides are an important sediment source to both the Talar and Kasilian river systems (Emamgholizadeh and Demneh 2019). The watershed is characterized by intense rainfall events accompanied by frequent floods (Kavian et al. 2018) and has mountainous terrain with rugged topography (altitude ranging from approximately 200 to 4000 m asl) and sparse vegetation cover in headwater areas, leading to huge sediment flows into the river network (Kavian et al. 2018). Both rivers have hydrometric stations at their outlets, from which daily observed data on rainfall, discharge, and suspended sediment concentration (SSC) were obtained for this work. The data were randomly divided into two subsets, with 80% utilized to calibrate the models and the remaining 20% utilized to test the proposed models. The maximum suspended sediment loads in the training and testing datasets were, respectively, 40,000 and 39,200 ton/day at Talar station and 60,000 and 59,000 ton/day at Kasilian station (Table 1).

Fig. 1

Location of the Talar watershed, Iran

Table 1 Characteristics and range of suspended sediment load (SSL) at gauging stations

2.1.2 Eagle Creek Basin

In addition to the Talar basin, we used our models to predict daily SSL in the Eagle Creek Basin in Indiana, USA, which has a temperate, humid continental climate (Fig. 2). The models were run using rainfall, temperature, and discharge data (Table 1) from 2015 to 2018 (data retrieved from https://www.usgs.gov/centers/oki-water). For this basin, the data were randomly divided into two subsets, with 80% utilized to calibrate the models and the remaining 20% utilized to test the proposed models.

Fig. 2

Location of Eagle Creek Basin, USA

2.2 Models tested for SSL prediction

2.2.1 Adaptive neuro-fuzzy system (ANFIS)

As an artificial neural network combined with fuzzy logical inference, ANFIS has a strong ability to deal with the imprecision and uncertainty of nonlinear environmental problems through its effective learning techniques (Chang and Lai 2014; Choubin et al. 2018). Figure 3a shows the structure of the ANFIS model, which is a rule-based system comprising three parts: a rule base, a database, and an inference system that produces the system results by combining the fuzzy rules (Yurdusev and Firat 2009). The five layers in the ANFIS model are (1) input nodes, (2) rule nodes, (3) average nodes, (4) consequent nodes, and (5) output nodes, which employ different algorithms to produce fuzzy rules for training and testing (Park et al. 2012; Choubin et al. 2018). In ANFIS, grid partitioning, fuzzy clustering, and hybrid learning algorithms are applied to determine the input data structures in combination with the backpropagation gradient descent method (Cobaner et al. 2009; Kisi et al. 2012). The ANFIS model creates the following if–then rules from the pattern of input and output data:

$$ if\left( {x = A_{1} } \right)\;{\text{and}}\;\left( {y = B_{1} } \right) \to f_{1} = p_{1} x + q_{1} y + r_{1} $$
(1)
$$ if\left( {x = A_{2} } \right)\;{\text{and}}\;\left( {y = B_{2} } \right) \to f_{2} = p_{2} x + q_{2} y + r_{2} $$
(2)

where A1, B1, A2, and B2 are the related membership functions (MFs), x and y are the inputs, and p1, q1, r1, p2, q2, and r2 are consequent parameters. The ANFIS model has five computational layers (a minimal numerical sketch follows the list):

Fig. 3

Structure of the three models used: a adaptive neuro-fuzzy system (ANFIS), b multilayer perceptron (MLP), and c radial basis function neural network (RBFNN)

  1. The first layer fuzzifies the input variables:

    $$ O_{1,i} = \mu_{{A_{i} }} \left( x \right) $$
    (3)

    where \(O_{1,i}\) is the MF of \(A_{i}\), the linguistic label of node i. In the current work, a Gaussian bell-shaped function was selected as the MF:

    $$ \mu_{{A_{i} }} \left( x \right) = \exp \left[ { - \left( {\frac{{x - c_{i} }}{{a_{i} }}} \right)^{2} } \right] $$
    (4)

    where ai and ci are premise parameters.

  2. The second layer calculates the firing strength of each rule by the product operation:

    $$ O_{2,i} = \mu_{Ai} \left( x \right) \times \mu_{Bi} \left( y \right) $$
    (5)

    where \(O_{2,i}\) is the second-layer output (the firing strength \(\omega_{i}\) of rule i) and \(\mu_{Bi} \left( y \right)\) is the fuzzy MF of fuzzy set Bi.

  3. The third layer computes the normalized firing strength of each rule:

    $$ O_{3,i} = \frac{{\omega_{i} }}{{\sum {\omega_{i} } }} = \frac{{\omega_{i} }}{{\omega_{1} + \omega_{2} }} $$
    (6)

    where \(\omega_{i}\) is the firing strength of rule i.

  4. The fourth layer determines the output of each rule:

    $$ O_{4,i} = \overline{\omega }_{i} \times f_{i} $$
    (7)

    where \(f_{i}\) is the output of the fuzzy region of rule i and \(\overline{\omega }_{i}\) is the normalized firing strength from the third layer.

  5. The fifth layer performs defuzzification:

    $$ O_{5} = \sum {\overline{\omega }_{i} } \times f_{i} $$
    (8)

    where \(O_{5}\) is the output of all the rules.
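
To make the layer-by-layer computation above concrete, the following minimal Python sketch evaluates a two-rule Sugeno-type ANFIS following Eqs. (1)–(8). The premise and consequent parameter values are hypothetical placeholders for illustration, not values fitted in this study.

```python
import numpy as np

def gaussian_mf(x, c, a):
    """Gaussian membership function (Eq. 4)."""
    return np.exp(-((x - c) / a) ** 2)

def anfis_two_rules(x, y, premise, consequent):
    """Forward pass of a two-rule Sugeno ANFIS (Eqs. 3-8)."""
    # Layer 1: fuzzification of the crisp inputs (Eqs. 3-4)
    mu_A = [gaussian_mf(x, *premise["A"][k]) for k in range(2)]
    mu_B = [gaussian_mf(y, *premise["B"][k]) for k in range(2)]
    # Layer 2: firing strength of each rule, product operator (Eq. 5)
    w = np.array([mu_A[0] * mu_B[0], mu_A[1] * mu_B[1]])
    # Layer 3: normalized firing strengths (Eq. 6)
    w_bar = w / w.sum()
    # Layer 4: linear rule consequents (Eqs. 1-2, 7)
    f = np.array([p * x + q * y + r for (p, q, r) in consequent])
    # Layer 5: weighted sum = defuzzified output (Eq. 8)
    return float(np.sum(w_bar * f))

# Hypothetical premise (center, width) and consequent (p, q, r) parameters
premise = {"A": [(0.2, 0.5), (0.8, 0.5)], "B": [(0.3, 0.4), (0.7, 0.4)]}
consequent = [(1.0, 0.5, 0.1), (0.4, 1.2, -0.2)]
print(anfis_two_rules(0.6, 0.4, premise, consequent))
```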

2.2.2 Multilayer perceptron (MLP)

The MLP network is a model with one or more hidden layers that maps a set of inputs onto a set of suitable outputs (Choubin et al. 2018). In MLP (Fig. 3b), the major learning rule is the backpropagation algorithm, which comprises two stages, a feed-forward stage and a backward stage, with external input information propagated forward and the error between calculated and measured signals propagated backward from the output (Cigizoglu 2004). According to Kim and Valdes (2003), the MLP network can simulate 90% of processes related to environmental and natural problems. The MLP model employed in the present study was a three-layer learning network having an input, a hidden, and an output layer (Samanta et al. 2019; Bhowmik et al. 2019; Van Dao et al. 2020). The neurons in the hidden layer use a nonlinear activation function to provide the output as follows:

$$ u_{j} = \sum\limits_{i = 1}^{p} {w_{ij} } .x_{i} $$
(9)
$$ x_{j} = \varphi \left( {u_{j} - \theta_{j} } \right) $$
(10)

where \(x_{i}\) is an input, \(x_{j}\) is the output of neuron j, \(u_{j}\) is the weighted sum of the inputs, \(\varphi\) is the activation function, and \(\theta_{j}\) is a threshold value. Previous researchers have successfully utilized the logistic sigmoid function for the MLP model:

$$ \varphi_{x} = \frac{1}{{1 + e^{ - x} }} $$
(11)

Training algorithms are introduced to search for the optimum values of the weight connections. Classical training algorithms such as the backpropagation and gradient descent algorithms are widely applied to calibrate the MLP parameters.
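
As a minimal sketch of the forward pass in Eqs. (9)–(11): the network size and the randomly drawn weights and thresholds below are hypothetical stand-ins for values a training algorithm would calibrate.

```python
import numpy as np

def sigmoid(x):
    """Logistic sigmoid activation (Eq. 11)."""
    return 1.0 / (1.0 + np.exp(-x))

def mlp_forward(x, W_hidden, theta_hidden, W_out, theta_out):
    """Single-hidden-layer MLP forward pass (Eqs. 9-10)."""
    u = W_hidden @ x                    # weighted sums (Eq. 9)
    h = sigmoid(u - theta_hidden)       # hidden-layer outputs (Eq. 10)
    return sigmoid(W_out @ h - theta_out)

# Hypothetical weights/thresholds for a 3-input, 4-hidden, 1-output network
rng = np.random.default_rng(0)
x = np.array([0.5, 0.2, 0.9])          # e.g., scaled lagged R, Q, SSL
W_h, th_h = rng.normal(size=(4, 3)), rng.normal(size=4)
W_o, th_o = rng.normal(size=(1, 4)), rng.normal(size=1)
print(mlp_forward(x, W_h, th_h, W_o, th_o))
```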

2.2.3 Radial basis function neural network (RBFNN)

The RBFNN model is a type of feed-forward neural network consisting of a number of artificial neurons (see Fig. 3c). It can be considered a general-purpose network that can be employed in different fields to achieve accurate predictions, and it is a good candidate for problem solving owing to its fast learning potential (Erol et al. 2008; Han et al. 2012; Kong et al. 2016; Ibrahim et al. 2019). RBFNN relies on powerful mathematical functions and has been connected to deep learning theory in problem solving (Sabour and Movahed 2017). In practical application, the learning algorithm for the RBFNN model employs different datasets for training and testing, so that it adapts rapidly to new factors or combinations (Sabour and Movahed 2017). RBFNN has the advantage over other types of neural networks of having a clustering stage in training and testing (Singh et al. 2014; Kumar et al. 2016). It uses radially symmetric basis functions as activation functions:

$$ \phi_{i} \left( x \right) = \exp \left( { - \frac{{\left\| {x - c_{i} } \right\|^{2} }}{{\sigma_{i}^{2} }}} \right) $$
(12)

where \(\phi_{i} \left( x \right)\) is the Gaussian function, \(\sigma_{i}\) is the width of the ith radial basis function node, and \(c_{i}\) is the center of hidden neuron i. The network output is computed as follows:

$$ y = \sum\limits_{i = 0}^{n} {\phi_{i} } w_{i} $$
(13)

where y is the output and n is the number of hidden neurons.

Training algorithms are used to set the RBFNN parameters, namely the center, width, and weight of each radial basis function node.
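
A minimal sketch of the RBFNN output computation (Eqs. 12–13) is given below; the centers, widths, and weights are hypothetical placeholders for the quantities a training algorithm would set.

```python
import numpy as np

def rbf_output(x, centers, widths, weights):
    """RBFNN output (Eqs. 12-13): Gaussian basis activations
    combined linearly at the output layer."""
    # Eq. 12: Gaussian activation of each hidden neuron
    d2 = np.sum((centers - x) ** 2, axis=1)   # squared distances ||x - c_i||^2
    phi = np.exp(-d2 / widths ** 2)
    # Eq. 13: weighted sum over hidden neurons
    return float(phi @ weights)

# Hypothetical parameters for a 2-input, 3-neuron network
centers = np.array([[0.1, 0.2], [0.5, 0.5], [0.9, 0.8]])
widths = np.array([0.3, 0.3, 0.3])
weights = np.array([0.7, -0.2, 1.1])
print(rbf_output(np.array([0.4, 0.6]), centers, widths, weights))
```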

2.3 Optimization algorithms tested

2.3.1 Sine–cosine algorithm (SCA)

The SCA approach was first proposed by Mirjalili (2016). It updates the position of solutions using sine and cosine functions. The mathematical formulation of SCA is:

$$ X_{i}^{t + 1} = X_{i}^{t} + r_{1} \times \left( {\sin \left( {r_{2} } \right)} \right) \times \left| {r_{3} p_{i}^{t} - x_{i}^{t} } \right| $$
(14)
$$ X_{i}^{t + 1} = X_{i}^{t} + r_{1} \times \left( {\cos \left( {r_{2} } \right)} \right) \times \left| {r_{3} p_{i}^{t} - x_{i}^{t} } \right| $$
(15)

where \(X_{i}^{t}\) is the position of the current solution in the ith dimension at iteration t, r2 and r3 are random values, \(p_{i}^{t}\) is the position of the destination (best) solution, and r1 is a control parameter used to balance exploration and exploitation. The two SCA functions (Eqs. 14, 15) are then combined into one update rule:

$$ X_{i}^{t + 1} = \left\{ \begin{gathered} X_{i}^{t} + r_{1} \times \left( {\sin \left( {r_{2} } \right)} \right) \times \left| {r_{3} p_{i}^{t} - x_{i}^{t} } \right|\quad {\text{if}}\;r_{4} < 0.5 \hfill \\ X_{i}^{t} + r_{1} \times \left( {\cos \left( {r_{2} } \right)} \right) \times \left| {r_{3} p_{i}^{t} - x_{i}^{t} } \right|\quad {\text{if}}\;r_{4} \ge 0.5 \hfill \\ \end{gathered} \right. $$
(16)

The following equation is used to update the value of parameter r1:

$$ r_{1} = a - t\frac{a}{T} $$
(17)

where a is a constant and T is the maximum number of iterations. Parameter r2 determines the movement direction of the next solution. Parameter r3 defines a random weight for the destination, stochastically emphasizing (r3 > 1) or de-emphasizing (r3 < 1) the distance to it. Parameter r4 switches between the sine and cosine functions. Figure 4 shows the sine–cosine effect on the next position and Fig. 5 shows a flowchart of SCA (a minimal update sketch follows the figures).

Fig. 4

Sine–cosine effect on the next position in sine–cosine algorithm (SCA)

Fig. 5

Flowchart of optimization using sine–cosine algorithm (SCA)
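
A minimal sketch of the SCA position update (Eqs. 14–17) is given below; the population size, bounds, and parameter values are illustrative assumptions.

```python
import numpy as np

def sca_step(X, P, t, T, a=2.0, rng=np.random.default_rng(0)):
    """One SCA position update (Eqs. 14-17) for a population X
    moving toward the destination (best) solution P."""
    r1 = a - t * a / T                         # Eq. 17: linearly decreasing control parameter
    n, d = X.shape
    r2 = rng.uniform(0, 2 * np.pi, (n, d))     # movement direction
    r3 = rng.uniform(0, 2, (n, d))             # random destination weight
    r4 = rng.uniform(0, 1, (n, d))             # sine/cosine switch
    step = r1 * np.where(r4 < 0.5, np.sin(r2), np.cos(r2)) * np.abs(r3 * P - X)
    return X + step                            # Eq. 16

# Toy usage: 5 candidate solutions in 2 dimensions, hypothetical best P
X = np.random.default_rng(1).uniform(-5, 5, (5, 2))
P = np.zeros(2)
print(sca_step(X, P, t=1, T=100))
```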

2.3.2 Bat algorithm (BA)

All bats use echolocation to sense distance and to distinguish between food and obstacles (Yang et al. 2009). In the first step of BA, the initial population of bats is randomly initialized (Fig. 6). The BA uses the following equations to update the bats' velocities and positions (Yang et al. 2009):

$$ \begin{aligned} f_{i} &= f_{\min } + \left( {f_{\max } - f_{\min } } \right)\beta \hfill \\ v_{i}^{t} &= v_{i}^{t - 1} + \left( {x_{i}^{t - 1} - x^{*} } \right)f_{i} \hfill \\ x_{i}^{t} &= x_{i}^{t - 1} + v_{i}^{t} \hfill \\ \end{aligned} $$
(18)

where \(f_{i}\) is the frequency of bat i, \(f_{\max }\) and \(f_{\min }\) are the maximum and minimum frequencies, \(v_{i}^{t}\) and \(v_{i}^{t - 1}\) are the velocities of bat i at iterations t and t − 1, \(x_{i}^{t}\) and \(x_{i}^{t - 1}\) are the positions of bat i at iterations t and t − 1, \(\beta\) is a random number, and \(x^{*}\) is the current best solution. When a bat comes closer to its prey, its pulse emission rate and loudness are updated as:

$$ \begin{aligned} A_{i}^{t + 1} & = \alpha A_{i}^{t} \\ r_{i}^{t + 1} & = r_{i}^{o} \left[ {1 - \exp \left( { - \gamma t} \right)} \right] \\ \end{aligned} $$
(19)

where \(A_{i}^{t}\) and \(A_{i}^{t + 1}\) are the loudness of bat i at iterations t and t + 1, \(\alpha\) and \(\gamma\) are constants, \(r_{i}^{o}\) is the initial pulse emission rate, and \(r_{i}^{t + 1}\) is the pulse emission rate of bat i at iteration t + 1.

Fig. 6

Flowchart of optimization using bat algorithm (BA)

The bats use a random walk to update their positions:

$$ x_{new} = x_{old} + \varepsilon A_{i}^{t} $$
(20)

where \(x_{new}\) is the bat’s new position, \(x_{old}\) is the old position of the bat, and \(\varepsilon\) is a random number.
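
The BA updates (Eqs. 18–20) can be sketched as follows; the frequency range and the α and γ values are illustrative assumptions, not the settings tuned in this study.

```python
import numpy as np

def bat_step(x, v, x_best, f_min=0.0, f_max=2.0, rng=np.random.default_rng(0)):
    """One BA frequency/velocity/position update (Eq. 18)."""
    beta = rng.uniform(size=x.shape[0])[:, None]
    f = f_min + (f_max - f_min) * beta          # frequency of each bat
    v_new = v + (x - x_best) * f                # velocity update
    return x + v_new, v_new                     # position update

def update_loudness_pulse(A, r0, t, alpha=0.9, gamma=0.9):
    """Loudness and pulse-rate updates as a bat nears its prey (Eq. 19)."""
    return alpha * A, r0 * (1.0 - np.exp(-gamma * t))

# Toy usage with a hypothetical best position at the origin
x = np.random.default_rng(2).uniform(-1, 1, (4, 2))
v = np.zeros_like(x)
x, v = bat_step(x, v, x_best=np.zeros(2))
A, r = update_loudness_pulse(A=1.0, r0=0.5, t=3)
print(x, A, r)
```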

2.3.3 Firefly algorithm (FFA)

The firefly algorithm, introduced by Yang et al. (2009), is based on firefly behavior (Fig. 7). The short, rhythmic flashes produced by fireflies are intended to attract other fireflies that have weaker flashes. The brightness of a firefly is determined by the landscape of the objective function; for a minimization problem, a brighter firefly has a smaller objective function value. The fireflies update their positions as follows:

$$ x_{i} \left( {t + 1} \right) = x_{i} \left( t \right) + \chi (r)\left( {x_{j} \left( t \right) - x_{i} \left( t \right)} \right) + \phi_{t} \upsilon $$
(21)

where \(x_{i} \left( {t + 1} \right)\) and \(x_{i} \left( t \right)\) are the positions of firefly i at iterations t + 1 and t, \(x_{j} \left( t \right)\) is the position of firefly j at iteration t, \(\chi (r)\) is the attractiveness, \(\phi_{t}\) is a step factor, and \(\upsilon\) is a random number. The attractiveness is computed as follows:

$$ \chi \left( r \right) = \chi_{0} e^{{ - \gamma r^{2} }} $$
(22)
$$ r_{ij} = \left\| {x_{i} - x_{j} } \right\| = \sqrt {\sum\limits_{i = 1}^{D} {\left( {x_{id} - x_{jd} } \right)^{2} } } $$
(23)

where \(\chi_{0}\) is the attractiveness at r = 0, D is the number of dimensions, and \(r_{ij}\) is the distance between fireflies i and j.

Fig. 7

Flowchart of optimization using firefly algorithm (FFA)
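
A minimal sketch of the firefly move (Eqs. 21–23) follows; χ0, γ, and the step factor are illustrative values rather than the tuned settings of this study.

```python
import numpy as np

def firefly_move(x_i, x_j, chi0=1.0, gamma=1.0, phi=0.2,
                 rng=np.random.default_rng(0)):
    """Move firefly i toward a brighter firefly j (Eqs. 21-23)."""
    r = np.linalg.norm(x_i - x_j)                 # Eq. 23: Euclidean distance
    chi = chi0 * np.exp(-gamma * r ** 2)          # Eq. 22: attractiveness decays with distance
    upsilon = rng.uniform(-0.5, 0.5, x_i.shape)   # random perturbation
    return x_i + chi * (x_j - x_i) + phi * upsilon  # Eq. 21

# Toy usage: a dimmer firefly drawn toward a brighter one at the origin
print(firefly_move(np.array([2.0, 2.0]), np.array([0.0, 0.0])))
```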

2.3.4 Particle swarm optimization (PSO)

Eberhart and Kennedy (1995) introduced PSO, which is inspired by the social behavior of bird flocking and fish schooling. The algorithm starts with the initialization of random particles in the search space, which then search for the optimal solution over successive generations. At each iteration, each particle is updated using two best values: the best solution it has found so far and the best solution found so far by any particle in the population. The following equations are utilized to update the position and velocity of the particles:

$$ x_{i} \left( t \right) = x_{i} \left( {t - 1} \right) + v_{i} \left( t \right) $$
(24)
$$ v_{i}^{t} = wv_{i}^{t - 1} + C_{1} r_{1} \left[ {P_{{{\text{best}}}} - x_{i} \left( t \right)} \right] + C_{2} r_{2} \left[ {G_{{{\text{best}}}} - x_{i}^{t} } \right] $$
(25)

where \(v_{i}^{t}\) is the velocity of the particle at time t, w is the inertia coefficient, \(C_{1}\) and \(C_{2}\) are acceleration coefficients, r1 and r2 are random numbers, \(P_{{{\text{best}}}}\) is the most promising position found by the particle itself, \(G_{{{\text{best}}}}\) is the most promising position found among all particles of the swarm, and \(x_{i} \left( t \right)\) is the position of the particle at time t.

Fig. 8

Optimal parameters of optimization algorithms based on signal-to-noise (S/N) ratio. a Sine–cosine algorithm (SCA), b firefly algorithm (FFA), c bat algorithm (BA), and d particle swarm optimization (PSO). PS population size, Lo loudness, Fr frequency. For explanation of parameters, see Sect. 2.3
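
A minimal sketch of the PSO update (Eqs. 24–25) is given below; the inertia and acceleration coefficients are common illustrative choices, not the values tuned here.

```python
import numpy as np

def pso_step(x, v, p_best, g_best, w=0.7, c1=1.5, c2=1.5,
             rng=np.random.default_rng(0)):
    """One PSO velocity/position update (Eqs. 24-25)."""
    r1 = rng.uniform(size=x.shape)
    r2 = rng.uniform(size=x.shape)
    v_new = w * v + c1 * r1 * (p_best - x) + c2 * r2 * (g_best - x)  # Eq. 25
    return x + v_new, v_new                                          # Eq. 24

# Toy usage: 4 particles in 2 dimensions with hypothetical best positions
x = np.random.default_rng(3).uniform(-5, 5, (4, 2))
v = np.zeros_like(x)
x, v = pso_step(x, v, p_best=x.copy(), g_best=np.zeros(2))
print(x)
```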

2.4 Hybridization of prediction models with optimization algorithms

2.4.1 ANFIS hybridization

Application of the ANFIS model starts with setting its parameters to optimal values, commonly by using a hybrid learning method combining gradient descent (GD) and least squares estimation (LSE). However, the hybrid LSE–GD method may be unable to achieve an adequate rate of convergence when finding appropriate values of the internal ANFIS parameters, and therefore supporting optimization algorithms are widely applied to optimize them. The premise and consequent parameters in ANFIS are the decision variables that these supporting algorithms optimize. The main function of the optimization algorithms is then to update the initial values of the internal ANFIS parameters using the algorithm operators. An objective function, the root mean square error (RMSE), is defined for the hybrid ANFIS optimization algorithms, and the optimization process tries to minimize it. When the ANFIS optimization algorithms converge to the lowest value of RMSE as a stopping criterion, the hybrid ANFIS model has obtained the optimal values of its internal parameters.
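
The following Python sketch illustrates this hybrid training idea under simplifying assumptions: the internal model parameters are flattened into one vector theta, a generic predict function stands in for the ANFIS forward pass, and SCA iterations minimize the RMSE objective for a fixed iteration budget.

```python
import numpy as np

def rmse(theta, predict, X, y):
    """Objective function for hybrid training: RMSE of model output."""
    return np.sqrt(np.mean((y - predict(theta, X)) ** 2))

def sca_train(predict, X, y, dim, n_pop=20, T=200, a=2.0, seed=0):
    """Hedged sketch: tune a flattened parameter vector (e.g., ANFIS
    premise and consequent parameters) by minimizing RMSE with SCA."""
    rng = np.random.default_rng(seed)
    pop = rng.uniform(-1, 1, (n_pop, dim))       # initial population
    best = pop[np.argmin([rmse(p, predict, X, y) for p in pop])].copy()
    for t in range(1, T + 1):
        r1 = a - t * a / T                       # Eq. 17
        r2 = rng.uniform(0, 2 * np.pi, pop.shape)
        r3 = rng.uniform(0, 2, pop.shape)
        r4 = rng.uniform(size=pop.shape)
        pop = pop + r1 * np.where(r4 < 0.5, np.sin(r2), np.cos(r2)) \
              * np.abs(r3 * best - pop)          # Eq. 16
        fit = np.array([rmse(p, predict, X, y) for p in pop])
        if fit.min() < rmse(best, predict, X, y):
            best = pop[fit.argmin()].copy()      # keep destination solution
    return best

# Toy usage: fit a linear stand-in "model" to synthetic data
predict = lambda theta, X: X @ theta
rng = np.random.default_rng(1)
X = rng.normal(size=(50, 3)); y = X @ np.array([0.5, -0.3, 0.8])
theta = sca_train(predict, X, y, dim=3)
print(theta, rmse(theta, predict, X, y))
```

The same loop applies to the MLP and RBFNN hybridizations described below, with theta holding weight connections and thresholds, or centers, widths, and weights, respectively.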

2.4.2 MLP hybridization

The MLP parameters must be optimized to achieve the most accurate results. Training algorithms are required to set weight connections and threshold values. The initial threshold values and weight connections are defined as the initial population of algorithms. Each of the agents of the algorithms has two key parts: a set of weight connections and a set of threshold values. The values of MLP parameters are updated when the optimization algorithm tries to minimize the error function (RMSE). The convergence cycle of optimization continues until the hybrid MLP optimization algorithm model converges to a minimum objective function value.

2.4.3 RBFNN hybridization

Training algorithms are introduced to search for optimum parameters of the RBFNN model. Each agent of the optimization algorithms has three key parts: the center, width, and weight of the radial basis function nodes. The RBFNN parameters are defined as the initial population of the algorithms, which is entered into the optimization algorithms and updated by their operators. The optimal values of the RBFNN parameters are found when the hybrid RBFNN optimization algorithm model converges to the lowest value of the target objective function.

2.5 Uncertainty analysis of soft computing models

Sequential uncertainty fitting (SUFI-2) is one of the best-known methods for uncertainty analysis (Kumar et al. 2017). In SUFI-2, the parameter uncertainties account for uncertainty in the model inputs, and an objective function must be defined before the uncertainty analysis. Latin hypercube (LH) sampling is conducted, the objective function is assessed, and finally the parameter covariance matrix is computed. In addition, the 95% prediction uncertainty (95PPU) is computed at the 2.5% and 97.5% levels. Uncertainty analysis is required in this study because the optimization algorithms seek exact values of the model parameters while the input values may contain some uncertainty; model uncertainty analysis can thus examine the effect of uncertainty in model structure and parameters on the results. Two indices are used to quantify the uncertainty of the models: the percentage of observed data bracketed by the 95PPU (p index) and an index r computed as follows:

$$ \begin{aligned} \overline{d} & = \frac{1}{n}\sum\limits_{t = 1}^{n} {\left( {y_{97.5\% } - y_{2.5\% } } \right)} \\ r & = \frac{{\overline{d}}}{\sigma } \\ \end{aligned} $$
(26)

where \(\sigma\) is the standard deviation of the observed data, \(y_{97.5\% }\) and \(y_{2.5\% }\) are the upper and lower boundaries of the 95PPU, n is the number of data points, and r is the average width of the confidence interval band relative to the standard deviation. Other evaluation statistics utilized in this study were: RMSE (lower values show more accurate estimation), mean absolute error (MAE) (lower values show more accurate estimation), percentage bias (PBIAS) (values closer to zero show more accurate estimation), and Nash–Sutcliffe efficiency (NSE) (NSE = 1 indicates an ideal model):

$$ {\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {Y_{{{\text{obs}}}} - Y_{{{\text{sim}}}} } \right)^{2} } } $$
(27)
$$ {\text{MAE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left| {Y_{{{\text{obs}}}} - Y_{{{\text{sim}}}} } \right|} $$
(28)
$$ {\text{PBIAS}} = 100 * \left[ {\frac{{\sum\nolimits_{i = 1}^{n} {\left( {Y_{{{\text{obs}}}} - Y_{{{\text{sim}}}} } \right)} }}{{\sum\nolimits_{i = 1}^{n} {\left( {Y_{{{\text{obs}}}} } \right)} }}} \right] $$
(29)
$$ {\text{NSE}} = 1 - \left[ {\frac{{\sum\nolimits_{i = 1}^{n} {\left( {Y_{{{\text{obs}}}} - Y_{{{\text{sim}}}} } \right)^{2} } }}{{\sum\nolimits_{i = 1}^{n} {\left( {Y_{{{\text{obs}}}} - \overline{Y}_{{{\text{obs}}}} } \right)^{2} } }}} \right] $$
(30)

where n is the number of observed data points, \(Y_{{{\text{obs}}}}\) is the observed data, \(Y_{{{\text{sim}}}}\) is the simulated data, and \(\overline{Y}_{{{\text{obs}}}}\) is the mean of the observed data.
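
For reference, Eqs. (27)–(30) can be computed with a few lines of Python; the observed and simulated series shown are hypothetical values for illustration only.

```python
import numpy as np

def evaluation_metrics(obs, sim):
    """Evaluation statistics used in this study (Eqs. 27-30)."""
    obs, sim = np.asarray(obs, float), np.asarray(sim, float)
    rmse = np.sqrt(np.mean((obs - sim) ** 2))                    # Eq. 27
    mae = np.mean(np.abs(obs - sim))                             # Eq. 28
    pbias = 100.0 * np.sum(obs - sim) / np.sum(obs)              # Eq. 29
    nse = 1.0 - np.sum((obs - sim) ** 2) / np.sum((obs - obs.mean()) ** 2)  # Eq. 30
    return {"RMSE": rmse, "MAE": mae, "PBIAS": pbias, "NSE": nse}

# Hypothetical observed/simulated SSL series (ton/day)
obs = [120.0, 340.0, 90.0, 560.0]
sim = [110.0, 360.0, 95.0, 530.0]
print(evaluation_metrics(obs, sim))
```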

3 Results and discussion

3.1 Selection of appropriate inputs for soft computing models

In this study, the soft computing models are used to predict SSL (t) (a 1-day-ahead forecast of SSL). Principal component analysis (PCA) is an effective method for identifying model inputs and decreasing the number of input parameters required (Lu et al. 2019). PCA achieves parsimony by describing the maximum amount of common variance in a correlation matrix using the smallest number of illustrative concepts. The Kaiser–Meyer–Olkin (KMO) criterion, a measure of the proportion of variance among variables that might be common variance (Darabi et al. 2014), is used to check the adequacy of the data:

$$ {\text{KMO}} = \frac{{\sum {\left( {{\text{correlation}}} \right)^{2} } }}{{\sum {\left( {{\text{correlation}}} \right)^{2} + \sum {\left( {{\text{partial}}\;{\text{correlation}}} \right)^{2} } } }} $$
(31)

According to the literature, the minimum acceptable value of KMO is 0.5; in this study, KMO was 0.65. The correlation among variables should be checked to avoid multicollinearity problems (Lu et al. 2019). In this study, all correlation values were below the threshold (0.9), so there were no multicollinearity problems. Table 2 shows the contribution of the principal components (PCs). The results indicated that the first three PCs explained 60, 23, and 12% of the variance of the input variables at Talar station, and 61, 20, and 11% at Kasilian station. Lagged data (one-day to nine-day lagged rainfall, one-day to nine-day lagged discharge, and one-day to nine-day lagged SSL) were regarded as the initial candidate inputs. The first three PCs were affected more by one-day and two-day lagged SSL, one-day lagged R, and one-day lagged Q than by any other variables (Table 2). The direction of the new feature space was determined by the eigenvectors, and the variance of the data by the eigenvalues (Table 2). The PCs are combinations of the independent variables.
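
A hedged sketch of this PCA-based screening, eigendecomposing the correlation matrix of candidate lagged inputs (the synthetic data are illustrative only), is shown below.

```python
import numpy as np

def pca_components(X, n_keep=3):
    """Eigendecompose the correlation matrix of candidate (lagged) inputs
    and report the variance share of the leading components."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)   # standardize columns
    corr = np.corrcoef(Z, rowvar=False)        # correlation matrix
    eigvals, eigvecs = np.linalg.eigh(corr)    # ascending eigenvalues
    order = np.argsort(eigvals)[::-1]          # sort descending
    explained = eigvals[order] / eigvals.sum()
    return explained[:n_keep], eigvecs[:, order[:n_keep]]

# Synthetic matrix standing in for lagged R, Q, and SSL candidates
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 6))
X[:, 1] = X[:, 0] + 0.1 * rng.normal(size=200)   # one correlated pair
var_share, loadings = pca_components(X)
print(var_share)   # share of variance captured by the first 3 PCs
```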

Table 2 Eigenvalues and eigenvectors obtained for principal components (PCs) describing the most influential variables (time (t)-lagged suspended sediment load (SSL), discharge (Q), rainfall (R))

3.1.1 Tuning the random parameters in optimization algorithms

In the current work, the Taguchi method was utilized to set the random parameters of the evolutionary algorithms. Population size and r2, r3, and r4 were regarded as the random parameters of SCA that can affect the accuracy of the proposed model. Four levels were defined for each of these four parameters. The total number of tests required to find the optimum parameter values was computed as:

$$ {\text{Number}} = 1 + N\left( {L - 1} \right) $$
(32)

where L is the number of levels and N is the number of parameters. Hence, at least 1 + 4 × (4 − 1) = 13 experiments had to be conducted for SCA. In addition, the Taguchi method uses the signal-to-noise (S/N) ratio to select the optimal parameter values (Mozdgir et al. 2013):

$$ \frac{S}{N}\;{\text{ratio}} = - 10\log \left( {{\text{objective}}\;{\text{function}}} \right)^{2} $$
(33)

where the optimal values of the random parameters are those with the highest S/N ratio. Figure 8 depicts the computed S/N ratios for the parameters of the four optimization algorithms tested here.
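
The two Taguchi quantities used here (Eqs. 32–33) reduce to a few lines of Python; the RMSE values passed to the S/N function are hypothetical illustrations.

```python
import numpy as np

def n_experiments(n_params, n_levels):
    """Minimum number of Taguchi experiments (Eq. 32)."""
    return 1 + n_params * (n_levels - 1)

def taguchi_sn_ratio(objective_value):
    """Smaller-the-better signal-to-noise ratio (Eq. 33):
    the parameter level with the highest S/N is preferred."""
    return -10.0 * np.log10(objective_value ** 2)

# For SCA: 4 tuned parameters, each at 4 levels -> 1 + 4*(4-1) = 13 runs
print(n_experiments(4, 4))
# Hypothetical RMSE values from two candidate parameter settings
print(taguchi_sn_ratio(934.2), taguchi_sn_ratio(1423.2))
```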

3.2 Talar station

For Talar station, the best results with the training dataset were obtained when ANFIS-SCA was used (RMSE: 934.2 ton/day, MAE: 912.3 ton/day, NSE: 0.93, PBIAS: 0.14) and the worst results were obtained when RBFNN was used (RMSE: 1789.10 ton/day, MAE: 1765.2 ton/day, NSE: 0.77, PBIAS: 0.36) (Table 3). MLP-SCA was the second-best model, and the hybrid and stand-alone MLP models outperformed the hybrid and stand-alone RBFNN models (Table 3). Comparison of results obtained using the optimization algorithms revealed that SCA provided the best results and FFA the worst results.

Table 3 Value of evaluation criteria (root mean square error (RMSE), mean absolute error (MAE), Nash–Sutcliffe efficiency (NSE), percentage bias (PBIAS)) obtained for the different hybrid and stand-alone models in the training and testing phases for the Talar and Kasilian stations

The best results with the testing dataset for Talar station were also obtained with ANFIS-SCA (RMSE: 1423.2 ton/day, MAE: 1412.10 ton/day, NSE: 0.92, PBIAS: 0.16) (Table 3). Based on the RMSE, MAE, and PBIAS values, in the test stage the hybrid ANFIS, MLP, and RBFNN models were more accurate than the stand-alone ANFIS, MLP, and RBFNN models. Adding SCA decreased the RMSE of ANFIS, MLP, and RBFNN by 20%, 21%, and 22%, respectively. The testing results indicated that the hybrid and stand-alone MLP models were better than the hybrid and stand-alone RBFNN models.

3.3 Kasilian station

For Kasilian station, in the training stage ANFIS-SCA was the best-performing model (RMSE: 898.1 ton/day, MAE: 712.3 ton/day, NSE: 0.95, PBIAS: 0.12) and RBFNN was the worst (RMSE: 1655.6 ton/day, MAE: 1645.2 ton/day, NSE: 0.82, PBIAS: 0.39) (Table 3). The hybrid and stand-alone MLP models outperformed the stand-alone and hybrid RBFNN models during the training phase, while ANFIS-SCA outperformed MLP-SCA and RBFNN-SCA in terms of precision. Overall, the RMSE, MAE, PBIAS, and NSE values for SCA proved its superiority among the optimization algorithms tested, while FFA gave the worst results. The performance of the hybrid ANFIS, MLP, and RBFNN models surpassed that of their stand-alone counterparts in the training stage.

In the testing phase, ANFIS-SCA again provided the best results (RMSE: 1412.10 ton/day, MAE: 1403.4 ton/day, NSE: 0.92; PBIAS: 0.14), and RBFNN again exhibited the worst results (RMSE: 1789.1 ton/day, MAE: 1767.2 ton/day, NSE: 0.65; PBIAS: 0.49) (Table 3). The hybrid ANFIS, MLP, and RBFNN models outperformed the stand-alone ANFIS, MLP, and RBFNN models. Based on the assessment statistics for Kasilian station, SCA was the most accurate optimization algorithm and, as in the training phase, FFA gave the worst results (Table 3). The evaluation criteria also confirmed the superiority of ANFIS-SCA, followed by MLP-SCA, in comparison with RBFNN-SCA.

In order to visually summarize how closely the proposed models matched the observed values, Taylor diagrams were used to display the match between observed data and model output in terms of RMSE, standard deviation, and correlation. The Taylor diagram for Talar station, in which statistics for the 15 models (see Table 3) were calculated and a colored circle was assigned to each model, is shown in Fig. 9a. The position of each circle in the diagram quantifies how closely that model's estimated SSL matched the measured data, with the centered RMSE proportional to the distance from the reference point on the horizontal axis representing the observed data. The whole dataset was used to plot the Taylor diagrams. The results revealed that for ANFIS-SCA, MLP-SCA, and RBFNN-SCA, the centered RMSE was 1000, 1050, and 1189 ton/day, respectively. The hybrid soft computing models resulted in lower RMSE than the stand-alone models (Fig. 9a).

Fig. 9

Taylor diagram for the different hybrid and stand-alone models (using whole dataset) for a Talar station and b Kasilian station

The Taylor diagram for Kasilian station is shown in Fig. 9b. It indicates that the ANFIS-SCA and MLP-SCA predictions gave the best match with observed data and that the ANFIS-SCA model had a higher Taylor correlation and lower RMSE than the other models. The RBFNN model had the highest RMSE. With the hybrid MLP, RBFNN, and ANFIS models, the Taylor correlation increased from 0.4 to 0.97. The highest RMSE values were found for the stand-alone and hybrid MLP models (980–3000 ton/day) and the stand-alone and hybrid RBFNN models (1100–3300 ton/day) (Fig. 9b).

3.4 Uncertainty analysis of models and box plots

Comparison of models in terms of the selected indices (p and r) showed that ANFIS-SCA provided better r (0.12) and p (0.95) values for both Talar and Kasilian stations (Table 4). The hybrid ANFIS, MLP, and RBFNN models gave better p and r values than the stand-alone ANFIS, MLP, and RBFNN models. In addition, RBFNN-FFA had the lowest p (0.76) and highest r (0.39) among the hybrid models. For Talar station, the hybrid and stand-alone ANFIS models gave better r and p values than the hybrid and stand-alone MLP and RBFNN models (Table 4). Figure 10a, b shows the box plots for different models at Talar and Kasilian stations. The results indicated that ANFIS-SCA, MLP-SCA, and RBFNN-SCA most closely matched the observed SSL, outperforming the stand-alone ANFIS, MLP, and RBFNN models at both stations.

Table 4 Uncertainty results (r, p) for the different hybrid and stand-alone models
Fig. 10

Box plots of suspended sediment load (SSL) values obtained with the different hybrid and stand-alone models for a Talar station, b Kasilian station, and c Eagle Creek Basin

Overall, this study showed that the hybrid ANFIS-SCA model has good ability for predicting SSL in rivers. However, different climate parameters affected the SSL values obtained (Table 2), so follow-up studies should predict SSL for future periods using climate models and scenarios describing projected changes in meteorological parameters such as temperature and rainfall.

3.4.1 Analysis for the Eagle Creek Basin

Table 2 indicates that the first three components (PC1, PC2, and PC3) had the greatest contributions. Furthermore, SSL (t − 1), R (t − 1), and Q (t − 1) were more significant for all three components than the other data. Thus, the first three components were selected as input to the models. Table 5 shows that the ANFIS-SCA model reduced RMSE by 15% and 21% compared to the MLP-SCA and RBFNN-SCA models in the training phase. Comparing model performance indicated that the ANFIS-SCA model decreased MAE by 18%, 32%, 37%, and 49% compared to the ANFIS-BA, ANFIS-PSO, ANFIS-FFA, and ANFIS models, respectively, in the training phase. The ANFIS-SCA model, with the highest value of p and the lowest value of r, had less uncertainty than the other models in the training phase. Moreover, the findings indicated that the RBFNN model, with an NSE of 0.77, had the weakest performance in the training phase. Examination of PBIAS values in the training phase indicated that the MLP-SCA model had a lower PBIAS than the other MLP models, showing its better performance. In the test phase, the ANFIS-SCA model reduced the RMSE by 8.8%, 24%, 38%, and 45% compared to the ANFIS-BA, ANFIS-PSO, ANFIS-FFA, and ANFIS models, respectively. Additionally, the RBFNN model, with NSE and PBIAS of 0.62 and 0.55, had the weakest performance among the models in the test phase. The hybrid ANFIS models had higher p and lower r values than the ANFIS model, showing less uncertainty than the stand-alone ANFIS model. Ultimately, Fig. 10c indicates that the ANFIS-SCA box plot was more in line with the observed data than those of the other models, and that the stand-alone ANFIS, MLP, and RBFNN models were less accurate than the models using optimization algorithms. Hence, the performance of the models for the second case study indicated that the ANFIS-SCA model had the best accuracy. Table 5 presents the performance of the different hybrid and stand-alone models in the training and testing phases for the Eagle Creek Basin.

Table 5 Performance of the different hybrid and stand-alone models for the training and testing phases for the Eagle Creek Basin

4 Conclusions

Knowledge of suspended sediment load in rivers is essential, as SSL results from soil erosion and plays a key role in watershed management, river morphology, and the operation of hydraulic structures. The current research investigated the potential of evolutionary soft computing approaches in suspended sediment load modeling. Soft computing approaches such as the ANFIS, MLP, and RBFNN models are widely used to estimate SSL, but their output is not always sufficiently accurate for basin management. In this study, four optimization algorithms (SCA, PSO, BA, and FFA) were used to train the ANFIS, MLP, and RBFNN models for suspended sediment load prediction at the basin scale (the Talar and Eagle Creek basins, located in northern Iran and the central USA, respectively). The second case study demonstrated that the ANFIS-SCA model decreased MAE by 18%, 32%, 37%, and 49% compared to the ANFIS-BA, ANFIS-PSO, ANFIS-FFA, and ANFIS models, respectively, in the training phase. However, different climate parameters affected the SSL values, so future studies should predict SSL for future periods using models and scenarios describing future changes in climate. Each optimization algorithm in the study showed high accuracy, appropriate convergence speed, and a very high capacity for solving optimization problems. The conclusions are as follows:

  • The novel optimized models make an important scientific contribution to the development of a powerful model for suspended sediment load prediction at the watershed scale. The sine–cosine algorithm (SCA) optimizer gave strong predictive capacity to the multilayer perceptron (MLP), adaptive neuro-fuzzy system (ANFIS), and radial basis function neural network (RBFNN) models.

  • Among the optimized models, ANFIS-SCA showed the best performance in the training and testing phases for both stations, while RBFNN showed the lowest accuracy.

  • Optimization of the models using SCA decreased the RMSE by 20%, 21%, and 22% for ANFIS, MLP, and RBFNN, respectively.

  • The uncertainty analysis (based on sequential uncertainty fitting, SUFI-2) indicated that the hybrid ANFIS, MLP, and RBFNN models were the most accurate (lowest r index, highest p index) of the models tested. Overall, ANFIS-SCA showed a good ability for predicting SSL.

  • This study can serve as basic research for future studies and other regions (using other optimization algorithms or soft computing models) seeking suspended sediment load prediction at the watershed scale with optimized models.