Introduction

A significant concern regarding sediment transport with inflow is solid matter deposition in the pipe channel. It is essential to determine the limiting velocity (at a constant slope) in dry weather flow (DWF) for sediment transport without deposition. Moreover, the pipe diameter should be sufficiently capable of transporting maximum flow in wet weather flow (WWF). Sedimentation can increase the bed roughness and decrease the cross-sectional area of the channel. The long-term accumulation of sediment on the bed can increase the risk of stabilization and cementation, thus causing transport capacity reduction. Therefore, a criterion for predicting the limiting velocity to prevent sediment deposition is essential.

The simplest traditional method is to use a constant limiting velocity value provided in many references for different hydraulic and geographical situations (see more details in Ebtehaj et al. 2014). This method is inaccurate in some circumstances because the values do not consider hydraulic and sediment characteristics, such as pipe diameter, flow depth, hydraulic radius, median particle diameter and volumetric sediment concentration.

Numerous experimental and analytical studies have been done to identify which parameters influence sediment transport (Mayerle et al. 1991; Nalluri and Ab Ghani 1996; May 2003; Banasiak 2008; Vongvisessomjai et al. 2010; Almedeij 2012; Bonakdari and Ebtehaj 2014a). These studies yielded various equations for determining the limiting velocity. The presented equations fall in two groups: (i) semi-experimental equations derived from solving the force acting on equilibrium and (ii) dimensional analysis-based equations. May et al. (1996) used 7 different datasets (Ackers et al. 1996) in 332 tests and presented a semi-experimental relationship in the form of Eq. (1). It is recognized as the best semi-experimental equation for non-deposition sediment transport (Vongvisessomjai et al. 2010; Ebtehaj and Bonakdari 2014a):

$$C_{V} = 3. 0 3\times 1 0^{ - 2} \left( {\frac{{D^{2} }}{A}} \right)\left( {\frac{d}{D}} \right)^{ 0. 6} \left( {\frac{{V^{2} }}{g (s - 1 )D}} \right)^{ 1. 5} \left( {1 - \frac{{V_{t} }}{V}} \right)^{ 4}$$
(1)
$$V_{t} = 0. 1 2 5\sqrt {g (s - 1 )d} \left( {\frac{y}{d}} \right)^{ 0. 4 7} ,$$
(2)

where C V is the volumetric sediment concentration, D is the pipe diameter, d is the median particle diameter, V is the flow velocity, V t is the incipient motion velocity of sediment (Eq. 2), A is the cross-sectional area of the flow, g is the gravitational acceleration, s (=ρ s ) is the specific gravity of sediment and y is the flow depth.

Ebtehaj et al. (2014) presented a regression-based equation to determine the densimetric Froude number (Fr) by dimensional analysis with a wide range of data as follows:

$$Fr = \frac{V}{{\sqrt {g (s - 1 )d} }} = 4. 4 9\;C_{V}^{ 0. 2 1} \left( {\frac{d}{R}} \right)^{ - 0. 5 4} .$$
(3)

The lack of knowledge and complexity of flow conditions have led to uncertainty modeling. Due to the innate uncertainty and insufficient accuracy of regression-based equations, such equations do not perform well in some hydraulic circumstances. Therefore, utilizing this method leads to overestimated or underestimated plans (Bonakdari and Ebtehaj 2014b).

Because artificial intelligence (AI) performs adequately (Gholami et al. 2011; Al-Abadi 2014; Gorai et al. 2014; Mondal et al. 2015), techniques such as neural networks (Ebtehaj and Bonakdari 2013), decision trees (Ivanovich and Hamid 2014), fuzzy logic (Demirci and Baltaci 2013), evolutionary computing (Ebtehaj and Bonakdari 2016), and gene expression programming (Ab Ghani and Azamathulla 2014) have been successfully applied in sediment transport modeling. Shoorehdeli et al. (2007) developed a new hybrid particle swarm optimization (PSO) algorithm for ANFIS network training. The authors modified the PSO for the training scheme in the antecedent part of the fuzzy rules, inspired by the genetic algorithm and using adaptive weighted by PSO. They demonstrated that the new ANFIS is less complex and more accurate than the gradient-based method in ANFIS training.

Moosavi et al. (2013) applied hybrid models based on Wavelet, Wavelet-ANFIS and Wavelet-ANN for groundwater level forecasting during different prediction periods. The results of ANFIS, Wavelet-ANFIS, ANN and Wavelet-ANN indicated that the hybrid methods exhibit superior precision to both ANFIS and ANN, while Wavelet-ANFIS performs the best of all models. Ebtehaj and Bonakdari (2014b) optimized the MLP-ANN weights in sediment transport prediction using two evolutionary algorithms (EA): the imperialist competitive algorithm (ICA) and genetic algorithm (GA). A comparison of the hybrid methods (ANN-ICA and ANN-GA) with the general ANN indicated that EA performs significantly better than ANN.

Ebtehaj and Bonakdari (2014a) evaluated the ANFIS performance in predicting sediment transport. The authors recommended applying evolutionary algorithms for the optimum selection of ANFIS membership functions. Therefore, in this study, ANFIS is coupled with the DE algorithm for the first time to develop a hybrid model and assess sediment transport in sewers. The main goal is to increase prediction accuracy and reliability by benefiting from the specific nature of each approach. The DE algorithm is applied to optimize the membership function of the ANFIS network using three datasets with a wide range of data. The parameters affecting limiting velocity (i.e. Fr) prediction are identified initially by examining the influential factors on sediment transport. Thereafter, six models are proposed to assess the effect of different parameters on Fr prediction. The developed ANFIS-DE model predictions are compared with the ANFIS model as well as existing sediment transport equations.

Theoretical background of the method used

Overview of ANFIS

Adaptive neuro-fuzzy inference systems (ANFIS) (Jang et al. 1997) are a hybrid soft computing (SC) technique where artificial neural networks (ANN) are combined with fuzzy logic (FL). The fuzzy membership values are obtained through ANN training (Bui et al. 2017a). This method’s process entails rapid convergence and high accuracy. A fuzzy inference system (FIS) with two inputs (x and y) and f as an output is an example of this process. For sediment transport in pipe channels, the x and y parameters are considered the volumetric sediment concentration (C V ) and a dimensionless parameter (the ratio of hydraulic radius to median particle diameter d/R). Moreover, F as an output parameter represents the Fr function. This example is examined for two input parameters. For a first-order Takagi–Sugeno fuzzy model, it is possible to set a sample rule with two IF–THEN rules as follows:

$${\text{Rule 1}}: {\text{IF}}\mathop {}\nolimits_{{}} x = A_{1} ,\mathop {}\nolimits_{{}} y = B_{1} \mathop {}\nolimits_{{}} {\text{THEN}}\mathop {}\nolimits_{{}} f_{1} = p_{1} x + q_{1} y + r_{1}$$
(4)
$${\text{Rule 1}}: {\text{IF}}\mathop {}\nolimits_{{}} x = A_{2} ,\mathop {}\nolimits_{{}} y = B_{2} \;{\text{THEN}}\mathop {}\nolimits_{{}} f_{1} = p_{2} x + q_{2} y + r_{2} .$$
(5)

The antecedent part (IF) and consequent part (THEN) of a fuzzy rule are fuzzy in nature and the crisp function of the variable is a linear equation or rule (respectively). If C V and d/R are considered Fr model input variables, Eqs. 1 and 2 can be rearranged as follows:

$${\text{Rule 1}}:{\text{IF}}\,\;C_{V} \,{\text{is}}\,{\text{LOW}}\,{\text{and}}\, (d/R )\,{\text{is}}\,{\text{LOW}},\;{\text{THEN}}\,Fr = p_{ 1} C_{V} + q_{1} (d/R )+ r_{1}$$
(6)
$${\text{Rule 1}}: {\text{IF}}\;C_{V} \,i{\text{s}}\,\,{\text{HIGH}}\,{\text{and}}\, (d/R )\,{\text{is}}\,{\text{Medium}},\;{\text{THEN}}\;Fr = p_{2} C_{V} + q_{2} (d/R )+ r_{2} ,$$
(7)

where p i , q i and r i (i = 1, 2, 3,…, n) are the set of parameters. Figure 1 presents the overall structure of ANFIS with two inputs and ANFIS with four inputs used in this study. The performance of the different layers shown in this figure is as follows:

Fig. 1
figure 1

ANFIS structures (general and quadruple)

First layer Each node in this layer produces the membership degree from an input variable.

$$O_{i}^{1} = \mu_{{A_{i} }} (C_{V} )\quad i = 1 { ,2,}$$
(8)

where C V is the input of the ith node and A i is the linguistic label related to this node’s function. For d/R (the second input parameter), another function can be derived as in Eq. 9:

$$O_{i}^{ 1} = \mu_{{B_{i - 2} }} (d/R)\quad i = 3 { ,4} .$$
(9)

O 1 i is regarded as the membership function A i (MF) and it determines the degree that satisfies a given input (C V or d/R) to the A i quantity. These functions are smooth and have concise notation; for this reason, the most accepted membership functions in ANFIS are deemed to be the Gaussian and bell shape functions (Bui et al. 2012). Both have the advantages of smoothness and being non-zero at all points. The bell shape MF (Eq. 10) has a greater parameter than the Gaussian MF (Eq. 11); as a result, it can approach the non-fuzzy set if the free parameter is tuned (MATLAB and Statistics Toolbox Release 2009). Therefore, the Gaussian MF is employed in this study.

$$A_{i} (C_{V} )= \frac{ 1}{{ 1+ [ (C_{V} - C_{i} )/a_{i} ]^{{ 2b_{i} }} }}$$
(10)
$$A_{i} (C_{V} )= { \exp }\left( { - \frac{{\left\| {C_{V} - c_{i} } \right\|^{ 2} }}{{ 2\sigma_{i}^{ 2} }}} \right),$$
(11)

where {a i , b i , c i and σ i } is the parameter set of Gaussian MFs and μ is the MF of A i . Changing any of the parameters leads to different MF results. Hence, representing different forms of MFs is dependent on the linguistic label A i . In fact, many of the piecewise and continuous functions in this node may be used. The parameters of this layer are introduced as premise parameters. Three membership functions are derived for each input model presented in Fig. 2.

Fig. 2
figure 2

Fuzzy membership functions for the input parameters

Second layer This layer consists of circular nodes marked with П. The following equation is multiplied by the input signal and these nodes are sent to the output produced:

$$O_{i}^{ 2} = w_{i} = A_{i} (C_{V} )B_{i} (d/R )\mathop {}\nolimits_{{}}^{{}} \mathop {}\nolimits_{{}}^{{}} i = 1 , 2.$$
(12)

Each node’s output indicates the firing strength of a specified rule.

Third layer Here, circular nodes (Ns) calculate the ratio of the firing strength rule (ith rule) to the sum of all firing strength rules, as follows:

$$O_{i}^{ 3} = \bar{w}_{i} = \frac{{w_{ 1} }}{{w_{ 1} + w_{ 2} }},\mathop {}\nolimits_{{}}^{{}} \mathop {}\nolimits_{{}}^{{}} i = 1 , 2.$$
(13)

This layer’s output is presented as the normalized firing strength.

Fourth layer The parameter values of p, q and r are optimized in this layer. All nodes in this layer adapt to a node function as follows:

$$O_{i}^{ 4} = \bar{w}_{i} (p_{i} C_{V} + q_{i} (d/R )+ r_{i} ) ,$$
(14)

where {p i , q i , r i } is the parameter set and \(\bar{w}\) is the normalized weight. The parameters in this layer are known as consequent parameters.

Fifth layer The circular single node in this layer, Σ, and all outputs are calculated as the sum of all input signals as:

$$O_{i}^{ 5} = \sum {\bar{w}_{i} f_{i} = \frac{{\sum {w_{i} f_{i} } }}{{\sum {w_{i} } }}} .$$
(15)

ANFIS is trained using backpropagation and a hybrid of backpropagation with least squares. Hence, the hybrid method employs backpropagation for input data (premise parameters) and least squares for output MFs (consequent parameters).

Ebtehaj and Bonakdari (2014a) showed that the hybrid algorithm presents better results than backpropagation. Hence, this algorithm is employed in the present study to estimate Fr with the ANFIS network. In addition to these algorithms, the more recent use of hybrid ANFIS has led to improved ANFIS prediction results (Cus et al. 2009; Shoorehdeli et al. 2009; Chang et al. 2011; Chen 2013; Bui et al. 2016a, 2017b). Therefore, Differential Evolution (DE) is employed in this study and the results are compared with the hybrid algorithm results.

Differential evolution

Differential evolution (DE) is a powerful evolutionary global optimization method that was proposed by Storn and Price (1997). The advantages of DE are its simple structure, quality of solutions found, and ease of implementation (Liu and Lampinen 2002; Bui et al. 2016b). Thus, it has been applied in many different practical cases. EA is grouped as an optimization stochastic algorithm and is inspired by biological processes, whereby survival of the fittest is required for compliance with environmental and inherent genetic features (Bäck et al. 1997).

If the objective functions to be optimized and named are f, we have:

$$f (V ):R^{D} \to R,$$
(16)

where R is related to real data and D represents the objective function parameters f (V). The DE algorithm aims to minimize the objective function using the optimized parameter values:

$$V = (v_{1} , \ldots ,v_{D} ),\quad V \in R^{D} ,$$
(17)

where V is a vector including the parameters of the objective function of D. The objective function is the mean squared error between the actual and estimated Fr. The objective function parameters are defined as follows:

$$v_{i}^{{ ( {\text{L)}}}} \le v_{i} \le v_{i}^{{ ( {\text{U)}}}} ,$$
(18)

where v (L) i and v (U) i are related to the lower and upper boundaries, respectively.

Like other evolutionary algorithms, DE acts on a population (P G ) of candidate solutions and not only on a single solution. If G is considered the generation of a population, the population evaluated by DE can be stated as follows:

$$P_{G} = (v_{1,G} ,v_{2,G} , \ldots ,V_{{{\text{NP}},G}} )\;G = 0, \ldots ,G_{\hbox{max} } ,$$
(19)

where G max is the maximum generation that usually serves as the stopping criterion of DE (Bui et al. 2016b). Each vector contains the exact parameter of D considered an individual chromosome.

$$v_{i,G} = (v_{1,i,G} ,v_{2,i,G} , \ldots ,V_{D,i,G} )\quad i = 1,2, \ldots ,{\text{NP}}\quad G = 0, \ldots ,G_{\hbox{max} } .$$
(20)

An initial population should be created to establish a starting point for the optimal search. In general, there is no information about the optimal solutions except the optimum problem parameters. Therefore, one of the methods to determine the initial population, P G  = 0, is the random selection of restrictions given as follows:

$$v_{j,i,0} = {\text{rand}}_{j} [0,1]\,(v_{j}^{(U)} - v_{j}^{(L)} ) + v_{j}^{(L)} \quad i = 1,2, \ldots ,{\text{NP}}\quad j = 1,2, \ldots ,D,$$
(21)

where rand j [0, 1] is a random value distributed consistently in the [0, 1] range, which is selected for every new j. The DE procedure differs from other evolutionary algorithms. From the primary production to the regular population of vectors, P G is combined and sampled randomly to produce candidate vectors for the next generation, P G + 1.

A candidate population of vectors obtained from different trials, \(P_{{_{G + 1} }}^{/}\) = U iG+1 = u j , I, G + 1, is calculated as follows:

$$u_{j,i,G + 1} = \left\{ {\begin{array}{*{20}c} {\left\{ \begin{aligned} v_{j,r3,G} + F(v_{j,r1,G} - v_{j,r2,G} )\;{\text{if}}\,v_{j}^{{({\text{L}})}} < y_{j,i,G + 1} < v_{j}^{{({\text{U}})}} \hfill \\ {\text{otherwise}}\;{\text{and}}_{j} [0,1]\,(v_{j}^{{({\text{U}})}} - v_{j}^{{({\text{L}})}} ) + v_{j}^{{({\text{L}})}} \hfill \\ \end{aligned} \right.} & {} \\ {{\text{otherwise}}\quad v_{j,i,G} } & {} \\ \end{array} } \right\},$$
(22)

where r 1 , r 2 and r 3 are values that differ in each run and i is a parameter whose value should be specified. The correct values of parameters r 1 , r 2 and r 3 are randomly selected for each i value.

The next generation population, P G + 1, is selected from the current population (P G ) and the population of children follows this equation:

$$V_{{i,G{ + 1}}} = \left\{ \begin{aligned} U_{{i,G{ + 1}}} \mathop {}\nolimits_{{}}^{{}} \mathop {}\nolimits_{{}}^{{}} {\text{if}}_{{}}^{{}} f (U_{i,G + 1} )\le (V_{i,G} )\hfill \\ V_{i,G} \mathop {}\nolimits_{{}}^{{}} \mathop {}\nolimits_{{}}^{{}} {\text{otherwise}} \hfill \\ \end{aligned} \right.\left. \begin{aligned} \hfill \\ \hfill \\ \hfill \\ \end{aligned} \right\}$$
(23)

Thus, the temporary population of individuals is compared with their peers’ population. Assuming the objective function is minimized, the vector with the lowest objective function value gains a new position in the next generation. As a result, every individual from the next population is good or better than peers from the general population.

Statistical measure

To evaluate the performance of each method, the following statistical indicators are calculated: the coefficient of determination (R 2), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Relative Error (MARE) and average absolute deviation (δ).

$$R^{2} = 1 - \frac{{\sum\nolimits_{i = 1}^{n} {(O_{i} - t_{i} )^{2} } }}{{\sum\nolimits_{i = 1}^{n} {(O_{i} - \bar{O}_{i} )^{2} } }}$$
(24)
$$\text{RMSE} = \sqrt {\frac{ 1}{n}\sum\nolimits_{i = 1}^{n} {\left( {O_{i} - t_{i} } \right)^{ 2} } }$$
(25)
$$\text{MAE} = \frac{ 1}{n}\sum\nolimits_{i = 1}^{n} {\left| {O_{i} - t_{i} } \right|}$$
(26)
$$\text{MARE} = \frac{ 1}{n}\sum\nolimits_{i = 1}^{n} {\frac{{\left| {O_{i} - t_{i} } \right|}}{{O_{i} }}}$$
(27)
$$\updelta = \frac{{\sum\nolimits_{i = 1}^{n} {\left| {O_{i} - t_{i} } \right|} }}{{\sum\nolimits_{i = 1}^{n} {O_{i} } }}.$$
(28)

Methodology

Experimental and analytical studies of sediment transport in pipe channels indicate the effects of the hydraulic parameters of flow and sediment characteristics on the velocity of flow. Therefore, to determine the limiting velocity that would prevent solids from settling on the channel bed, a functional relationship is given as follows:

$$V = f(R,y,d,C_{V} ,A,g,\rho ,\rho_{s} \lambda_{s} ).$$
(29)

To estimate the limiting velocity parameters, the dimensionless parameters should be defined. Several studies in this field (Ab Ghani 1993; Vongvisessomjai et al. 2010; Azamathulla et al. 2012; Ebtehaj and Bonakdari 2013) presented the influential parameters on sediment transport in non-deposition state as follows:

$$Fr = V/\sqrt {g(s - 1)d} = f(C_{V} ,D_{\text{gr}} ,d/D,d/R,R/D,D^{2} /A,\lambda_{s} ),$$
(30)

where R is the hydraulic radius, y is the flow depth, d is the median particle diameter, C V is the volumetric sediment concentration, A is the cross-sectional area of the flow, g is the gravitational acceleration, ρ is the water density, ρ s is the sediment density, λ s is the overall friction factor of sediment, Fr is the densimetric Froude number, s is the specific gravity of sediment (=ρ/ρ s ), D gr (=d(g(s  1) 2)1/3) is the dimensionless particle number and D is the pipe diameter.

In some recent studies, Ebtehaj and Bonakdari (2013; 2014a; 2014b; 2016) classified the dimensionless parameters presented in the above equation with respect to the nature of each parameter in the dimensionless groups of transport (C V ), transport mode (d/R, D 2 /A and R/D), flow resistance (λ s ), movement (Fr) and sediment (D gr and d/D).

Thus, the Fr parameter is the target selected to estimate the limiting velocity parameter. By considering the effect of each of the following categories, four inputs are derived for the models. The inputs obtained from “transport “and “flow resistance” are constant, while “sediment” and “transport mode” have more than one parameter. Hence, different models are presented to investigate the effect of each parameter, as follows:

$${\text{Model }}1:Fr = f(C_{V} , \, D_{gr} , \, d/R, \, \lambda_{s} )$$
$${\text{Model }}2:Fr = f(C_{V} , \, D_{gr} , \, D^{2} /A, \, \lambda_{s} )$$
$${\text{Model }}3:Fr = f(C_{V} , \, D_{gr} , \, R/D, \, \lambda_{s} )$$
$${\text{Model }}4:Fr = f(C_{V} , \, d/D, \, d/R, \, \lambda_{s} )$$
$${\text{Model }}5:Fr = f(C_{V} , \, d/D, \, D^{2} /A, \, \lambda_{s} )$$
$${\text{Model }}6:Fr = f(C_{V} , \, d/D, \, R/D, \, \lambda_{s} ),$$

The study site and data used

In this study, three datasets are used to predict sediment transport in the non-deposition condition in pipe channels (Ab Ghani 1993; Ota and Nalluri 1999 and Vongvisessomjai et al. 2010). The data employed in this study are all associated with non-deposition in different hydraulic conditions, such as pipe diameter, channel slope, particle size and channel length. Details of the datasets are provided in studies carried out by Ebtehaj et al. (2014) and Ebtehaj and Bonakdari (2014a, b(. The samples are in the following ranges: 1 <C V (ppm) <1280; 0.005 < R (m) < 0.136; 0.013 < λ s  < 0.053; 0.1 < D<0.45; 0.237 < V (m/s) < 1.216; 0.072 < d (mm) < 8.3; 0.153 < y/D < 0.84 and 5.06 < D gr < 142.

Proposed hybrid method for non-deposition sediment transport prediction

This section describes the hybrid comprising ANFIS and a global optimization method, i.e. Differential Evolution (ANFIS-DE). ANFIS-DE is used for modeling sediment transport at limit of deposition in sewer networks. The proposed ANFIS-DE is encoded in MATLAB Environmental. First, an initial ANFIS model is produced for the sediment transport data using the training dataset. Subsequently, DE is utilized to optimize the premise and consequent parameters of the model. Once the optimum parameter values are determined, the ANFIS-DE regression model is acquired and it can be used to predict the limiting velocity for non-deposition sediment transport in sewer systems.

Data preparation

The data utilized in this study are arranged as four inputs and one output parameter (Fr). To present a useful predictive model with scientific significance, model validation is essential in the analysis. Therefore, all datasets collected from the literature are divided in two subsets at a ratio of 70–30 for training and testing. The training dataset comprised 150 samples that were selected randomly, whereas the rest of the samples (68) were utilized for testing to confirm the prediction accuracy of the proposed model.

Model configuration

The values of the antecedent and consequent parameters related to the initial model generated with the training dataset are not optimized. In this study, the grid partitioning method is employed for Fuzzy Inference System (FIS) generation (Ebtehaj and Bonakdari 2014a). Moreover, there are 3 MFs for each input. The MFs were selected by trial and error between the objective function values of the ANFIS-DE model vs. the number of MFs. The best antecedent and consequent parameter values related to the rules are attained in the optimization process. The parameter values are presented in Table 1.

Table 1 DE control parameters

ANFIS model training using PSO

The structure of the proposed hybrid ANFIS-DE for sediment transport modeling is illustrated in Fig. 3. The network training process is defined by having to reach the convergence criterion and continuing until the number of iterations or objective functions determined is reached. The DE algorithm selection process is as follows:

Fig. 3
figure 3

Flowchart of proposed method (ANFIS-DE)

First, to avoid expansion process duplication, the values of the objective function and constraint functions for parameter V i,G are stored in the variables. U i, G + 1 refuses constraint function values greater than the V i,G value and rejects these constraint functions without reassessment. If V i,G is not convinced of all constraints, U i,G + 1 probing is done because the constraint is still much less than V i,G . As the U i,G + 1 and V i,G values are searched, the membership function to probe the new U G + 1 should be evaluated. This practice continues until the value of U i,G + 1 becomes greater than V i,G . In this case, the membership function value is not evaluated once more.

Stopping criteria

The maximum number of iterations and minimum defined error are utilized as stopping criteria, such that if one of the conditions is satisfied the optimization process in terminated. In this case, the objective function defined as RMSE (Eq. 25) is the smallest and the antecedent and consequent parameters are assigned to the ANFIS-DE model. The validation dataset is employed to validate the final model for accuracy and to predict the limiting velocity required to prevent sedimentation in sewer systems.

Results and discussion

The results of the Artificial Intelligence methods (ANFIS and ANFIS-DE) and regression equations in predicting sediment transport at limit of deposition in sewer systems are presented in this section. Figure 4 shows the results of Fr estimation using the ANFIS and ANFIS-DE methods. Model 1 that employs C V , D gr , d/R and λ s as inputs produced good results with both methods. Fr estimated by ANFIS-DE mostly had less than 10% relative error. According to Table 2, the average relative error of Model 1 for all samples in testing mode was about 8% (MARE = 0.076), but with ANFIS it was often more than 10%. The estimates were both under and overestimates (MARE = 0.099). Other indicators presented in the table also indicate the superiority of ANFIS-DE over ANFIS. Therefore, the DE evolutionary algorithm in Model 1 outperformed the hybrid algorithm (backpropagation with least squares).

Fig. 4
figure 4

Scatter plot of Fr predicted using ANFIS and ANFIS-DE (Testing)

Table 2 Performance evaluation of ANFIS and ANFIS-DE in predicting Fr for all models (training and testing)

In Models 1, 2 and 3, the influence of parameters related to the transport mode (d/R, D 2 /A & R/D) is examined. By applying the constant parameters of sediment (D gr), transport (C V ) and flow resistance (λ s ), besides using parameter D 2 /A instead of d/R (Model 2), the performance of both methods significantly decreased compared with Model 1. The statistical indices show that the relative error in estimating Fr reached over 20% (MARE for ANFIS = 0.24; MARE for ANFIS-DE = 0.232) and the RMSE of both methods was double that for Model 1. In this model, most estimations were considered underestimations, and using its results would lead to excessive solid deposition on the channel bed and reduced transmission capacity. Therefore, using D 2 /A as a transport mode parameter in addition to C V , D gr, and λ s has a negative impact on ANFIS and ANFIS-DE performance in predicting the limiting velocity (Fr). This resulted in high error values, low accuracy (R 2 for ANFIS = 061 and R 2 for ANFIS-DE = 0.72) and high sediment deposition.

Based on the statistical indices presented in Table 2 and the scatter plot in Fig. 4, Model 3 outperformed Model 2 but was inferior to Model 1. ANFIS-DE estimated Model 3 with fairly good accuracy, whereby the relative error decreased from 24% in Model 2 to about 13% in Model 3. However, Model 3 still made underestimations. The Fr predicted by ANFIS and the input combination of Model 3 that employed R/D as a transport mode parameter led to a decrease in relative error by 3% compared with Model 2. Hence, the Fr value had high relative error and was underestimated. Models 2 and 3 were relatively similar in accuracy. Model 3 had only 4% relative error, which was lower than Model 2, but it had a mean relative error of over 20%. Consequently, using Model 3 is not reliable, as the results would decrease sediment transport capacity and increase sedimentation on the bed channel. The results obtained for Models 1, 2 and 3 signify that considering λ s , D gr and C V as part of the “flow resistance,” “sediment” and “transport” groups (respectively) led to the best performance of ANFIS and ANFIS-DE when employing d/R as a “transport mode” parameter.

Regarding Models 4-6, the influence of the transport mode group parameters was investigated when d/D was used as a dimensionless parameter from the “sediment” group in this study. Model 4 produced good results with both methods: ANFIS (R 2 = 0.929; RMSE = 0.452; MAE = 0.385; MARE = 0.091 and δ = 0.088) and ANFIS-DE (R 2 = 0.965; RMSE = 0.323; MAE = 0.281; MARE = 0.065 and δ = 0.064), whereby most Fr predictions had less than 10% relative error. The difference between Models 1 and 4 is in their input combinations, as their parameters from the “sediment” group are D gr and d/D and the other parameters are the same. A general comparison of these models demonstrate their high accuracy. Both ANFIS and ANFIS-DE methods in Model 4 estimate with less than 10% relative error, but ANFIS showed an insignificantly greater relative error. A quantitative comparison of these models indicates that using d/D in Model 4 rather than D gr in Model 1 decreased the relative error about 1.1 and 0.8% for ANFIS-DE and ANFIS, respectively. Models 5 and 2 were compared to survey the effect of the “sediment” parameters (D gr and d/D) and D 2 /A as a “transport mode” parameter. It was found that Model 5 had nearly the same conditions as Model 2 and often underestimated but had a lower relative error. Consequently, the mean error of ANFIS-DE and ANFIS decreased from 24 to 18% and 23 to 16%, respectively. Evidently, the simultaneously use of d/D and D 2 /A may decrease the lack of effect of d/R as an input on limiting velocity estimation.

By ANFIS-DE, for input combination presented in the form of Model 6 reasonable results were obtained. However, this model had significant error of over 10% as overestimation and underestimation, which would reduce its reliability and lead to uneconomical plans or sedimentation, respectively. Using ANFIS to estimate Fr with the input combination of Model 6 presented weaker results than ANFIS-DE, with about 15% error. A comparison of Models 6 and 3 shows that the simultaneously use of of d/D and R/D has insignificant impact in comparison with D gr and R/D because the D gr is a function of d as well d/D. In addition, the D is absent in D gr but is considered in R/D in both models. In general, using the d/D parameter as a representative of the sediment group improves Fr estimation.

According to the explanations given, in addition to the parameters related to transport (C V ) and flow resistance (λ s ) that are constant, using parameter d/D from the sediment group and parameter d/R from transport mode offers the best performance (Model 4). A comparison of ANFIS and ANFIS-DE for all models signifies the superior performance of the hybrid method presented in this study.

Table 3 compares the model presented in this study (ANFIS-DE) with existing regression-based equations using different statistical indicators. According to the table and all indicators, the proposed model produced better results than the regression equations. Among the presented models, most indicators for May et al. (1996) equation indicated better results than Ebtehaj et al. (2014), but the difference is not significant enough to deem one better than another. It is worth noting that May et al. (1996) equation was superior to other equations in terms of MSE, MAE, MARE and δ, and a lower R 2; therefore, it is quite different from the estimate achieved by May et al. (1996), which increases its unreliability.

Table 3 Comparison of ANFIS-DE and existing regression-based equations (all data)

Additionally, existing equations were compared with ANFIS-DE in terms of discrepancy ratio (DR), which is equal to the ratio of the predicted value to the actual value. These results are presented in Fig. 5. The DR values for ANFIS-DE, and Ebtehaj et al. (2014) and May et al. (1996) equations are 0.99, 1.03 and 0.97, respectively. Although the difference is not vast, model accuracy and the average precision should be confirmed by considering the maximum relative error values as well. In Fig. 5 the DR value for ANFIS-DE is 1 ± 0.15, while the estimation results of the two regression equations indicate that some of the estimated values have large error. Thus, overestimation beyond a certain allowance results in non-economical project assessment, while underestimation below a certain value leads to sediment deposition and related problems. Using the equations would also result in less confidence.

Fig. 5
figure 5

DR values for ANFIS-DE and existing regression-based equations

Conclusion

Sediment transport in pipe channels is a critical subject, because using incorrect designs leads to sediment deposition and non-economical assessments. Therefore, a method that accurately estimates the limiting velocity to prevent sediment deposition is essential. In this study, a hybrid ANFIS was designed based on differential evolution called ANFIS-DE. It was used to predict the densimetric Froude number (Fr) parameter, which takes into account the limiting velocity effect. Accordingly, the factors affecting non-deposition sediment transport were determined and the dimensionless parameters were defined to propose six different models (Models 1–6). According to the results obtained for the effect of each dimensionless parameter on the estimated Fr parameter, using D 2 /A as a parameter in “transport mode” leads to significantly inferior performance of both methods. The average relative error attained was over 20% for both methods. The best result of each input combination was achieved by selecting d/R as representative of the “transport mode” group. Besides, by taking into account parameters related to “flow resistance,” “transport” and “transport mode,” using D gr instead of d/D led to a minor accuracy reduction of less than 2%. The best performance in estimating the Fr parameter was attained by the model that used the following dimensionless parameters: volumetric sediment concentration (C V ), ratio of median particle diameter to hydraulic radius (d/R), ratio of median particle diameter to pipe diameter (d/D) and overall friction factor of sediment (λ s ) (Fr = f (CV, d/R, d/D, λ s )). Moreover, a comparison of ANFIS-DE with ANFIS represented the superior performance of ANFIS-DE. Two regression-based methods presented based on semi-experimental and dimensional analysis were also compared with the results of ANFIS-DE. The comparison results indicated the superior performance of ANFIS.