Introduction

Research background

Nowadays, water resources face serious threats, driven mainly by climate change, urbanization, and inadequate water infrastructure1. Undoubtedly, rivers are vital inland water resources that serve a wide range of purposes, such as agricultural demands, industrial and recreational goals, and domestic consumption2,3. Accordingly, the essential role of existing water resources, in particular rivers, emphasizes the necessity of efficient water resource management4. Admittedly, the primary key to effective water resource management is monitoring water quality (WQ) on a regular basis5,6,7,8.

Nevertheless, frequent and accurate testing and sampling of existing water bodies are time-consuming and expensive, limiting the calibration and validation of WQ evaluation9,10. Developing appropriate data-intelligence models for WQ monitoring is therefore essential for better surface water management11,12. The main benefit of modeling the water quality-related variables of surface water is efficient and reliable water resource management at reduced cost: these models, as indirect procedures, can reliably predict future values of water quality-related variables13. Ensuring better WQ management demands the development of suitable models for WQ monitoring14,15,16,17. As a result, a better understanding of surface water-related parameters is a key element of reliable water resource management18, enabling reduced water treatment costs and better WQ sustainability.

Among several WQ variables, EC is an essential indicator of salinity that is highly significant for irrigation and other water usage purposes19. Its importance to surface water quality stems from its close relationship with total dissolved solids (TDS) and with dissolved ionic solutes, including sodium (Na+), chloride (Cl−), magnesium (Mg2+), sulfate (SO42−), and calcium (Ca2+)5,19. Not only does the ionic composition affect the growth of plants, but it also decreases the quality of drinking water remarkably. Besides, EC plays an essential role in salinity hazard measurement for irrigation and drinking water5,19,20.

Problem statement

Water quality variable models are critically important tools for aquatic systems research, enabling appropriate evaluation and prediction of surface WQ for effective water resource management7,21,22,23,24. Indeed, such prediction supports efficient measures to ensure that pollution levels remain within permissible limits25. One of the essential tasks in reaching optimal resource management is to predict the WQ parameters accurately. Although traditional process-based modeling methods can predict WQ parameters accurately, these models have some restrictions3,26,27,28,29: they are limited to a single particular catchment and to certain types of data stochasticity and data redundancy. In addition, they rely on data sets that require a great deal of processing time and on input data that may be unavailable. Furthermore, since WQ is affected by many distinct parameters that exhibit sophisticated non-linear relationships with the predicted WQ variables, conventional data processing strategies are not efficient enough to solve this problem30.

Literature review predictive models

In recent years, various modelling procedures have been applied to enhance the prediction accuracy of diverse WQ models3. For example, diverse mathematical models based on statistical perspectives were established, i.e., the linear regression model31, moving average (MA), and autoregressive integrated moving average (ARIMA)32,33. The foregoing models have been developed and employed to predict water quality; however, they face some obstacles. One drawback of statistical models is that they assume linear, normally distributed relationships between predictors and response26. Moreover, these models are unlikely to supply accurate predictions owing to the shortage of authentic tools to gather observation data for the timeframe, the complexity of influential criteria in prediction, and their inability to capture the non-stationarity and nonlinearity of the WQ parameters34. Nowadays, artificial intelligence methods can significantly improve model accuracy and reliability35,36,37,38. The use of machine learning (ML) models for environmental problems is experiencing an increasing trend, which stems from their striking ability to solve sophisticated non-linear problems11,14,28,39,40,41,42 and their non-reliance on prior knowledge of the physical processes, although these ML models need large data volumes to work properly. As an influential procedure for modeling sophisticated non-linear systems, ML has also benefited greatly from advances in parallel computing and computational capabilities, encouraging researchers to adopt ML methods5.

Recently, the least square support vector machine (LSSVM), multivariate adaptive regression splines (MARS), and M5 model tree (M5Tree) were applied to predict free ammonia (AMM), total Kjeldahl nitrogen (TKN), water temperature (WT), total coliform (TC), fecal coliform (FC), and pH43. The MARS and LSSVM models proved more capable than the other methods. An adaptive neuro fuzzy inference system (ANFIS) was developed by Ref.44 to forecast the WQ parameters in Manzala Lake, leading to accurate WQ parameter predictions and, ultimately, detection of the total nitrogen and phosphorus contents in the region. A combination of an artificial neural network (ANN) model and a multi-objective genetic algorithm (MOGA) was employed by Ref.45 to enhance WQ prediction performance, yielding better capability.

Recently, hybrid wavelet-artificial neural network (WANN) procedures forecasted the EC of river water20 to assess the WQ parameters based on limited time series data, revealing that the WANN model led to improved modeling. Barzegar et al.39 applied ANN, ANFIS, wavelet-ANN, and wavelet-ANFIS to evaluate water salinity levels according to Ca2+, Mg2+, Na+, SO42−, and Cl− in rivers on a monthly basis. Ravansalar et al.8 developed a new hybrid wavelet-linear genetic programming (WLGP) model to forecast monthly sodium (Na+) concentration, demonstrating the remarkable ability of the WLGP model to predict Na+ peak values. Furthermore, Barzegar et al.6 modelled multi-step-ahead EC via a hybrid wavelet-extreme learning machine (WA-ELM) model, compared against an adaptive neuro-fuzzy inference system (ANFIS). Li et al.15 combined a recurrent neural network (RNN) with modified Dempster/Shafer (D–S) evidence theory to create a hybrid RNNs-DS model. Jafari et al.14 used a hybrid wavelet-genetic programming (WGP) model for improved prediction of water biochemical oxygen demand (BOD); this model was assessed against five ML models (W-ANN, ANN, GP, DT, and BN) and outperformed all of them. Najah Ahmed et al.26 presented a Neuro-Fuzzy Inference System (WDT-ANFIS) based on an augmented wavelet de-noising technique, dependent on historical data of the WQ parameters. They used three evaluation techniques to address diverse influences on the model and found that the proposed model could forecast all WQ parameters.

In hybrid models, Deng et al.34 investigated a multi-factor WQ time series prediction model based on heuristic Gaussian cloud transformation, which led to enhanced forecasting accuracy. Zhou et al.24 combined an improved grey relational analysis (IGRA) algorithm and a long short-term memory (LSTM) neural network for WQ prediction, with the proposed model performing notably better than the benchmarked models. Haji Seyed Asadollah et al.3 introduced a new ensemble machine learning model, extra tree regression (ETR), to forecast monthly water quality index (WQI) values at the Lam Tsuen River in Hong Kong. Comparing this model with classic standalone models, support vector regression (SVR) and decision tree regression (DTR), they found the ETR model produced more reliable WQI predictions in both training and testing phases. Dehghani et al.46 introduced a hybrid of the grey wolf optimizer (GWO) with the ANFIS model to forecast multi-step-ahead influent flow rate. Their results indicated that the proposed model could reliably estimate the influent flow rate from 5 min up to 10 days ahead. For the prediction of dissolved oxygen (DO) at two stations on the Yangtze River, China, an improved least square support vector machine (LSSVM) coupled with the sparrow search algorithm (SSA) was introduced by Ref.47; in addition, variational mode decomposition (VMD) was applied to denoise the input dataset. Their results indicated that the proposed model predicts DO more efficiently than standalone LSSVM, VMD-LSSVM, and SSA-LSSVM. The reviewed literature strongly emphasizes the application of hybrid machine learning models to diverse environmental engineering problems48,49,50, and these newly developed versions have proven to be trustworthy computer-aided models for highly stochastic and non-linear historical big data51,52.

Main objectives and contributions

Over the past decade, researchers, particularly hydrologists, have shown a remarkable increasing interest around the world in discovering influential computational models for surface WQ simulation53,54, accompanied by noteworthy advancements in modeling. Needless to say, perfectly understanding surface WQ, as a natural problem, is a challenging issue. In turn, the key goal of devising a novel hybridized version of ML models is to address this challenge more effectively. The need for internal model parameter tuning, data clustering and cleaning, data preprocessing, and several other steps imposes some limits on standalone ML models. Hence, the current study is motivated to develop an efficient hybrid model coupled with wavelet theory, called the wavelet adaptive neural fuzzy inference system coupled with an adaptive hybrid of differential evolution and particle swarm optimization (W-ANFIS-A-DEPSO). In fact, the main contribution of this study is to hybridize the ANFIS model with an efficient optimization method called A-DEPSO, forming a novel hybrid model. The A-DEPSO algorithm uses a powerful local and global search mechanism to avoid local solutions and move toward the global solution. In addition, it uses adaptive control parameters to help the algorithm balance exploration and exploitation. The model is designed to predict the EC index on a monthly basis at the Maroon River, Iran. Accordingly, a data set including monthly discharge (Q) and EC measured over more than three decades, from 1980 to 2016, at the Tange-Takab station is used for modeling.

A set of preprocessing analyses, essential to the proposed model training process, is used to select the appropriate input parameters for the predictive models. To identify the best input combinations for EC, the best subset regression approach is applied. The predictive abilities of the standalone and wavelet-based ML models are then evaluated for the selected combinations using a wide range of statistical metrics, graphical tools, and error analysis.

Methodology

Adaptive neuro fuzzy inference system (ANFIS)

ANFIS is a hybrid method merging the ANN and fuzzy logic, first introduced by Ref.55. ANFIS uses IF–THEN fuzzy rules (FRs) to describe the relationship between the input and target datasets of a modeling problem56,57. The main structure of the ANFIS model is displayed in Fig. 1. In this model, the Takagi–Sugeno inference procedure plays an integral role in generating the if–then rules mapping inputs to outputs.

$$ {\text{Rule1}}:{\text{ if}}\,x\,{\text{is}}\,D_{1} \,{\text{and}}\,{\text{y}}\,{\text{is}}\,C_{1} \,,{\text{ then}}\,g_{1} = \alpha_{1} x + \beta_{1} y + \gamma_{1} , $$
(1)
$$ {\text{Rule2}}:{\text{ if}}\,x\,{\text{is}}\,D_{2} \,{\text{and}}\,{\text{y}}\,{\text{is}}\,C_{2} \,,{\text{ then}}\,g_{2} = \alpha_{2} x + \beta_{2} y + \gamma_{2} $$
(2)

in which \({\alpha }_{1}\), \({\beta }_{1}\), \({\gamma }_{1}\), and \({\alpha }_{2}\), \({\beta }_{2}\), \({\gamma }_{2}\) denote the consequent parameters, and \({D}_{1}\), \({D}_{2}\), \({C}_{1}\), and \({C}_{2}\) are the membership functions (MFs). The ANFIS comprises five layers, accepting a wide range of inputs and producing a single output. This structure is elaborated as follows:

Figure 1
figure 1

Schematic of ANFIS model.

In the first layer, each node applies a specific parameterized function, here the bell MF, to generate a membership degree (\(\psi \)).

$${Q}_{1k}={\psi }_{Dk}\left(x\right), k=\mathrm{1,2},$$
(3)
$$\begin{array}{cc}& {Q}_{1k}={\psi }_{{C}_{k-2}}\left(y\right), k=\mathrm{3,4}\\ & \psi (x)=\frac{1}{1+{\left[{\left(\frac{x-{e}_{k}}{{a}_{k}}\right)}^{2}\right]}^{{m}_{k}}},k=\mathrm{1,2}\end{array}$$
(4)

in which \({a}_{k}\), \({e}_{k}\), and \({m}_{k}\) are the membership function parameters. At the second layer, each node output is regarded as an input signal, indicating the firing strength of each rule.

$${Q}_{2k}={\psi }_{{D}_{k}}\left(x\right)\times {\psi }_{{C}_{k}}\left(y\right).$$
(5)

In layer 3, the firing strength of the kth rule is normalized by the sum of all rules' strengths.

$$ Q_{3k} = \overline{w}_{k} = \frac{{\omega_{k} }}{{\mathop \sum \nolimits_{k = 1}^{2} \omega_{k} }} $$
(6)

In addition, the adaptive nodes can be estimated in layer 4.

$$ Q_{4k} = \overline{w}_{k} g_{k} = \overline{w}_{k} \left( {\alpha_{k} x + \beta_{k} y + \gamma_{k} } \right). $$
(7)

Eventually, layer 5 computes the network output.

$${Q}_{5\mathrm{k}}=\sum_{k} {w}_{k}{g}_{k}.$$
(8)
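To make the layer-by-layer computation of Eqs. (1)–(8) concrete, the following minimal Python sketch (our illustration, not the authors' code) evaluates a two-input, two-rule ANFIS forward pass with bell MFs; all parameter values shown are hypothetical.

```python
import numpy as np

def bell_mf(x, a, e, m):
    """Generalized bell membership function (Eq. 4)."""
    return 1.0 / (1.0 + ((x - e) / a) ** 2) ** m

def anfis_forward(x, y, mf_params, consequents):
    """Forward pass through the five ANFIS layers for two inputs, two rules.
    mf_params: (a, e, m) tuples for D1, D2, C1, C2;
    consequents: (alpha, beta, gamma) per rule (Eqs. 1-2)."""
    # Layer 1: membership degrees (Eqs. 3-4)
    psi_D = [bell_mf(x, *mf_params[k]) for k in (0, 1)]
    psi_C = [bell_mf(y, *mf_params[k]) for k in (2, 3)]
    # Layer 2: firing strength of each rule (Eq. 5)
    w = [psi_D[0] * psi_C[0], psi_D[1] * psi_C[1]]
    # Layer 3: normalized firing strengths (Eq. 6)
    w_bar = [wk / sum(w) for wk in w]
    # Layer 4: weighted rule outputs (Eq. 7)
    g = [a * x + b * y + c for (a, b, c) in consequents]
    q4 = [wb * gk for wb, gk in zip(w_bar, g)]
    # Layer 5: network output (Eq. 8)
    return sum(q4)
```

In the hybrid model proposed later, the MF parameters and consequent coefficients of this forward pass are the decision variables tuned by the optimizer.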

Least square support vector machine (LSSVM)

Suykens and Vandewalle introduced a new variant of the support vector machine (SVM) called LSSVM, which uses linear equations to raise the convergence speed58,59,60, whereas the SVM is trained with a quadratic programming technique61. To put it simply, in terms of simple structure and high convergence speed, the LSSVM has the edge over SVM, which has made it more popular in regression and classification fields61,62. In the following formulation, the training dataset is defined through (\({x}_{n}\), \({y}_{n}\)), n = 1, 2, …, N, where \({x}_{n}\) and \({y}_{n}\) are the input and output data.

$$ Minimize_{{\psi ,{\text{b}},\zeta }} F(\psi ,\zeta ) = \frac{1}{2}\psi^{T} \psi + \frac{1}{2}\lambda \mathop \sum \limits_{n = 1}^{N} \zeta_{n}^{2} , $$
(9)
$$ Subject\,to: $$
$$ y_{n} = \psi^{T} \theta \left( {x_{n} } \right) + b + {\upzeta }_{n} {, }n = 1{,} 2{,} \ldots {,} N, $$
(10)

where \(\lambda\) is a penalty parameter, \({\upzeta }_{n}\) is the regression error, \(\theta \left( {x_{n} } \right)\) is a non-linear mapping function, \(\psi^{T}\) is the transposed output layer vector, and \(b\) is the bias term. In addition, Eqs. (9) and (10) may be expressed through the Lagrange procedure. So,

$$ L(\psi ,b,\zeta ,\beta ) = F(\psi ,\zeta ) - \mathop \sum \limits_{n = 1}^{N} \beta_{n} \left( {\psi^{T} \theta \left( {x_{n} } \right) + b + {\upzeta }_{n} - y_{n} } \right). $$
(11)

in which \({\beta }_{n}\) is a Lagrange multiplier. Through the Karush–Kuhn–Tucker (KKT) conditions, the following solutions are gained:

$$ \frac{\partial L}{{\partial \psi }} = 0 \to \psi = \mathop \sum \limits_{n = 1}^{N} \beta_{n} \theta \left( {x_{n} } \right), $$
(12)
$$ \frac{\partial L}{{\partial b}} = 0 \to \mathop \sum \limits_{n = 1}^{N} \beta_{n} = 0, $$
(13)
$$ \frac{\partial L}{{\partial {\upzeta }_{n} }} = 0 \to \beta_{n} = \lambda {\upzeta }_{n} , $$
(14)
$$ \frac{\partial L}{{\partial \beta_{n} }} = 0 \to \psi^{T} \theta \left( {x_{n} } \right) + b + {\upzeta }_{n} - y_{n} = 0. $$
(15)

The linear equations mentioned below are obtained by eliminating the \(\psi\) and \({\upzeta }_{n}\) parameters.

$$\left[\begin{array}{cc}0& {g}_{n}^{T}\\ {g}_{n}& Krl+{\lambda }^{-1}e\end{array}\right]\left[\begin{array}{c}b\\ \beta \end{array}\right]=\left[\begin{array}{c}0\\ y\end{array}\right]$$
(16)

in which \({g}_{n}={[1,\dots , 1]}^{T}\), \(\beta ={{[\beta }_{1}, \dots , {\beta }_{n}]}^{T}\), \(y={{[y}_{1}, \dots , {y}_{n}]}^{T}\), and \(e\) is the identity matrix. \(Krl\) is the kernel matrix, whose entries are the kernel functions formulated as,

$$Krl\left({x}_{n},{x}_{j}\right)=\theta {\left({x}_{n}\right)}^{T}\theta \left({x}_{j}\right).$$
(17)

Radial basis functions (RBF) are operated as the kernel function as:

$$Krl\left({x}_{n}\text{,} {x}_{j}\right)=\mathrm{exp}\left(\frac{-{\Vert {x}_{n}-{x}_{j}\Vert }^{2}}{{\gamma }^{2}}\right)$$
(18)

in which \(\gamma \) is a fixed kernel width parameter.
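As an illustrative sketch (our own, under the assumption of the RBF kernel of Eq. 18), the LSSVM training of Eq. (16) reduces to solving one linear system for the bias b and multipliers β, after which prediction is a kernel-weighted sum:

```python
import numpy as np

def rbf_kernel(X1, X2, gamma):
    """RBF kernel (Eq. 18): exp(-||x_n - x_j||^2 / gamma^2)."""
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-d2 / gamma ** 2)

def lssvm_fit(X, y, lam=10.0, gamma=1.0):
    """Solve the LSSVM linear system of Eq. (16) for b and beta."""
    N = len(y)
    K = rbf_kernel(X, X, gamma)
    A = np.zeros((N + 1, N + 1))
    A[0, 1:] = 1.0                    # g_n^T
    A[1:, 0] = 1.0                    # g_n
    A[1:, 1:] = K + np.eye(N) / lam   # Krl + lambda^{-1} e
    rhs = np.concatenate(([0.0], y))
    sol = np.linalg.solve(A, rhs)
    return sol[0], sol[1:]            # b, beta

def lssvm_predict(X_train, b, beta, X_new, gamma=1.0):
    """Predict: f(x) = sum_n beta_n * Krl(x, x_n) + b (from Eq. 12)."""
    return rbf_kernel(X_new, X_train, gamma) @ beta + b
```

With a large penalty λ the regularization term vanishes and the model nearly interpolates the training targets, mirroring the role of λ in Eq. (9).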

Generalization regression neural network (GRNN)

In 1991, Specht63 introduced a probabilistic neural network based on radial basis functions (RBF), known as the generalized regression neural network (GRNN). Nowadays, this model is usually used for classification and regression when dealing with non-linear fitting systems and large-scale samples, and it operates as a nonparametric kernel regression network. Overall, GRNN tends to suffer less from local minima during learning and has fewer adaptation parameters in comparison with backpropagation and RBF artificial neural networks64. The model consists of four layers: an input layer, a radial (pattern) layer, a regression (summation) layer, and an output layer65. In the pattern layer, which receives the input data in the training step, the number of neurons is identical to the number of data sample points. In the summation layer, a dedicated neuron (rather than the output layer) estimates the density function, while the other neurons estimate the output. To sum up, the GRNN model requires less operation time than other ANNs, since the method establishes the mapping between predictors and target directly65. The model uses a single control parameter, called the spread parameter, which sets the width of the RBFs and is tuned to obtain the best fit.
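The four-layer GRNN output can be sketched as a kernel-weighted average of the training targets; the following minimal Python illustration (ours, with a Gaussian RBF of the usual form, not taken from the paper) shows the role of the spread parameter:

```python
import numpy as np

def grnn_predict(X_train, y_train, x, spread=0.5):
    """GRNN output for a query x: the pattern (radial) layer computes one RBF
    activation per training sample; the summation and output layers form the
    activation-weighted average of the training targets."""
    d2 = ((X_train - x) ** 2).sum(axis=1)          # squared distances to samples
    w = np.exp(-d2 / (2.0 * spread ** 2))          # pattern-layer activations
    return (w @ y_train) / w.sum()                 # summation / output layers
```

A small spread makes the prediction follow the nearest training sample; a large spread smooths the response toward the global mean, which is exactly the trade-off tuned when fitting the spread parameter.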

Multivariate adaptive regression spline (MARS)

The MARS method, presented by Friedman66, is an innovative extension of stepwise linear regression (SLR) used to solve modeling problems with many input parameters. Regarding its operation, MARS works differently from SLR in that it fits diverse slopes (i.e., linear basis functions (LBFs)) over different sub-domains of a variable's range, whilst the SLR uses one slope per input variable67. Therefore, MARS can extract more information than SLR about how an essential variable impacts the dependent variable. Consequently, MARS is built on the LBF structure and stems from the SLR classification68, and no prior knowledge is needed to determine the LBF numbers and parameters. In turn, it can be said that the MARS method has the edge over SLR thanks to this merit. Also, it is noteworthy that the connection between input and output data emerges through a set of elementary LBFs; an LBF is formulated as,

$${D}_{n}\left(x\right)=\mathrm{max}\left(0, B-x\right) or {D}_{n}\left(x\right)=\mathrm{max}\left(0, x-B\right)$$
(19)

in which \(B\) is a knot value used to divide the \(x\) range into sub-ranges, \({D}_{n}\) is a basis function, and \(x\) is the input dataset. The fundamental MARS formulation is expressed as,

$$ G\left( x \right) = \alpha_{0} + \mathop \sum \limits_{n = 1}^{N} \alpha_{n} D_{n} \left( x \right), $$
(20)

where \(G\left(x\right)\) denotes the output, N is the total number of basis functions, and \({\alpha }_{0}, {\alpha }_{1}, \dots , {\alpha }_{N}\) are weighting factors in the MARS method.

There are two fundamental steps in the MARS application process: the forward and backward stages. In the forward stage, LBFs are added to the model to reduce the training error, until the allowed total number of LBFs is reached. The backward stage then reduces the overfitting tendency by eliminating the extra LBFs. The resulting sub-models are assessed using the generalized cross-validation (GCV) index69,70, so,

$$GCV=\frac{MSE}{(1-\frac{m+0.5\times pen\times (m-1)}{n}{)}^{2}}$$
(21)

in which \(MSE\) is the mean square error, \(m\) is the number of LBFs, \(n\) is the number of observations in the training dataset, and \(pen\) is a penalty factor in the range recommended by Friedman and Jekabsons69,70.
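The hinge pair of Eq. (19), the additive model of Eq. (20), and the GCV of Eq. (21) can be sketched directly (our illustration; the knot B, weights, and pen value below are hypothetical):

```python
import numpy as np

def hinge_pair(x, B):
    """The two mirrored linear basis functions of Eq. (19) at knot B."""
    return np.maximum(0.0, B - x), np.maximum(0.0, x - B)

def mars_predict(x, alpha0, alphas, basis):
    """Eq. (20): G(x) = alpha_0 + sum_n alpha_n * D_n(x); basis holds callables."""
    return alpha0 + sum(a * D(x) for a, D in zip(alphas, basis))

def gcv(mse, m, n, pen=3.0):
    """Generalized cross-validation of Eq. (21): penalizes model size m
    relative to the n training observations."""
    return mse / (1.0 - (m + 0.5 * pen * (m - 1)) / n) ** 2
```

During the backward stage, candidate sub-models are compared by this GCV value, so a basis function is dropped when removing it lowers GCV despite a slightly higher MSE.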

Wavelet theory

To analyze non-stationary signals appropriately, the wavelet transform (WT) is employed as a novel and efficient method. This is because the method is more flexible than the Fourier transform, offering flexibility between the time scale and frequency71, and it has the advantage of analyzing signals at diverse degrees of the time scale. To put it simply, a wavelet is an oscillating function of time whose energy is restricted to a fixed span of time. Provided that \(\varphi\) denotes the mother wavelet, the continuous wavelet transform (CWT) is defined by the equation mentioned below72,73:

$$\omega \left(b,c\right)=\int f(t)\times \left(\frac{1}{\sqrt{b}}\right)\times \varphi \left(\frac{t-c}{b}\right)dt,$$
(22)

where the b factor is a scale depicting the stretch or duration of the wavelet, and the c factor is a translation parameter supplying the required time localization and defining the position of the wavelet on the time axis. In addition, the discrete wavelet transform (DWT) is commonly utilized for analyzing time series, since discrete-time series are conventional in hydro-climatological works. For any position in the signal (c) and any scale value (b), the wavelet coefficients are measurable by the equation mentioned below:

$$ \varphi_{b,c} \left( t \right) = \frac{1}{\sqrt b }\varphi \left( {\frac{t - c}{b}} \right). $$
(23)

Regarding the DWT, the scale and translation factors are discretized as,

$$ b = 2^{k} , c = 2^{k} l. $$
(24)

where k and l are integers. Substituting b and c into the wavelet function yields Eq. (25):

$$ \varphi_{k,l} \left( t \right) = 2^{ - k/2} \varphi \left[ {2^{ - k} t - l} \right]. $$
(25)

This gives the discrete wavelet function, and the DWT becomes:

$$ \omega \left( {b,c} \right) = 2^{ - k/2} \smallint f\left( t \right) \times \varphi (2^{ - k} t - l)dt. $$
(26)
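A single level of the dyadic DWT of Eqs. (24)–(26) can be sketched with the Haar wavelet (our minimal illustration; the paper does not state that the Haar wavelet was used):

```python
import numpy as np

def haar_dwt_step(signal):
    """One level of the discrete wavelet transform with the Haar wavelet:
    the dyadic scaling b = 2^k (Eq. 24) halves the signal into approximation
    (low-frequency) and detail (high-frequency) coefficients."""
    s = np.asarray(signal, dtype=float)
    approx = (s[0::2] + s[1::2]) / np.sqrt(2.0)   # pairwise averages
    detail = (s[0::2] - s[1::2]) / np.sqrt(2.0)   # pairwise differences
    return approx, detail
```

In wavelet-hybrid models such as those considered here, sub-series like these (approximations and details) typically replace the raw series as model inputs.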

Proposed ANFIS-ADEPSO

The ANFIS model uses a classical optimization method to minimize the difference between the target and estimated outputs. This optimization method combines the least squares solver (LSS) and gradient descent (GD) methods. The optimal MFs of the input parameters and the coefficients of the linear relations of the FRs are determined by the hybrid optimization method during the training stage. One of the most severe critiques of classical optimization methods is their tendency to get stuck in local solutions36, making metaheuristic optimization methods such as A-DEPSO a helpful choice74. The flowchart of the A-DEPSO algorithm coupled with the ANFIS model is displayed in Fig. 2 and described in the following section.

Figure 2
figure 2

Flowchart of the proposed ANFIS-A-DEPSO model.

Adaptive hybrid of DE and PSO (A-DEPSO)

In this study, the adaptive hybrid of DE and PSO (A-DEPSO) introduced by Ref.74 is used to determine the ANFIS model's decision parameters; its main operators are mutation, crossover, refreshing, and selection. The proposed A-DEPSO algorithm is described in the following sections.

Mutation in A-DEPSO

Generally, a mutation operator can promote the efficiency of an optimization method75,76. The A-DEPSO algorithm takes advantage of a powerful mutation operator to increase the local and global search ability. The DE mutant vector (\({XDE}_{l\text{,}j}\), Eq. 27) and the vector created by the PSO algorithm (\({XPSO}_{l\text{,}j}\), Eq. 29) are combined into the new vector \({X}_{new}\) (Eq. 30), formulated as,

$$ XDE_{{l{,}j}} = x_{l,j} + G.\left( {xp_{{l{,}j}} - x_{l,j} } \right) + G.\left( {x_{a1} - x_{a2} } \right), $$
(27)
$$ V_{{l{,}j}} = w.V_{{l{,}j}} + c_{1} .rand_{1} .\left( {xp_{{l{,}j}} - x_{l,j} } \right) + c_{2} .rand_{2} .\left( {x_{g} - x_{l,j} } \right), $$
(28)
$$ XPSO_{{l{,}j}} = x_{l,j} + V_{{l{,}j}} , $$
(29)
$$ X_{new} = \rho {{ \times }}XPSO_{{l{,}j}} { + (1} - \rho ){{ \times }}XDE_{{l{,}j}} , $$
(30)
$$ G = \sin \left( {\beta \times \pi \times \left( \frac{l}{L} \right)} \right) \times \exp \left( { - \frac{l}{L}} \right) \times \left( {0.5 + 0.15 \times randn} \right), $$
(31)
$$ w = \delta \times \exp \left( { - \frac{l}{L}} \right), $$
(32)

where \(\beta \) and \(\delta \) denote two constant numbers, \(G\) denotes an adaptive parameter for scaling the differential vectors, \(\rho \) is a random number in the range of [0, 1], and \({c}_{1}\) and \({c}_{2}\) denote two constant numbers whose values are equal to each other and set to 1.5. \({rand}_{1}\) and \({rand}_{2}\) denote two random numbers in the range of [0, 1], \(w\) denotes an inertial factor controlling the velocities of particles, and \(randn\) denotes a random number with normal distribution. \(l\) and \(L\) denote the current iteration number and the maximum number of iterations, correspondingly. \({xp}_{l\text{,}j}\) and \({x}_{g}\) are the personal best of solution \(j\) and the best-so-far solution, correspondingly, and \({x}_{a1}\) and \({x}_{a2}\) are two randomly selected solutions from the population.
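Eqs. (27)–(32) can be sketched in a few lines of Python (our illustration; β, δ, and the random seed below are hypothetical values for demonstration):

```python
import numpy as np

rng = np.random.default_rng(0)

def adepso_mutant(x, xp, xg, v, xa1, xa2, l, L,
                  beta=1.0, delta=0.9, c1=1.5, c2=1.5):
    """Blend the DE mutant (Eq. 27) and the PSO update (Eqs. 28-29) into
    X_new (Eq. 30), using the adaptive scale G (Eq. 31) and inertia w (Eq. 32)."""
    G = (np.sin(beta * np.pi * l / L) * np.exp(-l / L)
         * (0.5 + 0.15 * rng.standard_normal()))              # Eq. 31
    w = delta * np.exp(-l / L)                                # Eq. 32
    x_de = x + G * (xp - x) + G * (xa1 - xa2)                 # Eq. 27
    v_new = (w * v + c1 * rng.random() * (xp - x)
             + c2 * rng.random() * (xg - x))                  # Eq. 28
    x_pso = x + v_new                                         # Eq. 29
    rho = rng.random()
    return rho * x_pso + (1 - rho) * x_de, v_new              # Eq. 30
```

Note how G decays with the iteration ratio l/L, so the differential perturbation shrinks as the search converges, while ρ randomly weights the DE- and PSO-driven candidates at each step.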

Crossover in A-DEPSO

The A-DEPSO uses a new binomial crossover (BC) to boost the population diversity. The BC merges three vectors, comprising the vector \({X}_{new}\), \({x}_{g}\), and the current solution (\({x}_{l,j}\)), by utilizing an adaptation rate parameter (\({A}_{r}\)); the crossover vector (\({z}_{i}\)) is created by the following equation:

$${z}_{i}=\left\{\begin{array}{l}{X}_{new,i} \quad \left( if {p}_{a}<{A}_{r} \; and \; {p}_{b}<0\text{.}5\right) \; or \; i= {i}_{rand } \\ {x}_{j,i} \quad \left(if {p}_{a}<{A}_{r} \; and \; {p}_{b}>0\text{.}5\right) \; or \; i= {i}_{rand} \\ {x}_{g,i} \quad if \; {p}_{a}>{A}_{r},\end{array}\right.$$
(33)
$${A}_{r}={m}_{cr}+0.1\times randn,$$
(34)

where \({A}_{r}\) denotes the adaptive rate for the BC, expressed as Eq. (34). \({p}_{a}\) and \({p}_{b}\) denote two random parameters in the range of [0, 1], and \({i}_{rand}\) denotes a random integer number in the range of [1, D]. \({m}_{cr}\) is equal to 0.5 in the first iteration, and its value is changed according to the relationship suggested by Ref.77, defined as,

$${m}_{cr}=\left(1-\mu \right)\times {m}_{cr}+\mu \times mean\left({S}_{Ar}\right),$$
(35)

where \(\mu \) is equal to 0.1 and \({S}_{Ar}\) denotes all successful \({A}_{r}\) values over the iterations. According to Eq. (35), the A-DEPSO determines the best value for \({A}_{r}\) at each iteration, helping it perform an appropriate search of the solution space.
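The component-wise choice of Eq. (33) and the adaptation of Eq. (35) can be sketched as follows (our illustration; the random seed is arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)

def adepso_crossover(x_new, x_cur, x_g, A_r):
    """Binomial crossover of Eq. (33): with probability A_r (or at the forced
    index i_rand) each component of z comes from X_new or the current solution,
    split by p_b; otherwise it comes from the best-so-far solution x_g."""
    D = len(x_new)
    i_rand = rng.integers(D)
    z = np.empty(D)
    for i in range(D):
        p_a, p_b = rng.random(), rng.random()
        if p_a < A_r or i == i_rand:
            z[i] = x_new[i] if p_b < 0.5 else x_cur[i]
        else:
            z[i] = x_g[i]
    return z

def update_mcr(m_cr, successful_Ar, mu=0.1):
    """Adaptation of m_cr (Eq. 35) from the successful A_r values S_Ar."""
    return (1 - mu) * m_cr + mu * np.mean(successful_Ar)
```

Because m_cr drifts toward the mean of the A_r values that produced improvements, the crossover rate self-tunes to whatever mixing ratio is currently succeeding.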

Refreshing operator in A-DEPSO

The refreshing operator (RO) is added to the A-DEPSO to enhance the convergence speed. The RO updates the vector \(z\) generated by the BC using two solutions \({x}_{1}\) and \({x}_{2}\), which promote the exploitation capability of the A-DEPSO. The RO is formulated as,

$$z=\left\{\begin{array}{l}{x}_{1 } \; if \; rand<LC \; and \; rand<0.5\\ {x}_{2} \; if \; rand<LC \; and \; rand>0.5\\ z \; if \; rand>LC\end{array}\right.$$
(36)

in which

$${x}_{1}=z+\sigma .\left(2.randn.{x}_{g}- {x}_{l\text{,}j}\right),$$
(37)
$${x}_{2}={x}_{g}+\sigma \times \left({x}_{a1}- {x}_{a2}\right),$$
(38)

where \(LC\) denotes a logistic chaotic map78,79 (\(LC=4.LC.(1-LC)\)) with an initial value of 0.7. The LC is applied to increase the random behavior of A-DEPSO and avoid local solutions.
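The chaotic sequence that gates the refreshing operator is a simple deterministic iteration; a minimal sketch (ours) of the stated map with the stated initial value:

```python
def logistic_chaos(n, lc=0.7):
    """Iterate the logistic chaotic map LC = 4*LC*(1 - LC) used by the
    refreshing operator, starting from the initial value 0.7."""
    seq = []
    for _ in range(n):
        lc = 4.0 * lc * (1.0 - lc)
        seq.append(lc)
    return seq
```

Despite being deterministic, the sequence is aperiodic and covers (0, 1) irregularly, which is why it serves as a cheap pseudo-random switch between x1, x2, and z in Eq. (36).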

Selection operator in A-DEPSO

A-DEPSO uses the selection operator (SO) to determine whether the solution \(z\) is better than the current solution (\({x}_{l\text{,}j}\)). Based on the SO, the solution in the next iteration (\({x}_{l+1\text{,}j}\)) is formulated as,

$${x}_{l+1\text{,}j}=\left\{\begin{array}{ll}z \quad if \; f(z) >f({x}_{l\text{,}j}) \\ {x}_{l\text{,}j} \quad otherwise.\end{array}\right.$$
(39)

Performance evaluation

This section introduces seven statistical metrics for assessing the efficiency of the five ML models: root mean square error (RMSE)39, correlation coefficient (R)6, mean absolute error (MAE)80, relative absolute error (RAE)80, Willmott's index of agreement (IA)81, Legates and McCabe's index (\(E\))82, and mean absolute percentage error (MAPE)83, which are formulated as,

$$ RMSE = \left( {\frac{1}{M}\sum\limits_{{k = 1}}^{M} {\left( {EC_{{P,k}} - EC_{{o,k}} } \right)^{2} } } \right)^{{0.5}} , $$
(40)
$$R=\frac{\sum_{k=1}^{M}\left({EC}_{P,k}-\overline{{EC }_{P}}\right).({EC}_{o,k}-\overline{{EC }_{o}})}{\sqrt{\sum_{k=1}^{M}{({EC}_{P,k}-\overline{{EC }_{P}})}^{2}\sum_{k=1}^{N}{({EC}_{o,k}-\overline{{EC }_{o}})}^{2}}},$$
(41)
$$RAE=\frac{{\sum }_{k=1}^{M}\left|{EC}_{P,k}-{EC}_{o,k}\right|}{{\sum }_{k=1}^{M}\left|{EC}_{P,k}-\overline{{EC }_{o}}\right|},$$
(42)
$$ MAE = \left( {\frac{1}{M}} \right)\sum\limits_{{i = 1}}^{M} {\left| {EC_{{P,k}} - EC_{{o,k}} } \right|,} $$
(43)
$$ MAPE\left( \% \right) = \left( {\frac{{100}}{M}} \right)\sum\limits_{{k = 1}}^{M} {\left| {\frac{{EC_{{P,k}} - EC_{{o,k}} }}{{EC_{{o,k}} }}} \right|,} $$
(44)
$$IA=1-\frac{\sum_{k=1}^{M}{\left({EC}_{P,k}-{EC}_{o,k}\right)}^{2}}{\sum_{k=1}^{M}{\left(\left|{EC}_{P,k}-\overline{{EC }_{o}}\right|+\left|{EC}_{o,k}-\overline{{EC }_{o}}\right|\right)}^{2}}, 0<IA\le 1,$$
(45)
$$E=1-\frac{{\sum }_{k=1}^{M}\left|{EC}_{o,k}-{EC}_{P,k}\right|}{{\sum }_{k=1}^{M}\left|{EC}_{o,k}-\overline{{EC }_{o}}\right|},$$
(46)

where \({EC}_{o,k}\) denotes the observed EC value, \({EC}_{P,k}\) denotes the predicted EC value, \(\overline{{EC }_{P}}\) and \(\overline{{EC }_{o}}\) denote the average predicted and observed EC values, respectively, and M denotes the number of data samples. In addition, the nearer the RMSE, MAE, RAE, and MAPE are to 0 and the nearer R, E, and IA are to 1, the better the model's efficiency.

Since specifying the best model according to seven metrics is difficult, the multi-index criterion (PI) (Wang et al., 2018) is used in this study to simplify the selection of the best model. The formula of PI is defined as,

$$PI=\frac{1}{7}.\left(\frac{{R}_{min}}{R}+\frac{RMSE}{{RMSE}_{max}}+\frac{MAE}{{MAE}_{max}}+\frac{RAE}{{RAE}_{max}}+\frac{MAPE}{{MAPE}_{max}}+\frac{{E}_{min}}{E}+\frac{{IA}_{min}}{IA}\right),$$
(47)

where \({R}_{min}\), \({E}_{min}\), and \({IA}_{min}\) are the minimum values of \(R\), \(E\), and \(IA\) achieved by all ML methods. Also, \({RMSE}_{max}\), \({MAE}_{max}\), and \({MAPE}_{max}\) are the maximum values of \(RMSE\), \(MAE\), and \(MAPE\) obtained by all ML models. The lower the PI, the better the model.
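The seven metrics and the PI ranking can be computed directly from observed/predicted series; the sketch below (ours, following the standard definitions of Willmott's IA and Legates–McCabe's E, and assuming no zero observations for MAPE) illustrates Eqs. (40)–(47):

```python
import numpy as np

def metrics(obs, pred):
    """Evaluation metrics of Eqs. (40)-(46); obs must contain no zeros (MAPE)."""
    obs, pred = np.asarray(obs, float), np.asarray(pred, float)
    err = pred - obs
    o_bar = obs.mean()
    return dict(
        RMSE=np.sqrt(np.mean(err ** 2)),
        MAE=np.mean(np.abs(err)),
        MAPE=100.0 * np.mean(np.abs(err / obs)),
        RAE=np.abs(err).sum() / np.abs(pred - o_bar).sum(),
        R=np.corrcoef(obs, pred)[0, 1],
        IA=1 - (err ** 2).sum()
            / ((np.abs(pred - o_bar) + np.abs(obs - o_bar)) ** 2).sum(),
        E=1 - np.abs(err).sum() / np.abs(obs - o_bar).sum(),
    )

def pi_index(m, all_models):
    """Multi-index criterion of Eq. (47); all_models is a list of metric dicts
    for every compared model. Lower PI indicates a better model."""
    return (min(d["R"] for d in all_models) / m["R"]
            + m["RMSE"] / max(d["RMSE"] for d in all_models)
            + m["MAE"] / max(d["MAE"] for d in all_models)
            + m["RAE"] / max(d["RAE"] for d in all_models)
            + m["MAPE"] / max(d["MAPE"] for d in all_models)
            + min(d["E"] for d in all_models) / m["E"]
            + min(d["IA"] for d in all_models) / m["IA"]) / 7.0
```

Each PI term is a ratio in which a better model contributes a smaller value, so a single scalar summarizes all seven metrics across the compared models.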

Study area

In this study, the selected monthly parameters are discharge and electrical conductivity, recorded at the Tange-Takab gauging station (longitude 50° 20′ 02″, latitude 30° 41′ 09″, 280 m above mean sea level), located on the Maroon River in Khuzestan province, Iran. The exact location of the Tange-Takab gauging station is illustrated in Fig. 3. Needless to say, this river, with a drainage area of 6824 km2 and a length of almost 310 km, has a profound impact on supplying drinking water, irrigation, and recreation, in particular for residents of Iran's southwestern regions. More specifically, the Maroon basin has an average annual temperature of almost 24 °C and average annual precipitation of 350.04 mm.

Figure 3

Location of Tange-Takab station.

Pre-processing and selecting the best combination

Data collected over a span of 36 years (21 March 1980 to 16 February 2016, 432 months) are used to simulate EC on a monthly basis with the ML models. The monthly data are segregated into two distinct sections, train and test: 70% (302 months) and 30% (130 months) of the whole dataset are dedicated to the training and testing sets, respectively. Fig. 4 (upper) depicts the time series of the independent variable Q, and Fig. 4 (lower) illustrates the EC time series, the target, over the training and testing periods. Table 1 lists several statistical criteria, namely minimum (MIN), maximum (MAX), average (AVG), range, standard deviation (SD), skewness (S), kurtosis (K), and autocorrelation coefficients (AC), for the training, testing, and complete datasets. The table shows that the S and K values of EC for both the training and testing datasets lie within the range [− 2, 2], whereas the S range ([3.469, 6.49]) and K range ([15.4, 48.81]) of Q reveal that the distribution of the discharge time series departs strongly from a normal distribution.
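The chronological 70/30 split described above can be sketched as follows; the variable names are illustrative, and no shuffling is applied because lagged time-series inputs must stay in order.

```python
# Chronological 70/30 split of the 432 monthly records.
def chrono_split(series, train_frac=0.70):
    n_train = round(len(series) * train_frac)
    return series[:n_train], series[n_train:]

months = list(range(432))          # stand-in for the 432 monthly EC records
train, test = chrono_split(months)
print(len(train), len(test))       # → 302 130
```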

Figure 4

Original time series of Q (input, upper graph) and EC (target, lower graph) for the whole dataset.

Table 1 Statistical information of the datasets utilized in training and testing.

Choosing the optimum combination of input variables is a significant stage in building time-series forecasting models with ML, in which consecutive lagged values of the time series are highly influential84,85,86. There is no universal criterion for specifying the number of lags; however, the auto-correlation function (ACF), partial auto-correlation function (PACF), and cross-correlation (CC) statistical methods are commonly used to detect the input combination in hydrological models6,87.

In Fig. 5 the ACF is used to estimate the effective input parameters. As can be observed, the AC of the 1-month and 2-month lagged signals has a more significant influence (more than 55%) on the original input dataset (\(Q_{t}\)) and on \(EC_{t}\) than the longer lags (\(Q_{t - 3} {, }Q_{t - 4} {,} \ldots {; }EC_{t - 5}\)). In addition, based on the PACF, the 1-month lagged signal for \(Q_{t}\) and the 1-month to 4-month lagged signals for \(EC_{t}\) can be considered.
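The lag-screening step can be sketched with a plain sample-autocorrelation estimator; the threshold value below is an assumption echoing the "more than 55%" criterion mentioned above, and the helper names are illustrative.

```python
def acf(x, max_lag):
    """Sample autocorrelation for lags 1..max_lag (minimal sketch)."""
    n = len(x)
    mean = sum(x) / n
    var = sum((v - mean) ** 2 for v in x)
    return [sum((x[t] - mean) * (x[t - k] - mean) for t in range(k, n)) / var
            for k in range(1, max_lag + 1)]

def significant_lags(x, max_lag, threshold=0.55):
    """Keep the lags whose autocorrelation clears the chosen threshold."""
    return [k + 1 for k, r in enumerate(acf(x, max_lag)) if abs(r) > threshold]
```

Applying `significant_lags` to the Q and EC series, with the threshold above, reproduces the kind of short-lag selection used to build the candidate input combinations.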

Figure 5

ACF and PACF of input and target datasets.

Figure 6 reveals that the highest correlations belong to the input signal (\(Q_t\)) at the current time, its first two time-lagged signals (\({Q}_{t-1}\text{, }{Q}_{t-2}\)), and the first four time-lagged signals of \(EC_{t}\) (\(E{C}_{t-1}\text{, }E{C}_{t-2},E{C}_{t-3},E{C}_{t-4}\)), which have a more significant effect on building a predictive model than \(Q_{t - 3}\) and \(E{C}_{t-5}\). Furthermore, comparing the cross-correlation values between the target signal (\(E{C}_{t}\)) and the input signals proves that \(E{C}_{t-1}\) and \(E{C}_{t-2}\), with the greater correlation coefficients of 0.68 and 0.45 respectively, play an important part in predicting the target WQ parameter. As a result, by analyzing the ACF and CC, lags of up to 2 months for Q and up to 4 months for EC were accepted for predicting the current month's \(EC_{t}\). Then, to determine the best input patterns amongst all available patterns, a best subset regression analysis was carried out. Simply put, to choose the optimal input pattern for each WQ target, four distinct criteria, namely R2, adjusted R2, Mallows' \({C}_{P}\)88, and the Amemiya prediction criterion (PC)89, are used. \(C_{P}^{{}}\) and PC are defined as90:

Figure 6

Cross correlation between input and target dataset.

$${C}_{p}=\frac{RS{S}_{i}}{MS{E}_{m}}+2i-N,m>i,$$
(48)
$$PC=\left(\frac{n+i}{n-i}\right)\left(1-\left({R}^{2}\right)\right).$$
(49)

in which \(MS{E}_{m}\) expresses the mean squared error of the full model with m predictors, i is the number of predictors in the candidate subset, \(RSS_{i}\) is the residual sum of squares of that subset, and N (and n in Eq. 49) is the number of historical data points. Table 2 classifies the results of the best subset regression analysis for \(EC_{t}\); four of the most appropriate patterns were selected as the optimum input data for the predictive models, based on the best values of R2 ([56.90, 57.00%]), \(C_{P}^{{}}\) ([4.74, 8.00]), and PC ([0.441, 0.444]). Admittedly, this method alone is unlikely to guarantee the most suitable input combinations. Therefore, other statistical conditions, including the Pearson correlation between the basic input parameters and the target parameter and a multicollinearity analysis between the inputs, were taken into account to raise the certainty of the combination selection. The current Q and the time-lagged EC series considerably affect the input combination of the target signal. Hence, in order to predict the current monthly \(EC_{t}\), four input combinations were provided separately to enhance the ML-based predictive models; they are shown in boldface in Table 2.
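Eqs. (48) and (49) are straightforward to evaluate for any candidate subset; the sketch below shows both criteria, with the example numbers chosen only for illustration (a subset with \(C_p\) close to i and a small PC is preferred).

```python
def mallows_cp(rss_i, mse_m, i, N):
    """Eq. (48): Mallows' Cp for a subset with i predictors; mse_m is the
    mean squared error of the full model with m > i predictors."""
    return rss_i / mse_m + 2 * i - N

def amemiya_pc(r2, i, n):
    """Eq. (49): Amemiya prediction criterion for n samples, i predictors."""
    return (n + i) / (n - i) * (1.0 - r2)

print(mallows_cp(100.0, 1.0, 4, 100))   # → 8.0
print(amemiya_pc(0.57, 6, 432))         # PC near the paper's reported range
```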

Table 2 Best subset analysis to optimally choose the input combination of EC in ML models.

Application results and discussion

In this paper, the A-DEPSO is developed to find the optimal parameters of the ANFIS model and to enhance the convergence speed. The efficiency of the proposed method is compared with the LSSVM, MARS, and GRNN models for predicting the EC parameter in standalone and wavelet-complementary frameworks. ANFIS-A-DEPSO overcomes the demerits of the basic ANFIS algorithm by optimizing the coefficients defining the membership functions, which leads to more meticulous predictive outcomes. In fact, the A-DEPSO optimization method is used to extract the optimal parameters of the ANFIS model and to increase the precision and the convergence rate. The position of each member in the A-DEPSO algorithm indicates the values of the consequent parameters (\({\alpha }_{1}\), \({\beta }_{1}\), \({\gamma }_{1}\), \({\alpha }_{2}\), \({\beta }_{2}\), \({\gamma }_{2}\)) and membership parameters (\({a}_{k}\), \({e}_{k}\), \({m}_{k}\)) of the ANFIS model. The baseline parameter values were treated as the starting locations of the solutions. The root mean square error (RMSE) was used as the fitness function to validate prediction accuracy. The hybrid ANFIS model was run until the RMSE was reduced to a minimum and the method converged toward the best solution. With every update of the solutions' positions, the ideal values of the design variables were refined.
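The optimization loop just described can be sketched generically. A-DEPSO itself is not reproduced here; a plain differential-evolution step stands in for it purely as an illustration, minimizing an RMSE fitness over the parameters of a toy linear model (all names, bounds, and settings are assumptions).

```python
import math, random

random.seed(0)

def rmse(params, xs, ys):
    """RMSE fitness of a toy two-parameter model y = a*x + b."""
    a, b = params
    return math.sqrt(sum((y - (a * x + b)) ** 2 for x, y in zip(xs, ys)) / len(xs))

def de_minimize(fitness, bounds, pop_size=20, iters=100, F=0.5, CR=0.9):
    """Basic DE/rand/1 loop: mutate, crossover, keep the fitter candidate."""
    dim = len(bounds)
    pop = [[random.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    cost = [fitness(p) for p in pop]
    for _ in range(iters):
        for i in range(pop_size):
            r1, r2, r3 = random.sample([j for j in range(pop_size) if j != i], 3)
            trial = [pop[i][d] if random.random() > CR else
                     pop[r1][d] + F * (pop[r2][d] - pop[r3][d])
                     for d in range(dim)]
            c = fitness(trial)
            if c < cost[i]:            # greedy selection, as in standard DE
                pop[i], cost[i] = trial, c
    best = min(range(pop_size), key=cost.__getitem__)
    return pop[best], cost[best]

xs = [i / 10 for i in range(50)]
ys = [2.0 * x + 1.0 for x in xs]       # synthetic data with true a=2, b=1
best, best_rmse = de_minimize(lambda p: rmse(p, xs, ys), [(-5, 5), (-5, 5)])
```

In the paper, the fitness would instead be the RMSE of the ANFIS output over the training data, and the decision vector would hold the consequent and membership parameters listed above.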

Likewise, the ANFIS, LSSVM, MARS, and GRNN, as prominent machine learning methods, were used to confirm the predictive ability of ANFIS-A-DEPSO, which constitutes the major novelty of this research. The substantial setting parameters of the LSSVM, GRNN, and MARS were obtained in a trial-and-error manner. Table 3 lists the values of the control parameters for these methods. It should be noted that the population size and the maximum number of iterations for the A-DEPSO algorithm are equal to 50 and 200, respectively.

Table 3 Control parameters of the LSSVM, MARS, and GRNN models.

Wavelet-ML models

Another effective way to raise the certainty of predictive hydrological models is the complementary data-intelligence approach, in which a discrete or continuous wavelet transform (DWT or CWT, correspondingly) is coupled with an ML model, based on an appropriate mother wavelet and decomposition level. Two mother wavelets, namely the discrete Meyer (Dmey) and Biorthogonal 6.8 (Bior 6.8), have proven their noteworthy ability in WQ predictive models, mainly because they have compact support and provide useful time localization12,15,20. In this research, these mother wavelets (i.e., bior6.8 and dmey) were used to decompose the time series. The optimal decomposition level of the wavelet transform for the WQ time series is formulated as6:

$$ nMW = int\left[ {\log \left( N \right)} \right]. $$
(50)

in which N is the number of data points, equal to 432 here; hence the decomposition level is taken as 3. As a result, the basic signals used in the EC modeling were divided into three levels of details and approximations, as shown in Fig. 7.
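The three-level split can be illustrated with a hand-rolled one-level Haar transform applied recursively. Haar is used here only for brevity; the paper's dmey and bior6.8 wavelets would normally come from a wavelet library such as PyWavelets (`pywt.wavedec`), and the function names below are illustrative.

```python
def haar_step(x):
    """One analysis level: pairwise averages (approximation) and
    pairwise half-differences (detail)."""
    a = [(x[2 * i] + x[2 * i + 1]) / 2 for i in range(len(x) // 2)]
    d = [(x[2 * i] - x[2 * i + 1]) / 2 for i in range(len(x) // 2)]
    return a, d

def haar_inverse(a, d):
    """Exact one-level reconstruction: x0 = a + d, x1 = a - d."""
    out = []
    for ai, di in zip(a, d):
        out += [ai + di, ai - di]
    return out

def wavedec3(x):
    """Three decomposition levels, returning A3 and [D1, D2, D3]."""
    details = []
    a = list(x)
    for _ in range(3):
        a, d = haar_step(a)
        details.append(d)
    return a, details

signal = [float(i % 8) for i in range(432)]   # stand-in for a 432-month series
a3, (d1, d2, d3) = wavedec3(signal)
```

Summing the reconstructed sub-series recovers the original signal, which is the basis of the \(Q = A_{3} + \sum D_{i}\) decomposition used as model input in the next step.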

Figure 7

Decomposition of datasets using DWT.

In the next step, the influential sub-series were collected (e.g., \(Q = A_{3} + \sum\nolimits_{i = 1}^{3} {D_{i} }\)) and arranged as the input variables for the five wavelet-based ML models, following the input combinations chosen for EC. Figure 8 demonstrates the details (Ds) and approximations (As) of the decomposed signals used in the EC simulation. Figure 9 displays the flowchart of the ML models for forecasting the EC parameter.

Figure 8

Decomposition of Q and EC datasets for two mother wavelets (Bior6.8 and Dmey).

Figure 9

Flowchart of all ML models for forecasting EC.

Evaluate the performance of standalone ML models

In this subsection, the four input combinations given in Tables 4 and 5 are used to evaluate the ability of the five standalone ML models in forecasting \(E{C}_{t}\) for the training and testing stages, respectively. Based on previous studies3,11,44,91, the models with the best results in the test period show the best overall performance, so the test-period results are examined to determine the best model in this study. First, the outcomes of the ANFIS-A-DEPSO model are addressed. Table 5, which provides the performance of the ANFIS-A-DEPSO model in predicting \(EC_{t}\) in the testing phase, reveals that Combo 3 (R = 0.672, RMSE = 275.404, MAPE = 10.956, E = 0.394, IA = 0.801, and PI = 0.832) has the edge over the other combinations. Likewise, the four combinations are used to identify the most appropriate input combination for MARS and ANFIS, and the best combination for both was Combo 4. For GRNN (R = 0.676, RMSE = 301.595, MAPE = 12.165, E = 0.274, IA = 0.803, and PI = 0.964) and LSSVM (R = 0.620, RMSE = 284.036, MAPE = 11.843, E = 0.356, IA = 0.747, and PI = 0.948), the most suitable combination is also Combo 4.

Table 4 Statistical metrics obtained by five ML models to forecast the EC parameter in the training stage.
Table 5 Statistical metrics obtained by five ML models to forecast the EC parameter in the testing stage.

Figure 10 plots the observed against predicted WQP values for the training and testing phases. It is clear that the error in the predicted values obtained by the ML models reaches ± 40%. Therefore, the five standalone ML models are not appropriate for predicting the EC.

Figure 10

Compare estimated and measured values of EC utilizing the standalone ML models in the form of scatter plot.

Figure 11 illustrates the distribution of predicted and measured EC values obtained through the ANFIS-A-DEPSO, ANFIS, LSSVM, MARS, and GRNN models for all datasets. The notable disparity between the predicted and measured EC values confirms that the five standalone ML models are not able to predict the EC accurately.

Figure 11

Assessing the distribution of estimated and measured values of EC obtained by all ML models.

Evaluate the performance of wavelet-based ML models

In this study, the W-ANFIS-A-DEPSO, W-ANFIS, W-LSSVM, W-GRNN, and W-MARS models are developed to boost the accuracy of the five standalone ML models (i.e., ANFIS-A-DEPSO, ANFIS, MARS, GRNN, and LSSVM). As mentioned before, the EC time series is decomposed by two mother wavelets (i.e., bior6.8 and dmey). Four combinations of input variables are used to assess the ability of the W-ML models with the different mother wavelets. Table 6 provides the optimal parameters of all the wavelet-based models.

Table 6 Control parameters of all wavelet-based ML models.

The prediction certainty of the W-ANFIS-A-DEPSO model for the two mother wavelets and all combinations is reported in Table 7, which shows that the best combinations of W-ANFIS-A-DEPSO with the Dmey and Bior6.8 mother wavelets for \(EC_{t}\) prediction in the testing phase are Combo 4 (R = 0.990, RMSE = 51.193, MAPE = 2.143, E = 0.979, and PI = 0.480) and Combo 1 (R = 0.988, RMSE = 54.064, MAPE = 13.5676, E = 0.977, and PI = 0.518), correspondingly. The outcomes reveal that Dmey yields the most appropriate performance for W-ANFIS-A-DEPSO, owing to its more suitable accuracy in comparison with the Bior6.8 mother wavelet.

Table 7 Statistical metrics obtained by the W-ANFIS-A-DEPSO model to forecast the EC parameter for all combinations.

Concerning the W-ANFIS model, the best combination for both mother wavelets is Combo 4 (Table 8). The outcomes of the mother wavelets for the best combination are W-ANFIS-Dmey (Combo 4: R = 0.985, RMSE = 60.295, MAPE = 2.484, E = 0.971, and PI = 0.517) and W-ANFIS-Bior6.8 (Combo 4: R = 0.984, RMSE = 64.090, MAPE = 2.543, E = 0.967, and PI = 0.539). It is readily apparent that the best mother wavelet for W-ANFIS is Dmey, thanks to its higher accuracy.

Table 8 Statistical metrics obtained by the W-ANFIS model to forecast the EC parameter for all combinations.

In the case of W-LSSVM, based on Table 9, the best combination for the two mother wavelets is Combo 4. The results of the Dmey and Bior6.8 mother wavelets for the best combination are W-LSSVM-Dmey (R = 0.984, RMSE = 64.727, MAPE = 2.638, E = 0.967, and PI = 0.599) and W-LSSVM-Bior6.8 (R = 0.985, RMSE = 62.553, MAPE = 2.564, E = 0.969, and PI = 0.572). Accordingly, the best mother wavelet for W-LSSVM is Bior6.8, which has a higher precision than Dmey.

Table 9 Statistical metrics obtained by the W-LSSVM model to forecast the EC parameter for all combinations.

Table 10, in which the W-MARS model's outcomes with the Dmey and Bior6.8 mother wavelets are reported, shows that the best combinations for Dmey and Bior6.8 are Combo 4 (R = 0.977, RMSE = 77.944, MAPE = 3.358, E = 0.951, and PI = 0.612) and Combo 4 (R = 0.978, RMSE = 77.937, MAPE = 3.337, E = 0.951, and PI = 0.616), correspondingly. Hence, Bior6.8, as the best mother wavelet, has better performance and certainty compared to Dmey.

Table 10 Statistical metrics obtained by the W-MARS model to forecast the EC parameter for all combinations.

With regard to Table 11, the W-GRNN outcomes reveal that the best combination is Combo 1 for both mother wavelets (i.e., Dmey and Bior6.8). Statistically, the best combinations of W-GRNN-Dmey and W-GRNN-Bior6.8 are Combo 1 (R = 0.811, RMSE = 214.160, MAPE = 9.055, E = 0.634, and PI = 0.937) and Combo 1 (R = 0.810, RMSE = 219.231, MAPE = 9.238, E = 0.617, and PI = 0.960), correspondingly. As a result, the best model is W-GRNN-Dmey, with a PI equal to 0.937.

Table 11 Statistical metrics obtained by the W-GRNN model to forecast the EC parameter for all combinations.

Figure 12 depicts the comparison of predicted and observed \(EC_{t}\) for the five W-ML models in the best combination of each mother wavelet. As can be observed, W-ANFIS-A-DEPSO-Dmey (Combo 4) outperforms W-ANFIS-A-DEPSO with the Bior6.8 mother wavelet as well as the four other models with all mother wavelets. Furthermore, the error of W-ANFIS-A-DEPSO-Dmey (Combo 4) remains under 10% for the majority of the predicted values. Comparing this figure with Fig. 10 clearly shows that the hybrid W-ANFIS-A-DEPSO model improves the correlation coefficient (R = 0.988) by about 47% compared to the standalone ANFIS-A-DEPSO (R = 0.672) during the test period.

Figure 12

Compare estimated and measured values of EC utilizing the W-ML models in the form of a scatter plot.

The spider plots of the seven metrics for the top models with the best combination of input variables are displayed in Fig. 13. In these diagrams, a model is more reliable the closer its values of R, IA, and EL,M are to 1 and the closer its RMSE, MAE, RAE, and MAPE are to the center of the diagram. According to Fig. 13 (lower panel), the most effective model for raising the accuracy of forecasting \(EC_{t}\) is W-ANFIS-A-DEPSO-Dmey, with the largest R, IA, and EL,M and the smallest RMSE, RAE, and MAPE for both training and testing stages. Likewise, Fig. 13 (upper panel) shows that, among the Bior6.8-based models, W-ANFIS-A-DEPSO-Bior6.8-C4 achieves the highest R, IA, and EL,M and the lowest RMSE, RAE, and MAPE for both stages.

Figure 13

Spider plots of seven performance criteria for prediction of EC using all ML models for Bior6.8 (upper panel) and Dmey (lower panel) in training and testing stages.

The Taylor diagram was used to assess the overall ability of the models, presenting their efficiency in detail through the correlation coefficient (R) and the standard deviation5,11. The diagram provides a perceptible and persuasive picture of the agreement between the predicted and observed WQ parameters. The Taylor diagrams in Fig. 14 show the current monthly EC with the Bior6.8 mother wavelet (upper panel) and with the Dmey mother wavelet (lower panel) for all ML models. As a result, W-ANFIS-A-DEPSO has the most suitable performance for EC prediction compared with the other models and is the closest model to the target point.

Figure 14

Taylor diagram of all ML models for Bior6.8 (upper panel) and Dmey (lower panel).

Compare the performance of all ML models

A contrastive analysis is provided in this section to determine the best model. Five ML models, two mother wavelets, and four combinations of input variables are reviewed for forecasting the EC in this research. As mentioned before, W-ANFIS-A-DEPSO-C4 (Dmey), W-ANFIS-C4 (Dmey), W-LSSVM-C4 (Bior6.8), W-MARS-C4 (Dmey), and W-GRNN-C1 (Dmey) show the best performance amongst all models for \(EC_{t}\) prediction.

Figure 15 demonstrates the physical trends of the five methods to further address their abilities, confirming the inability of the standalone ML models to predict \(EC_{t}\). Because of the high variability and the characteristically non-linear correlations among the WQ parameters, building a steady model with ANFIS-A-DEPSO, ANFIS, LSSVM, MARS, or GRNN alone is a complicated task. The aim is therefore to enhance the five ML models with the wavelet theorem and to assess the impact of coupling the wavelet transform with ML models for EC prediction. According to Fig. 15, the W-ML models have the edge over the standalone ML models in terms of efficiency.

Figure 15

Compare the physical trend of the best performance of all ML models.

Eventually, Fig. 16, which illustrates the relative deviation (RD) of \(EC_{t}\) predictions by W-ANFIS-A-DEPSO, W-ANFIS, W-LSSVM, W-MARS, and W-GRNN in the diverse combinations, confirms that W-ANFIS-A-DEPSO-C4, with RD in the range [− 43.89, 29.60], has the superior ability to predict \(EC_{t}\) amongst the models.

Figure 16

Relative deviation to forecast the EC using ML models coupled with Bior6.8 (upper) and Dmey (lower) mother wavelets.

Comparison of W-ANFIS-A-DEPSO with hybrid models

For further investigation of the proposed W-ANFIS-A-DEPSO's efficiency, this section compares its performance with three hybrid models: W-ANFIS-PSO [i.e., hybrid of W-ANFIS with particle swarm optimization92], W-ANFIS-GWO [i.e., hybrid of W-ANFIS with the grey wolf optimizer93], and W-ANFIS-WOA [i.e., hybrid of W-ANFIS with the whale optimization algorithm94]. For a fair comparison, the population size is 50 for all hybrid models, and the maximum number of iterations (MaxIt) is 300 for all except ANFIS-A-DEPSO, for which it is 50. By choosing the value of 50 for MaxIt, we aim to show that the proposed model can outperform the other models within a much smaller number of iterations. Table 12 reports the parameter settings of all methods. For the selected hybrid models, the PSO, GWO, and WOA require little parameter tuning. For instance, PSO uses an inertia weight (\(w\)) that decreases linearly to damp its velocity, so it does not need to be set. The values of the parameters c1 and c2 are constant; in Refs.74,92 both are recommended to be 1.5. The same applies to the other two methods (i.e., W-ANFIS-GWO and W-ANFIS-WOA). In this section, all hybrid models were applied to predict the EC parameter using Combo 4-Dmey as the input, because this combination showed the best performance for the ANFIS model in the previous sections.

Table 12 Parameter settings of all hybrid models.

Figure 17 displays the convergence graphs of all hybrid models for predicting the EC parameter. It can clearly be seen that the proposed model converges to a lower value (83.955) than the other hybrid models. Also, the proposed model achieves a better RMSE in fewer than 10 iterations, while the other hybrid models cannot converge to a suitable solution even after 50 iterations. This again confirms the proposed model's superiority over the other hybrid models.

Figure 17

Convergence graphs of (A) PSO, GWO, WOA, and (B) A-DEPSO.

Tables 13 and 14 give the statistical outcomes of W-ANFIS-A-DEPSO and the three other hybrid models for predicting the EC parameter in the training and testing stages. According to these tables, the proposed W-ANFIS-A-DEPSO provides better results in terms of RMSE (train: 83.955, test: 51.193), MAE (train: 65.822, test: 41.870), RAE (train: 0.121, test: 0.1509), and MAPE (train: 3.607, test: 2.1427) than the other hybrid models. Simply put, because the proposed model uses powerful exploration and exploitation mechanisms and adaptive parameters to transition smoothly from global to local search, it presents more accurate outcomes faster than the other hybrid models.

Table 13 Comparison of W-ANFIS-A-DEPSO with three hybrid models in the training stage.
Table 14 Comparison of W-ANFIS-A-DEPSO with three hybrid models in the testing stage.

Figure 18 depicts the relative errors (REs) of all hybrid models. Regarding the figure, the proposed model estimates the EC with a narrower RE range ([− 0.190, 0.191]) compared with PSO ([− 1.160, 0.22]), GWO ([− 1.610, 0.22]), and WOA ([− 0.353, 0.276]). Based on these results, the proposed model predicts the EC with higher accuracy than the other hybrid methods.
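For reference, an RE band of this kind can be computed per model as below, assuming RE is defined as (predicted − observed)/observed; the helper name and the sample numbers are illustrative only.

```python
def relative_errors(obs, pred):
    """Per-sample relative error, assumed as (predicted - observed) / observed."""
    return [(p - o) / o for o, p in zip(obs, pred)]

# Hypothetical EC values (µS/cm) for one model; min/max give its RE range.
obs = [400.0, 500.0, 650.0]
pred = [380.0, 540.0, 640.0]
re = relative_errors(obs, pred)
print(min(re), max(re))   # the kind of range compared across models in Fig. 18
```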

Figure 18

Relative errors calculated by all hybrid models.

Conclusion

By designing a promising model called wavelet-ANFIS-A-DEPSO with two mother wavelets (Dmey and Bior6.8), monthly ECt prediction can be made in surface water. A powerful optimization method, A-DEPSO, was developed to increase the ability of the ANFIS model. A-DEPSO is a hybrid of DE and PSO with boosted exploration and exploitation and two adaptive parameters. In addition, a novel crossover with adaptive parameters was used to increase the diversity of the population, and a refreshing operator was implemented to raise the chance of escaping from local solutions. Four ML models (i.e., ANFIS, LSSVM, MARS, and GRNN) were operated to forecast the EC in order to evaluate the proposed model's efficiency. Besides, the standalone ML models were utilized to assess the predictive ability of all W-ML models for the EC water quality parameter via several metrics and a validation procedure. Consequently, the monthly time series of Q and EC over 36 years in the Maroon River were used with two and four time lags, respectively; these lags were detected by statistical procedures, and three decomposition levels were used for each mother wavelet.

More specifically, the best subset regression analysis was considered to detect the best input combination for EC prediction. The outcomes of the ML models without wavelets in all combinations proved that the ANFIS-A-DEPSO model in Combo 3 had a striking ability in monthly EC prediction (PI = 0.832). In addition, the ANFIS, LSSVM, MARS, and GRNN models showed their best performance for EC prediction in Combo 4 (PI = 0.887), Combo 1 (PI = 0.964), Combo 4 (PI = 0.890), and Combo 1 (PI = 0.948), correspondingly. The seven metrics obtained by ANFIS-A-DEPSO as the best standalone model are R = 0.672, RMSE = 275.404, MAE = 198.092, RAE = 0.714, MAPE = 10.956, E = 0.394, and IA = 0.801.

More importantly, the W-ML models improved the certainty of EC modelling. Dmey, joined with the ANFIS-A-DEPSO, ANFIS, MARS, and GRNN models to predict EC, showed the most notable advancement in simulation accuracy, albeit Bior 6.8 also showed appropriate performance. On the other hand, Bior6.8 joined with LSSVM brought about a more suitable performance compared to Dmey.

Regarding monthly ECt prediction, W-ANFIS-A-DEPSO-Dmey in Combo 4 had the best efficiency (PI = 0.480), while the W-ANFIS-Dmey model in Combo 4 (PI = 0.517) also proved reliable, followed by W-LSSVM-Bior6.8 in Combo 4 (PI = 0.572), W-MARS-Dmey in Combo 4 (PI = 0.612), and W-GRNN-Dmey in Combo 1 (PI = 0.937), correspondingly. Furthermore, comparing the seven statistical metrics obtained by W-ANFIS-A-DEPSO-Dmey (R = 0.988, RMSE = 53.841, MAE = 42.941, RAE = 0.155, MAPE = 2.192, E = 0.977, and IA = 0.994) and W-ANFIS-Dmey (R = 0.985, RMSE = 60.295, MAE = 46.328, RAE = 0.167, MAPE = 2.484, E = 0.971, and IA = 0.993) clearly shows that the proposed model has a better efficiency than the W-ANFIS model. Moreover, according to the graphical analyses (i.e., scatter plots, time series plots, Taylor diagram, and violin plot), it is evident that the proposed model predicts the EC parameter more accurately and reliably than the other models.

Furthermore, the suggested model was compared with three hybrid models (W-ANFIS-PSO, W-ANFIS-GWO, and W-ANFIS-WOA) to evaluate its effectiveness. The findings show that the suggested model is more accurate than the other models in terms of RMSE (train: 83.955, test: 51.193) and MAPE (train: 3.607, test: 2.1427).

To sum up, from what has been addressed across all ML-based models, it is obvious that W-ANFIS-A-DEPSO, a complementary model, is able to predict the EC accurately. As suggestions for future work, it could first be operated as an ensemble multi-wavelet model in order to use several wavelets simultaneously. Secondly, designing an ensemble ANFIS-based method could have a positive impact on WQP prediction in surface water, accumulating the merits of each complementary procedure. Finally, other optimization methods could be applied to optimize the main parameters of the ANFIS model95,96,97.