Introduction

A lateral intake or diversion channel has been usually employed in order to divert waterway from rivers and main channels into the secondary channels, water supply networks, and power stations. Some of the distinguishing features of a flow separation zone in a lateral intake are demonstrated in Fig. 1.

Fig. 1
figure 1

Characteristics of dividing flow in open channels (Ramamurthy et al. 2007)

According to Fig. 1, three zones are observed: (1) a zone of separation in the vicinity of entrance (separation zone), (2) a contracted flow region in the channel (contracted flow zone), and (3) a stagnation point next to the downstream corner of junction (stagnation zone) (Ramamurthy et al. 2007). The flow pattern in the lateral intake is extremely complicated since the flow separation results in some remarkable changes in the hydraulic conditions such as reduction of the actual width of current and further alluviation in the entrance of lateral intake. Thus, it is crucial to determine DFSZ.

In order to reduce DFSZ and manage misdirected discharge/sediments into the basin, it has generally been found effective to install submerged vanes in the entry of lateral intake. Submerged vanes frequently create downstream vortices that shift the flow direction and apply shear stress to the channel bed. It causes bed material to move in a transverse direction along the channel’s cross-section, which changes the bed profile. The properties and array of vanes can be arranged logically to achieve the appropriate bed profiles (Sharma and Ahmad 2020).

Submerged vane has been employed for extensive purposes such as channel bank protections (Wang and Odgaard 1993), sediment elimination in water intakes (Wang et al. 1996), and expanding rivers for navigation (Odgaard and Wang 1991b). A summary of theoretical studies and innovative relationships for sediment control by submerged vanes is provided by Odgaard and Spoljaric (1988) and Odgaard and Wang (1991a). They reported that the small submerged vanes caused the remarkable alterations in the velocity distribution in rivers. Nakato et al. (1990) suggested a pioneering technique in the laboratory field to produce a scour-trench in front of a power-plant. They concluded that the performance of the utilized vane model can be considered an outstanding scheme. Barkdoll et al. (1999) conducted numerous laboratory tests to identify the fundamental geometrical and physical characteristics that submerged vanes can use to stop extreme bed sedimentation into channels used for diverting sediment from sedimentary rivers. His experiments proved that the recommended shape for submerged vanes caused only an insignificant rate of bed sediment pass into the diversion as the ratio of qr was less than 0.2. Numerous instance field applications have been presented by Odgaard (2009). Gupta et al. (2010) studied the results of a collar on the efficiency of a scour hole around the submerged vane and optimized its dimensions in the prevention of scour hole. They declared that by using a collar on the leading edge of a submerged vane, the scour depth was noticeably diminished. Behbahan (2011) investigated the effect of different shapes of submerged vanes as flat, angled, and curved types with various arrays on the scour and sediment control. He reported that based on experimental results, the curved and angled forms were more operative than the flat form in riverbank protection by 35 and 20%, respectively. Ouyang (2009), Ouyang and Lin (2016), and Karami et al. (2017) conducted in-depth analyses to evaluate the border effects of channel banks, water surface, and channel bed profile generated by various arrays of submerged vanes in various forms using novel techniques.

Characteristically, a hydrogeological process can be analyzed by three methods, including physical, numerical, and artificial intelligence (AI) methods. To date, many complicated numerical methods have been adopted to evaluate the flow characteristics and sediment transportation phenomenon in an open channel (Kaya and Gharehbaghi 2012; Gharehbaghi et al. 2016, 2017; Sui and Huang 2017; Ehteram et al. 2020; Monier et al. 2020). Nevertheless, because of complex flow interactions, these methods were mostly unable to precisely justify vane-induced flow field characteristics by submerged multiple-vane systems.

Due to the costly and time-consuming function of experimental and numerical approaches, nonlinear systems such as machine learning models (MLMs) and intelligent optimization models (IOMs), AI-based techniques, should be utilized to identify the extremely unbalanced behavior of DFSZ. These methods are well known as the top adaptive ones suitable for finding complex and nonlinear indefinite patterns in large dimensional data. As scientists’ skill with these AI-based systems deepens, they are becoming more dependable, and now they are frequently utilized as robust approaches in different fields of water sciences to predict complex hydraulic and hydrological variables such as sugarcane growth based on climatological parameters (Taherei Ghazvinei et al. 2018), daily dew point temperature (Qasem et al. 2019), forecasting nitrate concentration as a water quality parameter (Latif et al. 2020), inflow forecasting (Latif et al. 2021a), phosphate forecasting in reservoir water system (Latif et al. 2021b), daily streamflow time-series prediction (Latif and Ahmed 2021; Tofiq et al. 2022), surface water quality status and prediction during movement control operation order under COVID-19 pandemic (Najah et al. 2021), groundwater level fluctuations (Ghasemlounia et al. 2021; Gharehbaghi et al. 2022), discharge coefficient of a new type of sharp-crested V-notch weirs (Gharehbaghi and Ghasemlounia 2022), and dissolved oxygen prediction (Ziyad Sami et al. 2022).

Regarding the AI methods, GEP (gene expression program) and SVR (support vector regression) are very popular and have been also regularly utilized in various fields of engineering. GEP is contemplated as a well-recognized AI-based technique inspired by biological basics and has a satisfactory ability in estimating parameters with high nonlinearity relationships. SVR is regarded as a commendable method for classification, regression, and function estimation. In recent years, IOMs have been developed as an effective substitute for the traditional standalone data-driven models (DDMs), AI-based modeling methodologies, to improve performance and resolve issues. This study uses three alternative DDMs, including SVR, GEP, and a robust hybrid bio-inspired IOM, to both test experimentally and predict DFSZ induced by a coupled submerged vane. The hybrid model is a coupled version of SVR with an ant colony optimization (ACO) algorithm (i.e., hybrid SVR-ACO model), which is employed to progress the estimation precision of DFSZ. In the hybrid SVR-ACO model, ACO is a bio-inspired IOM that optimizes mechanical approximation parameters of the standalone SVR model in computational engineering problems. Several scenarios are defined in the current simulation by varying the hyperparameters and structure of the produced DDMs. It should be noted that finding the ideal value for a hyperparameter requires a process of trial and error in order to reach the best configuration. In this way, 192 experimental datasets are used to evaluate DFSZ while accounting for dimensionless effects.

According to the literature study, earlier studies primarily focused on the most fundamental types of submerged vanes since their theories and underlying assumptions were oversimplified. The innovative aspect of the current study is the experimental assessment of the effects of coupled submerged vanes on the DFSZ and flow pattern at the entry of the lateral intake, and the use of DDMs for prediction.

The main aims and scope of this study are to (i) investigate experimentally effects of a combined submerged multiple-vane system on DFSZ and flow attributes, (ii) determine the relationships among nitty–gritty layout and physical characteristics of the present submerged multiple-vane system with DFSZ to improve vane performances, and (iii) develop different DDM techniques to estimate DFSZ. Additionally, an evaluation is carried out to determine the best DDMs by validating applied physical and DDMs using statistical measures.

Experimental setup

The combined submerged vanes employed in this work are comprised of two segments, curved and flat plates. The curved section has different degrees of curvature (β) including 0, 17, 34, and 68°, as described in Fig. 2.

Fig. 2
figure 2

The sketch view of the present submerged vane

A laboratory system for experimental work is described in Fig. 3 (A and B). In this submerged multiple-vane system, the optimal angle of lateral intake channel, i.e., 55° (Keshavarzi and Habibi 2005), and the butterfly array among vanes are utilized. Many studies have been conducted in the area of submerged vanes’ size and array. Two rows of vanes would theoretically be more effective than three rows for managing sediment in any vane layout (Barkdoll et al. 1999). The most successful design guidelines and dimensions are typically based on local flow-field observations and expert judgments because there are no specified all-inclusive guidelines and standards for the ideal dimensions and design essentials for submerged vanes in the literature.

Fig. 3
figure 3

Characteristics of the proposed submerged multiple-vane system in the A β = 0° and B β = 17, 34, and 68°

In Fig. 3, θ is the angle of attack to the local primary flow direction, δl is the distances in the median of vanes along the flow (i.e., stream-wise direction); Lv and Hv are the length and height of the proposed vane, respectively; δt is the distance in the median of vanes across the flow (i.e., perpendicular to streamwise direction); δb is the distance of first row from in the median of vanes to the bank. It is necessary to mention that the dimensions of the proposed multiple-vane system are taken from the standards and recommended ranges by Odgaard and Wang (1991a, b) and Behbahan (2011), and the size of flume utilized in the hydraulic laboratory.

The hydraulic laboratory of Tabriz Azad University’s Water Engineering Department is where this study’s experimental work is done. In order to assess DFSZ in the horizontal lateral intake channel, two sets of experiments are conducted in this regard. The approved submerged multiple-vane system is employed in the second set, whereas no vane is used in the first set (control test). Table 1 provides the dimensions of the utilized system and vanes.

Table 1 Specifications of present submerged multiple-vane system

Based on the requirements of Froude law scaling and free steady flow conditions, all discharges are taken into account in the ranges of 20 to 35 (l/s), making the downstream flow supercritical in all tests. On the basis of flow-field analysis, potential plans to enhance vane performance are identified.

In this experiment, a 10-m long, 0.4-m wide, and 0.5-m high rectangular horizontal flume is employed as the main channel, and a 4-m long, 0.28-m wide, and 0.5-m high rectangular horizontal flume is used as the lateral intake channel. Both channels have a longitudinal slope of 0.0001 with glassy sidewalls for better visualization of flow patterns.

In the intake of the main channel, a grid wall and a wave suppressor are employed to reduce turbulence on the water’s surface, break big eddies, and maintain flow homogeneity. To measure the depths of flow downstream of lateral intake and upstream of main channel, point gauges with Vernier scales and precision of ± 0.1 (mm) are used. Figure 4 (A and B) shows the geometric characteristics of the experimental open channel set-up and the point gauges employed in the laboratory.

Fig. 4
figure 4

A Layout of the experimental setup and B point gauges with Vernier scales

A pump with a power of 200 hp discharges the water into the stilling basin in the inlet of main channel. In both channels, all discharges (Q) are gauged by using an electromagnetic flow meter in the measuring basin with a precision of ± 0.5%.

In this study, to determine the DFSZ in the entrance of lateral intake channel, numerous photos of the separation zone are captured from the top view; then, through these photos, the maximum length (LS) and width (WS) of separation zone are computed by AUTOCAD and SURFER software.

Dimensional analysis

In this research, experimental data is assessed to determine the potential parameters on DFSZ. In this respect, this phenomenon can be written as a function of

$${{f}_{1}(y}_{1},{y}_{2},{B}_{2},{q}_{1},{q}_{2},{L}_{V},{H}_{V},\rho ,g,\mu ,{\delta }_{l},{\delta }_{t},{\delta }_{b},{L}_{s}{,W}_{s},{S}_{0},\alpha ,\theta ,\beta , n)=0$$
(1)

where y1 and y2 are the depths of flow in the upstream of main channel and downstream of lateral intake channel, respectively, B2 is the width of lateral intake channel, q1 is the inflow discharge per unit width in the main channel, q2 is the outflow discharge per unit width in the lateral intake channel, Lv and Hv are length and height of present vanes, respectively, ρ is flow density, n is the number of rows in the proposed submerged multiple-vane system, g is the gravitational acceleration, μ is the dynamic viscosity of flow, LS and WS are the maximum length and width of separation zone, respectively, So is the channel slope, α is the angle of the lateral intake, and β is the degree of curvature in the curved segment of vanes suggested.

Through Buckingham π-theorem and the property of dimensional analysis, non-dimensional parameters can be obtained as

$${f}_{2}\left({Fr}_{1},{Re}_{1},\frac{{B}_{2}}{{y}_{1}}, \theta , {S}_{0}, \alpha , n,\beta , \frac{{\delta }_{b}}{{y}_{1}},\frac{{\delta }_{l}}{{y}_{1}} ,\frac{{\delta }_{t}}{{y}_{1}},\frac{{q}_{2}}{{q}_{1}},\frac{{L}_{s}}{{y}_{1}},\frac{{W}_{s}}{{y}_{1}},\frac{{y}_{2}}{{y}_{1}},\frac{{L}_{V}}{{y}_{1}},\frac{{H}_{V}}{{y}_{1}}\right)=0$$
(2)

where Fr1 and Re1 are the approaching Froude and Reynolds numbers (dimensionless). By overlooking fixed-amounts variables including So, α, and n and supposing a high scale of turbulence (i.e., large Reynolds numbers), the effect of Re1 can be disregarded (Borghei et al. 1999). Lastly, for the sake of simplicity, Eq. (2) can be expressed in the following forms:

$$\frac{{L}_{s}}{{B}_{2}}=\left({Fr}_{1}, \frac{{\delta }_{b}}{{y}_{1}},\frac{{\delta }_{l}}{{y}_{1}} ,\frac{{\delta }_{t}}{{y}_{1}}, \frac{{q}_{2}}{{q}_{1}}, \frac{{y}_{2}}{{y}_{1}},\frac{{L}_{V}}{{y}_{1}},\frac{{H}_{V}}{{y}_{1}}, \theta ,\beta \right)$$
(3)
$$\frac{{W}_{s}}{{B}_{2}}=\left({Fr}_{1}, \frac{{\delta }_{b}}{{y}_{1}},\frac{{\delta }_{l}}{{y}_{1}} ,\frac{{\delta }_{t}}{{y}_{1}}, \frac{{q}_{2}}{{q}_{1}},\frac{{y}_{2}}{{y}_{1}},\frac{{L}_{V}}{{y}_{1}},\frac{{H}_{V}}{{y}_{1}}, \theta ,\beta \right)$$
(4)

The \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) as representatives of DFSZ are the maximum relative length and width of flow separation zone. The range and statistical indexes of parameters used in Eqs. (3) and (4) are presented in Tables 2 and 3, respectively.

Table 2 Range of used experimental data
Table 3 Statistical indexes of variables used in Eqs. (3) and (4)

Pre-processing

Many characteristics have been utilized in many different sorts of research to explain unidentified events, albeit some of these parameters have negligible influence. Unwise choices might negatively impact on the effectiveness of approaches, because the performance of the exact estimation of the target depends on the opposite choice of forecasters. The combination of model inputs and adequate datasets for training and testing are two of the key factors limiting how accurately nonlinear techniques can model (Gharehbaghi and Ghasemlounia 2022). In this way, approaches for dimensionality reduction have been used to extract useful parameters and speed up the computation. In order to select appropriate input variables for Eqs. (3) and (4), this method is applied in this work using two special tests, Spearman’s rho and Pearson’s correlation coefficient, which serve as examples of nonparametric and parametric statistical tests, respectively. With a 95% confidence level, Table 4 displays the correlation coefficient values of the variables utilized in Eqs. (3) and (4).

Table 4 Spearman’s rho and Pearson’s correlation coefficient values in Eqs. (3) and (4)

Based on Table 4, the highest correlation coefficient values in \(\frac{{L}_{s}}{{B}_{2}}\) and.

\(\frac{{W}_{s}}{{B}_{2}}\) are observed with the\(\frac{{q}_{2}}{{q}_{1}}, \frac{{\delta }_{l}}{{y}_{1}},\frac{{\delta }_{b}}{{y}_{1}} ,\frac{{\delta }_{t}}{{y}_{1}} ,\frac{{L}_{V}}{{y}_{1}} ,\frac{{H}_{V}}{{y}_{1}}\),\(\beta\); and\(\theta ,\beta , \frac{{q}_{2}}{{q}_{1}}, \frac{{\delta }_{l}}{{y}_{1}},{Fr}_{1},\frac{{L}_{V}}{{y}_{1}},\frac{{H}_{V}}{{y}_{1}}\), respectively; hence, they are considered input parameters, and \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) are target. Finally, Eqs. (3) and (4) can be rewritten in the ultimate following forms:

$$\frac{{L}_{s}}{{B}_{2}}=\left(\beta , \frac{{q}_{2}}{{q}_{1}}, \frac{{\delta }_{l}}{{y}_{1}},\frac{{\delta }_{b}}{{y}_{1}},\frac{{\delta }_{t}}{{y}_{1}},\frac{{L}_{V}}{{y}_{1}},\frac{{H}_{V}}{{y}_{1}}\right)$$
(5)
$$\frac{{W}_{s}}{{B}_{2}}=\left(\theta ,\beta , \frac{{q}_{2}}{{q}_{1}}, \frac{{\delta }_{l}}{{y}_{1}},{Fr}_{1},\frac{{L}_{V}}{{y}_{1}},\frac{{H}_{V}}{{y}_{1}}\right)$$
(6)

Sensitivity analysis

A sensitivity analysis for the forecasters in Eqs. (5) and (6) is implemented to depict the impact of each forecaster on the model target by altering each forecaster in a fixed amount and holding the other forecasters constant. The cosine amplitude technique is applied to adopt the sensitivity analysis (Momeni et al. 2014) as follows:

$${{\varvec{R}}}_{{\varvec{i}}{\varvec{j}}}=\frac{\sum_{{\varvec{k}}=1}^{{\varvec{N}}}\left({{\varvec{I}}}_{{\varvec{i}}{\varvec{k}}}{{\varvec{O}}}_{{\varvec{j}}{\varvec{k}}}\right)}{\sqrt{\sum_{{\varvec{k}}=1}^{{\varvec{N}}}{{\varvec{I}}}_{{\varvec{i}}{\varvec{k}}}^{2}\sum_{{\varvec{k}}=1}^{{\varvec{N}}}{{\varvec{O}}}_{{\varvec{j}}{\varvec{k}}}^{2}}}$$
(7)

where N is the total number of datasets and Ii and Oj are the input and output variables. Rij value is in the range of 0 to 1 and specifies the strength of relationship between every forecaster and target in Eqs. (5) and (6), as are presented in Fig. 5 A and B. According to these figures, \(\frac{{H}_{V}}{{y}_{1}}\) and \(\frac{{L}_{V}}{{y}_{1}}\) are the most effective variables for estimation of \(\frac{{L}_{s}}{{B}_{2}}\) and

Fig. 5
figure 5

Values of Rij for the variables used in Eqs. (5) and (6): A \(\frac{{L}_{s}}{{B}_{2}}\) and B \(\frac{{W}_{s}}{{B}_{2}}\)

\(\frac{{W}_{s}}{{B}_{2}}\), respectively.

Estimation models

Based on Table 3, it is apparent that \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) have an unstable behavior via the kurtosis and skewness, and their relationships with forecasters in Eqs. (5) and (6) are extremely intricate and nonlinear. Thus, robust accurate DDMs are essential for estimating \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\). Concerning, as above mentioned, in the present study, for predicting \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) by SVR, GEP, and hybrid SVR-ACO methods, 192 experimental datasets are employed. In this direction, a set of 144 datasets are randomly utilized for calibration (75% of the data), and the other 48 datasets (25%) are randomly utilized for validation. Then, these sets are normalized to zero mean and unit variance as suggested by Lawrence et al. (1997).

Gene expression programming (GEP)

GEP was developed by Ferreira (2001) inspired by “Darwinian evolution theory” and is one of the circulating algorithm systems. In this method, an excellent population is elected, or else a fresh populace is regenerated to accomplish the excellent populace. The flowchart of a GEP algorithm is depicted in Fig. 6.

Fig. 6
figure 6

Flowchart of a GEP algorithm (Ferreira 2001)

The first step in the GEP method, which can run randomly or with some information, is the development of the primary populace. The structure of an expression tree is then given, along with the chromosomes. To determine whether a resolution is appropriate, the effects are evaluated using a fitness function. If a respectable resolution value is attained, the progression mechanism is stopped, and the superb resolution for this phase is then shown. The best option for the current group is held in reserve in the absence of stop conditions. This process is performed for a predetermined number of groups in order to achieve the desired outcome (Ferreira 2001).

GEP model development

In the current research, GeneXpro Tools 4.0 program is employed to adopt GEP algorithm for estimating the values of \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) by following steps (Mehdizadeh et al. 2017). In the first step, RMSE has been opted as the suitable fitness function. The second step is to define variables and function sets to generate the chromosomes. In this direction, forecasters and target variables (\(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\)) in Eqs. (5) and (6) have been, respectively, opted. Function sets are chosen for four dominant mathematics operators including {+ − / ×} and several arithmetical functions including x2, x3, ex, etc. The third step is identifying the basic components of chromosomes, such as head size, number of genes, and chromosomes. The fourth phase uses additional functions like an expression tree connecting function to connect subsections. Lastly, the stop condition is a generation number of 10,000. Several situations are adopted in this work by adjusting important hyperparameters including the number of chromosomes and genes, head size, and kind of linking function. It should be noted that these hyperparameters are chosen through a process of trial and error to achieve the optimal model configuration.

Support vector regression (SVR)

SVM was created and descended from statistical learning concepts. It functions as a subclass of supervised MLMs and employs an operational risk reduction procedure to reduce high oversimplification error. It may also be used to realize patterns or arrange goods in certain sectors, which is known as SVR. SVR is viewed as an effective method for forecasting, function estimation, and regression. Additionally, by effectively minimizing the observed training and distribution error, it can learn and estimate nonlinear relationships among the predictor-target datasets in vast dimensions (Yu et al. 2018). The construction of an SVR algorithm is shown in Fig. 7.

Fig. 7
figure 7

Structure of an SVR algorithm

In keeping with Fig. 7, the n-dimensional observed datasets are categorized as the predictor vectors and one-dimensional target vectors are categorized as the output. SVR distinguishes a fitting function f(x), by dropping the distribution error limitations to accomplish a general regression efficacy as

$$f (x) = wT\times \varphi (x) + b$$
(8)

where \(\varphi\)(x) denotes a nonlinear function, b is the bias statement, and w is the weight vectors specified by decreasing the risk function. Therefore, it is vital to decrease the error function (Eq. 9) by applying some conditions presented in Eq. (10) (Pai and Hong 2006) as follows:

$${Minimize}_{w,b}\frac{1}{2}{w}^{T}\times w+C{\sum }_{i=1}^{l}({\xi i}^{*}+\xi i)$$
(9)
$$yi-(wT\times \varphi (x) + b)\le \varepsilon +\xi i; (wT\times \varphi (x) + b)-yi\le \varepsilon +\xi i*;\xi i*,\xi i\ge 0$$
(10)

where C is a legalization argument with a positive integer amount. Bigger values of C imply a more complicated and precise learning machine, yet a lesser C value implies large error toleration and a fairly weaker one (Smola and Schölkopf 2004; Yu et al. 2006). \(\upxi\) i and \(\xi\) i* are the slack variables (Olatomiwa et al. 2015). As SVR network is calibrated, it approximates outputs via the next equation as follows:

$$f (x) = {\sum }_{i=1}^{l}{\theta }_{i}\times \varphi (x, {x}_{i})+b$$
(11)

where \({\theta }_{i}\) exemplifies the mean Lagrangian coefficient. The calculation of \(\varphi\)(x) in futures space is very complicated.

The SVR approach can make use of a variety of kernel functions, including linear, RBF, and polynomial. The type and parameters of a kernel function are often chosen based on the data, which has a substantial impact on prediction accuracy. There are no standard guidelines that specify which kernel function to use with a given set of data. Given that the choice of kernel parameters directly organizes the complexity of computations, the optimal rates of C and error for the sensitive area should be specified (Zaji et al. 2016; Mohammadi and Mehdizadeh 2020).

SVR model development

The SVR model’s calibration and validation accuracy are used in this work, and the MATLAB 2019b program is used. In each case, the objective is DFSZ, and the predictor sets are the dimensionless parameters from Eqs. (5) and (6). Additionally, the strong nonlinearity of the relationships in Eqs. (5) and (6) is investigated using RBF and polynomial kernel functions. Then, their key parameters i.e., d, ε, C in SVR-poly and γ, ε, C in SVR-RBF models are determined by developing several scenarios and using a trial-and-error process to acquire the ideal structure. In this sense, different ranges and step size are randomly taken into account for kernel amounts. RMSE and R2 values of each scenario are regarded as the evaluation metrics; consequently, a kernel with the maximum R2 and minimum RMSE values is contemplated as the ideal one.

Hybrid SVR-ACO model

ACO is a successful IOM inspired by the actual and group actions of ants in their realm (Dorigo et al. 1999), which is suitable for dealing with issues in the direct paths. The next equations are used to adopt the math behind this algorithm:

$${c}_{i}^{t}={Ant}_{j}^{t}+{c}^{t}$$
(12)
$${d}_{i}^{t}={Ant}_{j}^{t}+{d}^{t}$$
(13)

where \({c}_{i}^{t}\) is the minimum of all variables for \({i}^{\mathrm{th}}\) ant, \({c}^{t}\) is the minimum amongst variables in \({t}^{\mathrm{th}}\) iteration, \({d}_{i}^{t}\) is the maximum of all variables for \({i}^{\mathrm{th}}\) ant, and \({d}^{t}\) is the vector including the maximum of all variables in the \({t}^{\mathrm{th}}\) repetition. \({Ant}_{j}^{t}\) determines the location of the picked \({j}^{\mathrm{th}}\) ant in the \({t}^{\mathrm{th}}\) repetition. The next equations are also applied to transfer the ants at random as

$${c}^{t}=\frac{{c}^{t}}{I}$$
(14)
$${d}^{t}=\frac{{d}^{t}}{I}$$
(15)

where I is a ratio (Dorigo et al. 1996, 1999; Gharehbaghi and Ghasemlounia 2022).

Hybrid SVR-ACO development

The hybrid SVR-ACO model is developed to boost the capability of conventional standalone SVR model for estimating \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\). Since RBF appraises the scattered data with high accuracy, it is employed as the kernel function. In this method, ACO algorithm is coupled with SVR to optimize RBF functional parameters. The differences in kernel amount of RBF function affect meaningly the precision of results.

The layers of the conventional SVR-RBF approach’s membership functions, weights, and biases are trained using ACO. The optimization procedure in the new d-dimensional space needs to be constructed in order to apply ACO to update weights. This methodology modifies each ACO circumstance resident and, as a consequence, regenerates the best possible ACO. In this regard, the SVR-RBF weights are updated based on data obtained from the superior ACO. The original weights are then replicated using the calibration set, and ACO optimized the membership functions of the original weights. When the variance between the SVR output and the original output is less than a specified range, the original weights are then modified and improved with each iteration. The hybrid SVR-ACO model’s simulation process is visually shown in Fig. 8.

Fig. 8
figure 8

Flowchart of hybrid SVR-ACO model

Performance evaluation metrics

In this investigation, the next statistical metrics are applied to assess the capability and accuracy of models used as follows (Kişi 2007):

$$\mathrm{Determination coefficient }\left({R}^{2}\right); {R}^{2}={\left(\frac{\sum_{i=1}^{N}\left({x}_{i}-{\mu }_{x}\right)\left({y}_{i}-{\mu }_{y}\right)}{N{\sigma }_{x}{\sigma }_{y}}\right)}^{2}$$
(16)
$$\mathrm{Root mean square error }\left(RMSE\right); RMSE=\sqrt{\frac{{\sum }_{i=1}^{N}{\left({x}_{i}-{y}_{i}\right)}^{2}}{N}}$$
(17)
$$\mathrm{Mean absolute error }\left(MAE\right); MAE=\frac{{\sum }_{i=1}^{N}\left|{y}_{i}-{x}_{i}\right|}{N}$$
(18)
$$\mathrm{Mean bias error }\left(MBE\right); MBE=\frac{{\sum }_{i=1}^{N}\left({y}_{i}-{x}_{i}\right)}{N}$$
(19)

where N is the number of datasets and xi and yi are the observed and forecasted values of target variables (\(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\)), respectively. σx and σy are the standard deviations of observed and forecasted values of target variables, respectively. μx and μy are the average of observed and forecasted values of target variables, respectively. Lesser amounts for RMSE, MAE, and MBE accompanied by higher amounts for R2 denote a superior modeling performance.

Results and discussion

Variation of DFSZ

According to the experimental measurements, in the same value of δl, DFSZ increased by rising Q. By increasing Q, flow velocity increased and flow passed faster in the downstream of main channel rather than intake channel. Because in the same value of Q by increasing the value of δl from 10 to 20 (cm) the DFSZ increased, δl = 10 ponders as the optimal one.

Table 5 illustrates the maximum percentage reduction of \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) as the representatives of DFSZ in the applied δl, β, and θ in comparison with the control test. Consistent with Table 5, the maximum percentage reduction occurs at θ = 30°, β = 34°, and δl = 1 0 (cm) with the value of 78 and 76%, respectively.

Table 5 Maximum percentage reduction of \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) in the applied δl, β°, and θ

Variation of water depth in the main channel

Table 6 shows the maximum percentage of rising water depth (MPRWD) by using the present submerged multiple-vanes system in the main channel compared to the control test at β = 34° and the applied δl and θ. According to the experimental observations and Table 6, in comparison with the control test, water depth in the main channel by using the current submerged multiple-vane system noticeably increased. Moreover, based on Table 6, δl = 15 (cm) and θ = 30° are confirmed as the optimum ones. Figure 9 demonstrates water depth in the main channel at Q = 25 (l/s) in the θ = 30°, β = 34°, and applied δl.

Table 6 MPRWD in β = 34˚ and applied δl and θ
Fig. 9
figure 9

Depths of flow in the main channel in Q = 25 (l/s), θ = 30°, β = 34°, and the applied δl

Validation of GEP method

The value of genetic operators of GEP to estimate \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) under the optimal scenario are presented in Table 7. The results of GEP model in the validation stage under the optimal scenario for \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) are presented in Table 8.

Table 7 Rate of genetic operators under the ideal scenario in GEP for both \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\)
Table 8 Statistical metrics in the testing stage under the ideal scenario of GEP for \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\)

According to Table 8, the value of MBE indicates that the method for the.

\(\frac{{L}_{s}}{{B}_{2}}\) overestimated and for \(\frac{{W}_{s}}{{B}_{2}}\) underestimated corresponding observed ones. Also, it can be concluded that the used structure in the optimum scenario can be employed with sufficient precision. Mathematical equations generated in the ideal scenario are presented in Eqs. (20) and (21) as follows:

$$\frac{{L}_{s}}{{B}_{2}} = ({\mathit{log}}_{10}(\frac{{L}_{V}}{{y}_{1}}\times \sqrt[2]{4.73\times\upbeta +\frac{{\mathrm{q}}_{2}}{{\mathrm{q}}_{1}}-\frac{{\delta }_{l}}{{y}_{1}}}){)}^{0.25}+Ln(\frac{{H}_{V}}{{y}_{1}}) +{\frac{{\mathrm{q}}_{2}}{{\mathrm{q}}_{1}}}^{1.25}+0.7\frac{{\delta }_{t}}{{y}_{1}}\times \frac{{\delta }_{l}}{{y}_{1}}$$
(20)
$$\frac{{W}_{s}}{{B}_{2}} = (Sin \theta \times \frac{{H}_{V}}{{y}_{1}}{)}^{2}+({Fr}_{1}\times \sqrt[3]{Atan(\beta )} )+(\frac{{L}_{V}}{{y}_{1}}\times \frac{{q}_{2}}{{q}_{1}}\times \frac{{\delta }_{l}}{{y}_{1}})$$
(21)

The estimated \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) by Eqs. (20) and (21) are compared with the corresponding observed ones, and results are within 0–10%. Hence, it can be concluded that these equations can be applied as multivariate mathematical relationships with high precision. R2 of these equations is in the range of 0.91 to 0.93, which statistically can be considered apposite equations only for the initial approximation of \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) in hydraulic studies.

The exact prediction of \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) mainly in the peak values shows the ability of model developed. Figure 10 A and B shows comparison plots between the observed and predicted \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) by the ideal scenario in the testing stage, respectively.

Fig. 10
figure 10

Comparison plots between the observed and predicted, A \(\frac{{L}_{s}}{{B}_{2}} \mathrm{and}\) B \(\frac{{W}_{s}}{{B}_{2}}\), in the testing stage under the ideal scenario of GEP method

Based on Fig. 10 A and B, it can be inferred that GEP method has partially failed to fit data, particularly in the peak values that indicate some deviations in comparison with the measured ones.

Validation of SVR model

The values of C, ε, d, and γ in RBF and polynomial kernel functions in the optimal scenario are given in Table 9. Also, the number of input and output neurons were achieved 3 and 1, respectively. The values of statistical metrics in the testing stages under the optimal scenario are presented in Table 10.

Table 9 Kernel factors in SVR-RBF and SVR-polynomial models under the optimal scenario
Table 10 Statistical metrics in the testing stage under the ideal scenario of SVR method for \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\)

According to Table 10, the value of MBE indicates that the models in the testing stage have overestimated corresponding observed ones. Also, it can be concluded that SVR-RBF has outperformed SVR-polynomial model.

Validation of SVR-ACO model

In this hybrid model, RBF is used as the appropriate kernel function based on the findings of the SVR model. The value of its optimized kernel parameters is obtained as C = 250, γ = 0.08, and ε = 0.009. Also, the optimal number of input and output neurons is obtained as 3 and 1, respectively.

Every inhabitant of the ACO scenario is adjusted by the hybrid SVR-ACO, which then outputs the best ACO. With 100 as the population size, 1000 as the maximum number of repetitions, 20 as the initial pheromone, 0.02 as the heuristic exponential weight, 0.5 as the pheromone exponential weight, and 0.1 as the pheromone’s evaporation rate, the optimal scenario results in the optimal values of ACO parameters.

The values of statistical metrics in the validation stage for developed hybrid SVR-ACO in the optimal scenario are given in Table 11.

Table 11 Statistical metrics in the validation stage under the ideal scenario of hybrid SVR-ACO model for.\(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\).

The value of MBE signifies that the model in the validation stage for the \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) has overestimated corresponding observed ones. Consistent with Table 11, it can be inferred that the model can be utilized to predict with high accuracy. However, the performance of model in predicting \(\frac{{L}_{s}}{{B}_{2}}\) is fairly higher than \(\frac{{W}_{s}}{{B}_{2}}\).

Figure. 11 A and B demonstrates a comparison plot of the observed and forecasted in the validation stage under the optimal scenario

Fig. 11
figure 11

Comparison plots between the observed and forecasted, A \(\frac{{L}_{s}}{{B}_{2}}\) and B \(\frac{{W}_{s}}{{B}_{2}},\) in the validation stage by the ideal scenario of hybrid SVR-ACO model

As seen in Fig. 11 A and B, because of the high ability of model, it can appositely capture the variation trend of the measured \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\), particularly in the peak and lowermost values. Figure 12 A and B demonstrates a scatter plot comparing the observed and forecasted \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\) in the validation stage by the optimal scenario, respectively.

Fig. 12
figure 12

Scatter plots between the observed and forecasted: A \(\frac{{L}_{s}}{{B}_{2}}\) and B the \(\frac{{W}_{s}}{{B}_{2}}\) in the validation stage by the optimal scenario of hybrid SVR-ACO model

It can be seen from the trend line equation (presume that the equation is y = aox + a1) in the scatter plot that a0 and a1 coefficients for the model are, respectively, closer to 1 and 0 with a high value of R2.

Performance comparison of models used

Accompanied by Tables 8, 10, and 11 and Figs. 10, 11, and 12, it can be deduced that developed DDMs have almost acceptable precision with slight differences in the prediction of DFSZ. In this study, the best method is picked via the model grading process suggested by Vaheddoost et al. (2016). In this method, SG (success grade) and FG (failure grade) take into account performance pivotal metrics expressed as follows:

$$\mathrm{Spearmans rho correlation coefficient}\left(\rho \right); \rho =1-\frac{6\sum {d}_{i}^{2}}{{N}^{3}-N}$$
(22)
$$\mathrm{Performance index}\left(PI\right); PI=\frac{1}{N}{\sum }_{i=1}^{N}\frac{{y}_{i}}{{x}_{i}}$$
(23)
$$SG{\left({R}^{2}\right)}_{i}=\frac{{R}_{i}^{2}}{{R}_{max}^{2}}\times 10$$
(24)
$$\mathrm{S}G{\left(CC\right)}_{i}=\left|\frac{{CC}_{i}}{{CC}_{max}}\right|\times 10$$
(25)
$$FG{\left(PI\right)}_{i}=\left(\frac{{\left|1-PI\right|}_{min}}{{\left|1-PI\right|}_{i}}\times 10\right)-10$$
(26)
$$FG{\left(RMSE\right)}_{i}=\left(\frac{{RMSE}_{min}}{{RMSE}_{i}}\times 10\right)-10$$
(27)

where N is the number of datasets and di is the difference in paired ranks; xi and yi are measured and forecasted values of \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\). Total grade (TG) of each model is gained by adding up FG and SG of each model distinctly that can differ between − 20 and + 20 (Table 12) and is expressed as

Table 12 Results of all models in A and B, the test data set, and related grades
$$TG=SG(CC)+SG({R}^{2})-FG(RMSE)-FG(PI)$$
(28)

Consistent with TG values in Table 12, the hybrid SVR-ACO model is considered the best method for the prediction of \(\frac{{L}_{s}}{{B}_{2}}\) and \(\frac{{W}_{s}}{{B}_{2}}\).

Conclusion

In this research, the dimensions of flow separation zone (DFSZ) in a combined submerged multiple-vane system in diverse θ, δl, and Q on the steady and free overflow conditions were investigated. The technique used a butterfly array of applied vanes, which were composed of two portions, curved and flat plates. In order to evaluate the impact of the current system on DFSZ and flow pattern, extensive lab work was done over the scaled-down model in this regard. All experiments were carried out in a range of 0.1 < Fr1 < 0.5. Certain features about interaction effect were discussed, which can be supportive in design of submerged multiple-vane systems at initial phase. A summary of the experimental results is presented as follows:

  1. 1.

    Pre-processing by Spearman’s rho and Pearson’s correlation coefficients showed that the\(\frac{{q}_{2}}{{q}_{1}}, \frac{{\delta }_{b}}{{y}_{1}},\frac{{\delta }_{l}}{{y}_{1}} ,\frac{{\delta }_{t}}{{y}_{1}},\frac{{L}_{V}}{{y}_{1}},\frac{{H}_{V}}{{y}_{1}}\),\(\beta\) and \(\theta ,\beta , \frac{{q}_{2}}{{q}_{1}}, \frac{{\delta }_{l}}{{y}_{1}},{Fr}_{1},\frac{{L}_{V}}{{y}_{1}},\frac{{H}_{V}}{{y}_{1}}\) were potential input variables to predict \(\frac{{L}_{s}}{{y}_{1}}\) and\(\frac{{W}_{s}}{{y}_{1}}\), respectively. Moreover, sensitivity analysis by the cosine amplitude technique verified the \(\frac{{H}_{V}}{{y}_{1}}\) and \(\frac{{L}_{V}}{{y}_{1}}\) as the most effective variables in estimating the \(\frac{Ls}{{B}_{2}}\) and \(\frac{Ws}{{B}_{2}}\) as the target parameters, respectively.

  2. 2.

    In the same value of δl, DFSZ increased by increasing Q. Also, in the same value of Q, by increasing δl, DFSZ increased and, consequently, δl = 10 (cm) was selected as the optimum option. trial

  3. 3.

    In comparison with the control test, the maximum percentage of reduction in DFSZ occurred at θ = 30˚, β = 34°, and δl = 10 (cm) with the value of 78 and 76%, respectively. It caused a flow suitable length profile in the main and lateral intake channels and diminished flow turbulence in the intake position. Besides, the maximum percentage of rising water depth at β = 34° occurred at δl = 15 (cm) and θ = 30° with a value of 43%.

  4. 4.

    Based on the observations of flow-field, the specific nature of the present submerged multiple-vane system induced to generate more severe turbulence in the shear layer at the linking boundaries of the main channel and lateral intake channel compared to the control test.

Besides, in this study, several DDMs were characterized through a trial-and-error process to estimate DFSZ. Assessment of DDMs was conducted in the testing stages. For this aim, several performance criteria, namely, R2, CC, PI, RMSE, MAE, MBE, and total grade (TG) were applied. The outcomes confirmed that the hybrid SVR-ACO model was the most effective and precise DDMs were created for the estimation of DFSZ. It should be emphasized that even though all possible DFSZ variables were evaluated in this work, the conclusions cannot be applied to other variations of the submerged multiple-vane system. The results of this study will help with the construction of hydraulic structures, diversion channels, sediment movement in rivers, and other things. The authors strongly encourage conducting experiments using various array types in open channels.