Introduction

Concrete has always been considered a brittle material that cannot withstand tensile or shear pressures. When exposed to external loads, reinforced concrete beams are susceptible to high shear stresses. Any reinforced concrete (RC) member that cracks experiences a different state of stress than the theory of linear elasticity predicts because the region between the flexural tension and flexural compression zones is compromised. Consequently, it becomes crucial to consider how shear transmits in a cracked RC member. It is challenging to comprehend the relative contribution of the transmission mechanisms to the shear capacity because of the complicated interactions of shear stresses [6, 7]. The following shear transfer methods can be found in the ASCE-ACI Committee 445 report [2]:

  • Shear strains in the uncracked section's flexural compression zone.

  • Crack friction, interface shear transmission, or aggregate interlock.

  • Dowel's action.

  • The transient tensile stress transfer.

Practitioners' primary concern is preventing these tensile and shear failures. Thus, various methods have been introduced to enhance shear resistance, including steel fibres, which have been improved in the last decade. Tests on the concrete fibre material in both structural and non-structural applications have shown it to be reliable. Ground- and pile-supported slabs, tunnel linings, different precast components, raft foundations, industrial pavement, and intricate architectural forms are some examples of steel fibre-reinforced concrete's practical applications [19].

Researchers have explored the parameters that impact the shear since the early 1970s. Batson et al. [18] conducted a study that looked at the influence of the volume percentage of steel fibres (\(Vf\) and the shear span to depth ratio (a/d; and found that adding steel fibres substantially increases the shear strength of members. Further research by Swamy and Bahia [36, 37] demonstrated that the shear strength of steel fibre reinforced concrete (SFRC rectangular and T-shaped beams was much higher than that of the reinforced concrete beams without shear reinforcement. The shear performance of SFRC members was examined by Mansur et al. [29], Lim et al. [28], and Narayanan and Darwish [30]. By utilizing test factors such as concrete compressive strength (\(fc\)), the volume fraction of steel fibres (\(Vf\)), shear span/effective depth ratio (a/d), and longitudinal steel ratio (\(\rho\)), they sought to explain the complex shear process of SFRC beams. Shear experiments on SFRC beams were undertaken by Narayanan and Darwish [30] and focused on the impact of tensile strength. They suggested a shear strength equation based on the shear contributions of steel fibre and concrete under the assumption of a 45 °crack angle, with a fibre factor (\(F\)) incorporated to account for the effect of geometry and volume percentage of fibre on the SFRC member's shear capacity.

By revising the shear strength equations of ACI 318 [1] and Zsutty [40], Ashour et al. [17] created two equations for determining the shear strength of steel fibre reinforced high-strength concrete (SFRHC) beams based on shear testing of SFRHC beams (1968). In addition, Kwak et al. [27], Oh and Kim [32], and Karl et al. [24] proposed shear strength estimation equation based on the acquired experimental data for SFRC and SFRHC beams. In 2021, Sharifi and Moghbeli [35] and Tarawneh et al. [39] used the artificial neural network (ANN) approach to refine the shear prediction presented in the literature and they were able to get a better prediction compared to previous studies.

Based on this review, it is evident that there is a need to examine the accuracy of the available equations of the shear strength of steel fibre-reinforced concrete beams. In addition, it is also clear that there are limited attempts to implicate soft computing approaches to develop robust models to predict the shear capacity of SFRC beams, where only the ANN is utilized. However, evidence in the literature showed that soft computing approaches could provide better predictions compared to the ANN [3, 4, 8, 22, 34]. Thus, this study uses a novel approach that combines regression analysis with the AI (EPR-MOGA) to develop new correlation to predict the shear strength of steel fibre-reinforced concrete beams. In addition, this study also assesses the accuracy of the available equations developed to predict the shear capacity of SFRC beams.

Available equations to predict the shear strength of steel fibre reinforced concrete beams

A survey of the literature was carried out to collect and understand the available predictive equations of the shear strength of steel fibre-reinforced concrete beams (Swamy and Al-Taan, [38]; [29, 30], Al-Ta'an and Al-Fee, [5]; [17, 20, 23, 25,26,27, 35, 39]. The result of this literature survey was 12 predictive equations. These equations are listed in Table 1. These equations will be assessed as part of this research to check their accuracy and see if these equations could be used with confidence in practice. The assessment will also show the performance of the model developed in this research.

Table 1 Predictive equations available in the literature of the shear strength of SFRHC beams

\(\rho\): is the flexural reinforcement ratio, \({f}_{c}^{^{\prime}}\): is the cylinder compressive strength, \({f}_{\mathrm{cu}}^{^{\prime}}\): is the cube compressive strength, and it can be taken as \(0.8{f}_{c}^{^{\prime}}\), \(\tau\): is the average fibre matrix interfacial bond stress, \(a\): is the shear span, \(d\): is the effective depth, \(\lambda\): is the arch action factor, and it is equal to 1.0 for \(a/d > 2.8\), and \(2.8 d/a\) for\(a/d \le 2.8\), \(F\): is the fibre factor, and it is equal to \(({l}_{f}/{d}_{f}){V}_{f}\), \({l}_{f}\): is the fibre length, \({d}_{f}\): is the fibre diameter, \({V}_{f}\): is the volume fraction of steel fibres, \(\Omega\): is the arch action factor and it is equal to 1.0 for \(a/d > 3.4\), and \(3.4 d/a\) for\(a/d \le 3.4\), \(e\): is the arch action factor and it is equal to 1.0 for, \(a/d > 2.5\), and \(2.5 d/a\) for\(a/d \le 2.5\), \(V\): is the shear force, \(M\): is the bending moment, \(\alpha\): is the arch action factor and it is equal to 1.0 for \(a/d \ge 2.5\), and \(2.5 d/a\le 3\) for\(a/d < 2.5\), \({A}_{u}\): is a factor and it is given as 0.65, \({B}_{u}\): is a factor and it is given as \({B}_{u}=0.01162\frac{{l}_{f}}{{d}_{f}}-0.214\), \({C}_{u}\): is a factor and it is given as \({C}_{u}=-0.00552\frac{{l}_{f}}{{d}_{f}}+0.24\), \({D}_{u}\): is a factor and it is given as \({D}_{u}=0.00045\frac{{l}_{f}}{{d}_{f}}-0.026\), and\(d\): is the effective depth.

Methodology

Data collection

As a result, database has been compiled from the study of Sharifi and Moghbeli [35], who developed this database using the results of experimental studies available in the literature. In total, 235 test results (i.e., data points) have been collected and used. The collected data are the fibre content by volume (\(vf\)), the length of the fibre (\(lf\)), the diameter of the fibre (\(df\)), the ratio of the reinforcement in the longitudinal direction (\(\rho 1\)), the effective depth of the beam (\(d\)), the shear span (\(a\)), the compressive strength of the concrete (\({f}_{c}^{^{\prime}}\)), and the shear capacity of the beam (\({v}_{frc}\)).

A basic statistical analysis is carried out to find the basic statistical indicators of the collected database: the maximum value, minimum value, standard deviation, mean, and range. Table 2 presents the obtained results from this analysis.

Table 2 Basic statistical indicators of the collected database

EPR-MOGA

Evolutionary polynomial regression analysis (EPR-MOGA) is used to aid the development of the new model. EPR-MOGA stands for multi-objective evolutionary polynomial regression analysis, an intelligent computational method that leverages input data to generate a novel solution to a specific problem [10, 14]. This method is based on regression analysis and employs a genetic algorithm (GA) to create a mathematical model that can explain the relationship between physical input variables [21, 31]. The EPR-MOGA uses GA to find the best model automatically. This GA is further refined by the addition of several objectives to control model complexity and ensure the new model's accuracy and fitness [10, 21]. As a result, one of the key features of this regression method over traditional regression is that the searched algorithm automatically finds the optimal mathematical model. The EPR-MOGA does this by selecting candidate exponents and examining the accuracy of the developed correlation using a coefficient called the coefficient of determination (CD) (shown in Eq. 1). Then, the EPR-MOGA identifies, by several trails, the exponents arrangement that achieve the best accuracy. Also, in contrast to regression analysis, where the user must manually try every conceivable association, the user must provide the model structure, the number of model terms, and the range and step of the exponents. Also, the EPR-MOGA tackles the overfitting problem associated with conventional regression analysis methods that do not use AI. Further details on this method could be found in other publications (e.g., [9, 10, 12, 13, 41]. Figure 1 shows a flow chart of the EPR-MOGA analysis, which illustrates how the EPR-MOGA analysis works.

$$CD=1-\frac{{\sum }_{n}{({Y}_{a}-Y)}^{2}}{{\sum }_{n}{({Y}_{a}-\frac{1}{n}\sum_{n}{Y}_{p})}^{2}}$$
(1)

where, \({Y}_{a}\) is the dependent input value (i.e., the measured shear strength); \(Y\) is the predicted dependent input value from the EPR-MOGA correlation (i.e., the predicted shear strength); and \(n\) is the number of data points.

Fig. 1
figure 1

Simple flow chart of the steps of the EPR-MOGA analysis [10, 15]

Statistical assessment

The performance of the existed equations and the proposed model are checked by calculating the mean absolute error (\(\mathrm{MAE}\)), root mean square error (\(\mathrm{RMSE}\)), mean (\(\mu\)), and coefficient of determination (\({R}^{2}\)). These accuracy checks are calculated with the aid of excel and using Eqs. 25 [42,43,44].

The overall error is checked by assessing the values of the \(\mathrm{MAE}\) and \(\mathrm{RMSE}\). In addition, the overall over-and-under estimation is examined using the value of the mean, where a value higher than 1 indicates an overall over-estimation of the shear capacity and a value lower than 1 means an overall under-estimation in the prediction. Furthermore, the overall accuracy of the prediction is examined using the value of the \({R}^{2}\).

In addition to the accuracy checks, an error analysis is also carried out to investigate the cumulative frequency of the error to provide insight into the existing equations' performance and the new model.

$$\mathrm{MAE}= \frac{1}{n}\sum_{1}^{n}\left|{v}_{\mathrm{frc} (p)}-{v}_{\mathrm{frc} (m)}\right|$$
(2)
$$\mathrm{RMSE}=\sqrt{\frac{1}{n}\sum_{1}^{n}{\left({v}_{\mathrm{frc} (p)}-{v}_{\mathrm{frc} (m)}\right)}^{2}}$$
(3)
$$\mu =\frac{1}{n}\sum_{1}^{n}\left(\frac{{v}_{\mathrm{frc} (p)}}{{v}_{\mathrm{frc} (m)}}\right)$$
(4)
$${R}^{2}={\left(\frac{\sum_{i=1}^{n}({v}_{\mathrm{frc} (p)}-{{v}_{\mathrm{frc} (p)}}_{\mathrm{average}})({v}_{\mathrm{frc} (m)}-{{v}_{\mathrm{frc} (m)}}_{\mathrm{average}})}{\sqrt{\sum_{i=1}^{n}{({v}_{\mathrm{frc} (p)}-{{v}_{\mathrm{frc} (p)}}_{\mathrm{average}})}^{2}\sum_{i=1}^{n}{({v}_{\mathrm{frc} (m)}-{{v}_{\mathrm{frc} (m)}}_{\mathrm{average}})}^{2}}}\right)}^{2}$$
(5)

where, \(n\) is the number of inputs involved in the calculations, \({v}_{\mathrm{frc} (p)}\) is the predicted shear capacity of the SFRC beam, and \({v}_{\mathrm{frc} (m)}\) is the measured shear capacity of the SFRC beam.

Results

The presentation of the new model, the assessment of the existed equations, and the error analysis of the new model and the existed equations are discussed in this section as follows:

EPR-MOGA new model

The database discussed in “Data collection” Section is utilized to develop the new predictive model of the shear capacity of SFRC beams. This database is divided into two group: training group and testing group. 80% of the data are used as training data and the remaining data are considered a testing data. The training group is utilized to train the EPR-MOGA model, and the testing group is employed to assess the model's accuracy in predicting unseen scenarios that were not used in its development (i.e., training). The data separation into training and testing is conducted carefully to ensure that the maximum, minimum, and range of the testing data group are within the same range of that of the training data group to avoid extrapolations in the predictions of the model in the testing stage [11, 16]. After completing the dividing stage, this has been ensured by conducting a statistical analysis on both data groups. Tables 3 and 4 present the basic statistical indicators of the two group to demonstrate that the range of the testing data group employed in this study was within that of the training data.

Table 3 Basic statistical indicators of the collected database
Table 4 Basic statistical indicators of the collected database

The development of the model is carried out after dividing the data, where the two data groups are provided into the algorithm of the EPR-MOGA to search for the best fit model and to test (validate) it. Many trials were carried out in this stage to examine the influence of the model’s number of terms and exponents range on its prediction accuracy. As a result, the best model obtained from the EPR-MOGA is shown in Eq. 6. The statistical accuracy measures of the new model (i.e., \(\mathrm{MAE}\), \(\mathrm{RMSE}\), \(\mu\), and \({R}^{2}\)) are presented in Table 5 for both data groups. In addition, Fig. 2a, b present the comparison of the predictions of the model in comparisons to the zero-error line for training and testing data groups, respectively. Table 5 and Fig. 2a and b provide a testament to the new model's accuracy for both data groups as the model scored low error, mean value slightly higher than the optimal value and coefficient of determination slightly lower than the optimal value.

Table 5 Statistical assessment of the proposed model using training and testing data
Fig. 2
figure 2

Relationship between predictions of the new model and the measured values of \(Vfrc\) for: a training group; b testing group

$${V}_{frc} = -3487.502 d/({\frac{{L}_{f}}{{D}_{f}})}^{3}({\frac{a}{d})}^{3} +72967859.62 {\rho }_{1}^{2}/({\frac{{L}_{f}}{{D}_{f}})}^{2} ({\frac{a}{d})}^{1.5} -202.1093 {\rho }_{1}^{0.5}/({\frac{{L}_{f}}{{D}_{f}})}^{0.5} + 365.6352 {\rho }_{1}^{0.5} /{d}^{0.5} - 665137.294 {\rho }_{1}^{2} /d ({\frac{a}{d})}^{1.5}+ 0.0020068 {V}_{f}^{0.5}{d}^{1.5} + 0.12937 {V}_{f}{F}_{c}^{2}/{\rho }_{1}^{2}{d}^{1.5}({\frac{a}{d})}^{2} - 164655.6281 {V}_{f}^{3}/({\frac{a}{d})}^{2} + 0.5481$$
(6)

Assessment of the available equations

A statistical assessment is carried out in this section to check the predictions accuracy of available equations, where the \(\mathrm{MAE}\), \(\mathrm{RMSE}\), \(\mu\), and \({R}^{2}\) were calculated for the available equations and compared with the values obtained for the new model (presented in Table 5). Notably, the data groups developed in “EPR-MOGA new model” Section (training and testing) are also utilized here to provide clear comparisons of the accuracy with the new model.

Tables 6 and 7 present the results of the accuracy assessment of these equations for both training and testing data groups. In addition, Figs. 3 and 4 compare the existing equations' predictions against the zero-error line. It is clear from the tables that the available equations scored an error higher than the new model (i.e., higher values of \(\mathrm{MAE}\) and \(\mathrm{RMSE}\)). In addition, the scored values of the \({R}^{2}\) of the available equations are lower than that scored by the new model. Figures 3 and 4 also show that the predictions of the available equations have significant scatter compared to the zero-error line, while the new model showed less scatter (Fig. 2a, b). Thus, it could be stated that the new model provides better accuracy than the available equations. Importantly, it should be noted that the equation of Sharifi and Moghbeli [35] scored lower \(\mathrm{MAE}\) and \(\mathrm{RMSE}\), and higher \({R}^{2}\) compared to other available equations for both data groups.

Table 6 Statistical assessment of the available equations using training data
Table 7 Statistical assessment of the available equations using testing data
Fig. 3
figure 3figure 3

Relationship between measured and the predicted \(V\mathrm{frc}\) using the available equations for the training data group

Fig. 4
figure 4figure 4

Relationship between measured and the predicted \(V\mathrm{frc}\) using the available equations for the testing data group

Error analysis

Finally, an error analysis is carried out to compare the accuracy of the new EPR-MOGA model with the available equations in the literature. This was done by obtaining and plotting the relationship between the cumulative frequency and error level using the methodology employed by many previous studies (e.g., [11, 15, 33]. The results are presented in Fig. 5 for the new model (denoted as EPR-MOGA) and the available equations. It is evident from the figure that the new model scored significantly better than the available equations as it achieved higher frequency with lower error compared to the available equations. For instance, the new model scored 70% cumulative frequency at an error level of 20%, while the available equations scored cumulative frequency between 16 and 53%. It is also clear from Fig. 5 that the equation of Sharifi and Moghbeli [35] scored second, and thus, it could be considered as the best available equation. Lastly, the equation developed by Tarawneh et al. [39] provided the worst cumulative frequency of error for error range of 20–59%.

Fig. 5
figure 5

Results of the cumulative frequency and error level relationship for the new model and the available equations

In conclusion, it is evident based on the assessment presented in Sections “EPR-MOGA new model” “Assessment of the available equations” and “Error analysis” that the new model is superior and outperformed the available equations. Thus, the new model could be used to reduce uncertainty and increase the accuracy in the calculation of the shear capacity.

Conclusions

Using data collected from the literature, this study used the cutting-edge EPR-MOGA technique to develop new equation to predict the shear capacity of steel fibre reinforced concrete (SFRC) beams. The outcomes demonstrated that the new strategy led to significantly higher accuracy. Therefore, the results of this work offer a potentially powerful and straightforward approach for designers and have shown enhanced accuracy compared to previous equations available in the literature to predict shear capacity of SFRC beams. The following conclusions can be formed, taking into account the limitations of the work presented:

  1. 1

    The proposed equation showed accuracy better than the available equations in the literature, with \({R}^{2}\) equal to 0.93 for the training set and 0.93 for the testing set compared to the equations available in the literature where these equations scored \({R}^{2}\) between 0.01 and 0.89. In addition, the new model also scored lower error (i.e., lower \(\mathrm{MAE}\) and \(\mathrm{RMSE}\)) compared to the available equations.

  2. 2

    The current model achieved better performance as it has been developed using this database and therefore, the

  3. 3

    The best equation available in the literature to predict the shear capacity of SFRC beams is the equation proposed by Sharifi and Moghbeli [35], where it recorded \({R}^{2}\) equal to 0.87 for the training set and 0.89 for the testing set. This equation also scored better than the other available equations in terms of the cumulative frequency of error.

  4. 4

    Based on the cumulative frequency of error, it could be stated that the equation of Tarawneh et al. [39] provides the poorest predictions for error level range of 20–59%.

Finally, it is important to state that one of the reasons that the model developed in this research achieved better performance than the other empirical equations is because this model has been developed using the same database which has been used to examine the accuracy performance, while the other empirical equations have been developed using other limited datasets. However, it is significant to stress that our work has demonstrated a rational methodology for putting into practice a soft computing strategy employing an AI algorithm to create novel robust model that could be used with confidence to predict the shear capacity of steel fibre reinforced concrete (SFRC) beams.