1 Introduction

Concrete is the most popular material employed in construction around the world. The demolition of old structures and the construction of new ones leads to an increased accumulation of Construction and Demolition Waste (CDW), causing environmental issues. Moreover, producing new concrete involves utilizing non-renewable resources like cement and natural coarse aggregates. To cope with this, various studies have been conducted on recycling CDW to reduce the demand for natural resources and cut down on construction waste [1,2,3,4]. As concrete is the largest source of CDW, many research studies have examined the use of Recycled Coarse Aggregate (RCA) to produce Recycled Aggregate Concrete (RAC) and assess its mechanical properties and durability in comparison to those of Natural Coarse Aggregate Concrete (NCAC) [5,6,7] given that coarse aggregate makes up 60–75% of the volume of concrete. Nevertheless, test results have been discouraging as concrete with RCA has demonstrated inferior properties compared to concrete with NCAC [5,6,7,8].

RCA is created by crushing concrete debris that is 90 to 95% of the total mass and has a contamination level at or below 1%. This aggregate is taken from demolition waste that is clean and free of any sound [9, 10].

Recently, recycled aggregate obtained from construction and demolition waste has been gaining considerable attention, due to its potential use in green concrete structures. Many countries have been motivated to substitute natural aggregates with recycled aggregate in the creation of concrete, due to the shortage of natural aggregates, rising transportation expenses, limited dumping sites, and environmental contamination. Consequently, numerous studies have been conducted to ascertain how waste can be employed in the construction industry [11].

Despite a great deal of research being done to examine the rheology and durability of recycled aggregate, only a few studies have looked into the behavior of recycled aggregate concrete structural members [12,13,14,15]. These studies are particularly important, as it is hard to make predictions about the performance of reinforced recycled aggregate concrete members based solely on the tests done on the material characteristics of the recycled aggregate [16].

Analysis of current design equations by simply comparing them with experimental data may lead to inaccurate results because the gathered information may include incorrect data [17]. The research [18] suggests a Bayesian method to evaluate various shear prediction models of RAC beams. This Bayesian parameter estimation eliminates the fragility of any conclusion obtained from the traits of the collected data. The risk of wrong results is minimized by employing Bayesian parameter estimation, as it provides a full distribution of believable values in terms of mean, standard deviation, and the effect size of experimental-to-calculated capacity values.

Muttoni and Ruiz [19] have discussed critical shear crack width theory, including developing and activating the arching action in reinforced concrete beams subjected to shear. The critical shear crack, which can be taken regarding the effective depth of the reinforcements (d), is identified as one parameter that governs the shear strength, along with its location, width, and aggregate size.

From what can be gathered from the previous literature on the shear capacity of RAC beams, it is clear that a more comprehensive analysis is required to confirm whether or not existing design codes are accurate in their predictions. Although the code equations may be overly conservative in their predictions of the shear capacity of RAC beams, it is essential to accurately determine the degree of this conservatism for further safety and reliability of the shear design of these beams [18].

The shear strength of reinforced recycled aggregate concrete beams without stirrups is of great importance as it directly affects the structural integrity of the beam. Without stirrups, the beam is more vulnerable to failure, potentially leading to a collapse which can have serious consequences for the building and its occupants. Investigating shear strength helps to ensure that the beam can withstand the loads and forces of the environment and execute its function successfully. However, the shear strength of concrete is a very intricate phenomenon that cannot be simply deduced from the properties of the elements used [20]. Research has been done to find replacements for natural aggregates, including coal bottom ash, waste glass, waste marble powder, and other items [21,22,23].

This study particularly looks into replacing natural aggregates with recycled aggregates from demolition waste and assessing the shear strength of the reinforced concrete members. As shear strength is a complex matter and there is limited research on the use of recycled aggregates theoretically, more in-depth investigation into using RA in concrete structures is necessary.

This research is exploring the influence of different parameters and ranges on the shear force of concrete (Vc) beams made with coarse recycled aggregate (RCA) from demolition waste. It will contribute new information and data to the existing body of knowledge on the topic, assisting developers of structural codes in formulating equations for predicting shear strength in recycled concrete. In an age where large data sets are highly valued, more records are essential for refining construction codes that promote sustainable materials, particularly for complicated behaviour based on empirical evidence [20].

Machine Learning (ML) has gained popularity in the field of engineering due to its ability to establish connections between input and output data [24,25,26,27,28,29,30]. In recent studies [24, 26,27,28,29], ML techniques like Multi-Objective Genetic Algorithm Evolutionary Polynomial Regression (MOGA-EPR), Genetic Expression Programming (GEP), Multivariate Adaptive Regression Splines (MARS), Model Tree (MT), and Extreme Learning Machine (ELM) have been employed to explore the mechanical properties and durability of concrete. These methods are utilized for predicting the behavior of concrete materials [24,25,26,27,28,29,30].

This paper investigates the shear strength of coarse recycled aggregate (RCA) beams without stirrups. Understanding that RCA affects the shear strength is an important factor to consider in the design toward the net-zero policy. Reference [31] studied the effect of the RCA substitution rate and the longitudinal reinforcement ratio (ρw) on the Vc of the beams. Five different substitution levels (0%, 25%, 50%, 75%, and 100%) and two reinforcement ratio percentages (1.16% and 1.81%) were selected for the study. Two NCAC beams and eight RAC beams without stirrups were tested on one-third of the beam span as described in the experimental work of reference [31]. The results of RCA beams without stirrups from other studies were also compiled and evaluated to evaluate the accuracy of the existing equations.

Multi-Objective Genetic Algorithm Evolutionary Polynomial Regression (MOGA-EPR) and Gene Expression Programming (GEP) are powerful computational tools, that are especially useful in estimating concrete strength [24, 25]. MOGA-EPR uncovers the relationships between physical input variables crucial to determining shear strength in this particular concrete by applying the genetic algorithm (GA) to regression analysis. This method outperforms classic regression algorithms by eliminating overfitting problems, resulting in more accurate and dependable predictions. Engineers and researchers can use MOGA-EPR to adjust the correlation structure, exponent ranges, and term numbers to the particular properties of recycled aggregate concrete, allowing for improved design and construction decisions for long-term infrastructure projects.

2 Current code provision for shear in the beam without stirrups

In this paper, a comprehensive empirical evaluation was carried out to predict the shear strength of concrete members without stirrups. The findings of prior research show that the shear capacity of reinforced concrete (RC) beams is lowered when natural aggregate is replaced with recycled aggregate [12, 15, 32]. To see if the shear capacity prediction equations outlined in Table 1 are suitable for use when designing RAC beams, they need to be tested against experimental results. Therefore, further evaluation is necessary.

The ACI 318-14 [33], ACI 318-19 [34], Eurocode 2 [35], Indian Standard:456 [36], and the equation proposed by Bazant and Yu [37, 38] consider the longitudinal reinforcement ratio (ρw), compressive strength of concrete (fc), and the beam effective depth (d) when predicting the shear capacity of RC beams. Furthermore, CEB-FIP [39], Zsutty [40, 41], Niwa et al. [42], Gastebled and May [43], Kim and Park [44], Rebeiz [45], New Zealand code [46], and Arslan [47] include the shear span-to-depth ratio (l/d) in their models. Bazant and Sun [48] and Bazant and Kim [49] developed their equations based on the fracture mechanics approach, accounting for the compressive strength of concrete, longitudinal reinforcement ratio, the effective depth of the beam, and shear span-to-depth ratio. Russo et al. [50] and Pradhan et al. [51] also consider these parameters, but Rahal and Alrefaei [52], the modified ACI 318-19 and Pradhan et al. [51] are the only ones from the literature include the aggregate replacement ratio (RCA) and maximum aggregate size (dmax.).

In this article, the 18 equations in Table 1 will be examined with the Gene Expression Programming (GEP) and Multi-Objective Genetic Algorithm Evolutionary Polynomial Regression (MOGA-EPR) models that have been developed and various statistical and numerical contrasts will be highlighted and discussed later on in the article.

Table 1 The current code equations for the shear force of concrete members without stirrups

3 Methodology

This paper investigates the potential of predicting the shear force of concrete (Vc) beams without stirrups, using machine learning techniques. To do this, experimental datasets were gathered based on reference [31], and the values of the shear force of concrete were calculated according to Eqs. (1) to (18) listed in Table 1, using the following input variables; the percentage of RCA replacement ratio, compressive strength (fc), effective depth (d), beam width (b), shear span-to-effective depth ratio (a/d), and longitudinal reinforcement ratio (ρw). Three models employing GEP and MOGA-EPR machine learning techniques were used to estimate the shear force of the concrete beams without stirrups with different RCA percentages.

The procedure outlined in Fig. 1 is the approach adopted in this paper for determining the shear force of concrete (Vc) beams without stirrups. It begins with collecting and analyzing the data statistically from reference [31], which is used to form GEP and MOGA-EPS models and create various equations that predict the shear force of concrete. Following this, data grouping for training and testing the predicted equations, then the statistical indicators for the data are computed. Subsequently, the results from the models are compared to existing equations (in Table 1).

Fig. 1
figure 1

Flowchart process for this paper methodology

3.1 Data collection and statistical analysis

A comprehensive experimental shear database was collected to assess the accuracy of the equations in Table 1 and to formulate models from the GEP and MOGA-EPR methods for predicting the concrete shear strength of RAC beams. The compilation comprises 128 results from RAC beams tested by other researchers [12,13,14,15, 32, 44, 51, 52, 54,55,56,57,58,59,60,61,62]. The parameters taken into account during this investigation were the percentage of RCA replacement ratio, compressive strength (fc), effective depth (d), beam width (b), shear span-to-effective depth ratio (a/d), and longitudinal reinforcement ratio (ρw). The observed shear force of concrete at the point of failure in all the specimens was referred to as Vc, experimental, while Vc was calculated in accordance with the equations in Table 1 and the model established in this study.

Table 2 shows the statistical measures for the collected experimental dataset provided by the reference [31].

Table 2 Statistical measures of the collected data

3.2 Data grouping

This research assessed the capability of two Gene Expression Programming (GEP) models as well as a Multi-Objective Genetic Algorithm Evolutionary Polynomial Regression (MOGA-EPR) model and compared them to the equations listed in Table 1. To guarantee accuracy, the data was split into two parts: 80% for training the models and 20% for testing. The statistical values associated with the training and testing data sets respectively for both the GEP and MOGA-EPR models are shown in Tables 3 and 4.

Table 3 Statistical measures of the training dataset
Table 4 Statistical measures of the testing dataset

3.3 Developing the models

In this paper, a study is conducted to assess the effectiveness of two methods, GEP and MOGA-EPR, in predicting the shear force (strength) of 128 RC beams without stirrups from the literature. The accuracy of these models is largely due to their ability to account for the percentage of RAC, as only two of the literature (the modified ACI 318-19 (Eq. 4) and Pradhan et al. [51] (Eq. 18) sources examined in the study took this factor into account when formulating the equation for predicting shear force (strength) of concrete without stirrups.

3.3.1 Gene expression programming (GEP)

This research looked into GEP analysis using the GeneXproTools software [63]. Chromosomes are the primary building blocks of this process; they are linear, condensed, small in nature, and can be changed through genetic methods, like replication, mutation, recombination, and transposition. In addition, the chromosomes were then converted into tree expressions; this is where the selection process begins. After evaluation of the fitness levels, the chromosomes were chosen to be reproduced and altered. The chromosomes, not the tree expressions, are modified and then passed on to the next generation during reproduction [64].

Copying genetic material is not the only element of reproduction; genetic operators are needed to generate genetic diversity. Without these operators, simply replicating the genome would not create new genetic variants. The operators act randomly to decide which chromosomes will be altered and which will remain the same.

In this paper, as shown in Table 5, two various GEP models are formulated. The first one utilized the base operations (+, -, ×, and /) and the other included the square root function.

In Table 6, Eqs. 19 and 20 are developed taking into consideration all of the input variables that affect the shear force (strength) of concrete without stirrups.

Table 5 The main setting parameters and adjustments of GEP models
Table 6 GEP model equations

3.3.2 Multi-objective evolutionary polynomial regression (MOGA- EPR)

MOGA-EPR (Multi-objective evolutionary polynomial regression analysis) is an effective computational tool that applies a genetic algorithm (GA) to determine the correlation between physical input variables via regression analysis. This method is advantageous over traditional regression techniques, as it eliminates the risk of overfitting and automatically finds the most suitable correlation through a search algorithm. To utilize the MOGA-EPR, the user must specify the correlation structure, the range of exponents, and the number of terms. For further details about the MOGA-EPR, see references [65,66,67,68,69,70,71,72].

Table 7 showcases the MOGA-EPR model equation which is used to estimate the shear strength of concrete without stirrups, taking into account all the relevant input variables. This equation is represented by Eq. (21).

Table 7 MOGA- EPR model equation

3.4 Statistical indicators and measurements

A thorough examination of both new and existing analytical methods is conducted using statistical parameters such as mean absolute error (MAE), root mean squared error (RMSE), mean (µ), and coefficient of determination (R2) (Eqs. 2225). This is a common accuracy assessment approach that has been used in prior studies [73,74,75,76,77]. The MAE and RMSE values represent the perfect fit, with lower values signifying better performance. An ideal µ value should be 1.0; values greater than that suggest an overestimate of the shear strength (Vc,p) of concrete without stirrups, whereas smaller values signify an underestimate.

$$\text{M}\text{A}\text{E}= \frac{1}{n}\sum _{1}^{n}\left|{V}_{c,p}-{V}_{c,m}\right|$$
(22)
$${\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{1}^{n} {\left( {V_{{c,p}} - V_{{c,m}} } \right)^{2} } }$$
(23)
$${\text{Mean}}\,\left( \mu \right) = \frac{1}{n}\sum\limits_{1}^{n} {\left( {\frac{{V_{{c,p}} }}{{V_{{c,m}} }}} \right)}$$
(24)
$$R^{2} = \left( {\frac{{\sum\nolimits_{{i = 1}}^{n} {(V_{{c,p}} - V_{{c,p_{{average}} }} )(V_{{c,m}} - V_{{c,m_{{average}} }} )} }}{{\sqrt {\sum\nolimits_{{i = 1}}^{n} {(V_{{c,p}} - V_{{c,p_{{average}} }} )^{2} } \sum\nolimits_{{i = 1}}^{n} {(V_{{c,m}} - V_{{c,m_{{average}} }} )^{2} } } }}} \right)^{2}$$
(25)

In Eqs. 2225, the number of data points (n) utilized for the evaluation is denoted, with \({V}_{c,p}\) depicting the predicted shear force of concrete and \({V}_{c,m}\) signifying the measured (experimentally) shear force of concrete.

4 Results

Table 8 shows the results of the mean absolute error (MAE), root mean squared error (RMSE), mean (µ) and coefficient of determination (R2) of the GEP and MOGA-EPR approaches, which are calculated using the statistical indicators and measurements in Sect. 3.2. These results compare the predictions of the shear force of concrete (Vc) without stirrups by the training and testing datasets to the shear in concrete measured experimentally.

Table 8’s results show that the MAE of the developed approaches is between 9.85 and 12.64 for the training datasets, and 12.81 and 16.57 for the test datasets. RMSE values for the training datasets range from 13.36 to 17.61, and 16.05 to 25.72 for testing datasets. The mean of the datasets is 1.01 to 1.03 for the training datasets and 1.05 to 1.09 for the testing datasets. Lastly, the R2 scores for the training datasets range from 0.84 to 0.91 and 0.93 to 0.95 for the test datasets. The results from Table 8 are quite positive, with the MOGA-EPR model having the greatest R2 value out of the three models for the training dataset, and the GEP model (2) having the highest R2 for the testing dataset. All the models have mean values close to 1, suggesting that the models are performing well.

Table 9 presents the statistical metrics of all datasets for both the GEP and MOGA-EPR models. It can be seen that the R2 scores of the GEP model (2) and MOGA-EPR are the same, at 0.90, and the score of the GEP model (1) is slightly lower but still relatively close. All models have mean values that are very close to 1, demonstrating their good performance.

Figures 2 and 3 compare the predicted and measured results from the training and testing datasets for the three models. Most of the predictions are close to the ideal fit line and fall within the ± 20% error range, indicating accurate predictions.

The graph in Fig. 4 displays the comparison between the experimental shear force of concrete (Vc, experimental) and the predicted shear force of concrete (Vc, predicted) obtained by GEP and MOGA-EPR Eqs. (19–21). The mean ± standard deviation (STDV), for the ratios, is indicated in the graph. It is obvious that the majority of the shear force of concrete ratios are situated inside the range of the mean and standard deviation. The predicted shear force of concrete equations did not overestimate the force for any of the RCA replacement levels, as the experimental shear force of concrete was less than the predicted shear force of concrete (Vc, experimental/Vc, predicted ≤ 1). Also, from Fig. 4, it can be seen that 50.8% of the data samples predicted by the GEP model (1) and 55.5% of the data samples predicted by the GEP model (2) and MOGA-EPR models had a shear force ratio that was lower than 1, indicating that the equations used to predict the shear force of concrete did not overestimate the force for any of the RCA replacements.

Table 8 Statistical measures of the testing and training datasets
Table 9 Statistical accuracy comparison of the developed models using all of the datasets
Fig. 2
figure 2

Relationship between experimental and predicted shear force of concrete (Vc) using the developed models for the training dataset: a GEP model (1), b GEP model (2) and c MOGA-EPR

Fig. 3
figure 3

Relationship between experimental and predicted shear force of concrete (Vc) using the developed models for the testing dataset: a GEP model (1), b GEP model (2) and c MOGA-EPR

Fig. 4
figure 4

Relationship between experimental and predicted Shear force of concrete ratio and the % of RCA replacement using the developed models for all of the datasets: a GEP model (1), b GEP model (2) and c MOGA-EPR

5 Comparison with current codes and developed equations

This paper used GEP and MOGA-EPR methods to predict the shear force of concrete (Vc) beams without stirrups with RCA. 18 analytical equations were also evaluated for comparison. Table 9; Fig. 5 display the R2, MAE, and RMSE for each of the models and references from the highest R2 to the lowest.

The results from Table 10; Fig. 5 indicate that the R2 values range from 0.6 to 0.9, the MAE from 11.16 to 74.28 kN, and the RMSE from 15.44 to 86.71 kN. The developed prediction models by GEP and MOGA-EPR demonstrate the most accurate and precise performance in comparison to the experimental values, as indicated by the highest R2 values of 0.9. Furthermore, GEP (2) and MOGA-EPR have the lowest MAE and RMSE in comparison with the other available analytical equations. It should be noted that the GEP and MOGA-EPR models, along with the Pradhan et al. [51] and the ACI 318−19 modified equation [31], are the only approaches that take into account the RCA ratio in their equation variables.

In addition to the values of R2 mentioned in Table 10; Figs. 5 and 6 goes further by comparing the 18 analytical equations in the literature to these models, showing the cumulative frequency of the error level in percentage. It is clear from this figure that the MOGA-EPR and GEP models are on par with each other and better at predicting Vc than the other references.

Table 11; Fig. 7 show the average of the experimental to predicted shear force of concrete ratio for all of the models and equations from the highest to the lowest values. If the ratio is more than 1.0, then the model or equation is deemed conservative and overestimated [31]. The GEP and MOGA-EPR models were found to be accurate and precise, as the predicted values were not overestimated and were close to the experimentally determined values. The same conclusion can be drawn from Table 12; Fig. 8. This table and figure present the percentage of the samples of experimental and predicted shear force of concrete ratio greater than 1.0 for all of the models and equations. This is useful to determine the conservative nature of the equations from the literature and the models developed by the GEP and MOGA-EPR. If the percentage of the # of predicted (or/and) calculated values by any of the models and equations (from 1 to 18) is greater than 50%, then the model or equation is considered to be conservative and overestimating the value of the predicted shear force of concrete.

Table 10 Statistical accuracy comparison
Fig. 5
figure 5

Statistical accuracy comparison for the different models and equations for predicting the shear force of concrete (Vc): a R2 , b MAE (kN) and c MSE (kN)

Table 11 The average of experimental and predicted Shear force of concrete ratio for all of the models and equations
Fig. 6
figure 6

Comparison of the error level for different cumulative frequencies

Fig. 7
figure 7

The average of experimental and predicted Shear force of concrete ratio for all of the models and equations

Table 12 The percentage of the # of samples of experimental and predicted Shear force of concrete ratio > 1.0 for all of the models and equations
Fig. 8
figure 8

The percentage of the # of samples of experimental and predicted Shear force of concrete ratio > 1.0 for all of the models and equations

6 Conclusions

The findings of this study indicated that MOGA-EPR and GEP are effective for predicting the shear strength of concrete beams with recycled aggregate, without the presence of stirrups. Three new models were created, giving designers an uncomplicated and powerful tool to use. The new methods were more precise and showed enhanced precision in comparison to the equations already existing in codes and literature.

The results of this study demonstrate the following conclusions with taking into consideration the constraints of the study:

  1. (1)

    The three proposed models have higher accuracy than existing equations in the literature, with R2 values of up to 0.90.

  2. (2)

    The predicted results by using GEP and MOGA-EPR showed very good accuracy within the ± 20% error range.

  3. (3)

    GEP models 1 and 2 showed good accuracy with MAE of 11.43 kN and 12.74 kN, RMSE of 15.44 kN and 17.61 kN, Mean of 1.02 and 1.04, and R2 of 0.88 and 0.90 respectively.

  4. (4)

    The MOGA-EPR model demonstrated high accuracy with MAE of 11.16 kN, RMSE of 16.52, Mean of 1.02, and R2 of 0.90.

  5. (5)

    The models proposed by this study provide precise solutions and avoid overestimating shear strength (force) in concrete, contrary to existing equations from the literature.

  6. (6)

    The three models consider all input variables, including recycled coarse aggregate replacement ratio, which other existing equations do not take into account.

This investigation brings to light the proposed system’s potential future influence. This system joins a recognized soft computing technique with an artificial intelligence protocol to generate three practicable models. These models have the capabilities to be adopted on a broad scale and impact a number of industries, as they present a consistent and successful way of handling intricate matters that involve all the variables that other existing approaches do not take into account in their equations. The execution of these models could bring about significant progress in multiple areas and ultimately result in considerable advantages for practitioners and scholars. Future research should concentrate on validating and calibrating the proposed models using a variety of experimental data, extending their application to other structural elements, incorporating additional parameters, integrating them into design codes, and investigating the long-term durability of concrete with recycled aggregates and other aspects of concrete design. These efforts will improve the models’ dependability, broaden their field of application, and encourage practical adoption and integration of AI, ultimately leading to more sustainable and efficient construction processes. Furthermore, enhancing accuracy demands the testing and validation of more mix design samples through increased experimental data.