Introduction

One of the most important construction activities being carried out around the world today is the improvement of existing infrastructure. In terms of sustainable development, it is becoming necessary to upgrade old structures rather than demolish them. In many countries, design codes are continually being updated with increased load requirements that demand greater strength from structural components. Aging structures deteriorate over time as a result of environmental factors: existing concrete structures can deteriorate due to reinforcement corrosion, carbonation, freeze–thaw cycles, etc. Corrosion is one of the most common causes of degradation of reinforced concrete (RC) elements; it reduces the cross-sectional area (Acs) of the reinforcement bars, degrades their mechanical characteristics, causes the concrete surface to crack or flake, and damages the steel-to-concrete bond1. Owing to the gradual loss of steel area, the propagation of damage in the form of cracks and the eventual spalling of the concrete cover, and the deterioration of the bond between the steel reinforcement and the concrete, the bearing capacity of a corroding element decreases with corrosion time. Corrosion thus reduces the load-carrying ability of structures, which affects both structural safety and in-service performance.

Shear failure is a well-known failure mode of RC elements; it occurs rapidly and without warning, so RC elements must have sufficient shear capacity2. Over their service life, however, concrete structures are exposed to various environmental actions. Under corrosion, the small diameter of the stirrups and the brittle nature of shear failure make the shear region of an RC beam particularly vulnerable. Such members can continue to carry load, but as the damage accumulates it becomes increasingly difficult for them to sustain it. Because shear failures are so dangerous, early prediction of the shear strength of corroded members using novel techniques is essential.

A substantial amount of research has been conducted on many aspects of reinforcement corrosion, including the corrosion process, its initiation, and its undesirable impacts, such as strength reduction and the prediction of the residual strength of corroded components. The shear capacity of corroded RC beams (CRCBs) has recently been calculated using a variety of analytical models. Several formulas proposed by researchers, namely Xu and Niu3, Yu4, Huo5, Zhao and Jin6, Li et al.7, Higgins et al.8, Webster9, Xue et al.10, and Khan et al.11, have been listed in previous studies and used to calculate the shear strength of CRCBs. The shear performance of CRCBs has been studied extensively through experimentation; however, extensive testing can be costly and time-consuming. The proposed models take only a small number of variables into account, leaving out certain significant ones that define the level of corrosion and other important parameters. Because the analytical models were built on different model assumptions and different experimental databases, their accuracy and applicability vary greatly between situations. Additionally, the ability of analytical models to precisely forecast the shear capacity of degraded beams is constrained by the rapid rise in corrosion levels in RC beams. It is therefore crucial to create methodologies that take all governing factors into account. It is undoubtedly difficult to develop an advanced shear prediction model covering all the important factors, since the shear capacity of CRCBs depends on a number of variables that are difficult to characterize quantitatively without making several assumptions.

Modern machine learning (ML) algorithms may offer a better solution to these concerns because they are effective at handling complex problems involving many different factors without making any assumptions, and they can be used to construct a shear capacity prediction model. Numerous services and sectors have seen significant productivity gains due to ML. Although still in its infancy in the construction sector, its application has grown recently to address several difficulties, including concrete technology12,13,14,15,16 and concrete durability17,18,19. Various other applications of ML algorithms in the field of civil and environmental engineering can be found in the published research20,21,22,23,24. The following is a brief description of selected research related to the use of Extreme Gradient Boosting (XGBoost) and other ML algorithms (classification and regression tree (CART), adaptive boosting (AdaBoost), gradient boosted decision trees (GBDT), support vector regression (SVR), random forest (RF), extremely random trees (ERT), artificial neural network (ANN), gene expression programming (GEP), kernel ridge regression (KRR), K-nearest neighbor (KNN), Gaussian process regression (GPR), multivariate adaptive regression splines (MARS), support vector machine (SVM), linear regression (LR), decision tree (DT), ensemble tree (ET), and evolutionary polynomial regression (EPR)) in concrete technology and RC beams:

Wakjira et al.25 investigated the flexural capacity of FRP-reinforced RC beams. Six input parameters were used to develop the ML models, and five ML algorithms (CART, AdaBoost, GBDT, XGBoost, and Super-learner) were used to develop the most appropriate predictive model. The super-learner algorithm provided the highest predictive performance among all the models studied, with the lowest RMSE and MAPE and the highest R2. In another study, the shear capacity of shear-critical RC beams reinforced with novel composites was investigated by Wakjira et al.26 using ML techniques. The prediction models were built using SVR, CART, RF, extremely random trees, GBDT, and XGBoost algorithms. According to the findings of the study, the XGBoost model performed well compared to the other ML techniques and existing guidelines. Uddin et al.27 used ANN, RF, GEP, and GBDT to predict the shear strength of RC beams; the GBDT algorithm performed better than the ANN, RF, and GEP algorithms. In another study, Wakjira et al.28 investigated the flexural capacity of FRCM-reinforced RC beams using KNN, KRR, SVR, CART, RF, GBDT, and XGBoost methods. The XGBoost model showed good performance, with the highest R2-value of 99.3% and the lowest MAE and MAPE. The proposed model has better predictive power and robustness, as shown by comparing its performance with that of existing analytical models. Based on the above-mentioned research, the performance of the XGBoost algorithm was higher than that of the other ML algorithms.

Badra et al.29 predicted the punching shear strength of FRP-reinforced concrete slabs without shear reinforcement using ANN and SVM algorithms. The RMSE values of the ACI, CSA, JSCE, ANN, and SVM models are 3.06 kN, 1.70 kN, 1.99 kN, 1.10 kN, and 1.32 kN, respectively; comparing these performance metrics, the ANN was superior. Deifalla and Salem30 investigated the torsional strength of externally bonded FRP-reinforced concrete beams using ET, GPR, and ANN models. The broad neural network model was the most effective model for predicting the torsion strength of RC beams strengthened with EB-FRP; its R2, RMSE, and MAE values were 0.93, 16.634 kN, and 0.98 kN, respectively, the best performance reported, although it required the most training time. Mohammed and Ismail31 used MARS, XGBoost, and SVM models to predict the shear strength of RC beams. According to the research results, the developed MARS and XGBoost models show potential for simulating the shear strength of RC beams, and all the beam geometry and concrete property criteria used were important for building the prediction model. Numerically, the MARS model achieved the lowest RMSE (89.96 kN). Salem and Deifalla32 evaluated the strength of slab-column connections with FRPs using ML algorithms (LR, DT, SVM, ET, and GPR). The ideal hyper-parameters of the ML-based algorithms were selected during the training process using a grid search with 15-fold cross-validation. Among all the applied ML models, the ensemble boosted model was found to be the most trustworthy and accurate, with R2, RMSE, and MAE of 0.97, 71.963 kN, and 43.452 kN for the test dataset, respectively. Ebid and Deifalla33 used ML procedures to predict the punching shear capacity of lightweight concrete slabs. The column dimensions, concrete density, slab effective depth, CS, yield strength of steel, and flexural reinforcement ratio were considered as input parameters. ANN and EPR shared the highest prediction accuracy (73.9% and 73.6%, respectively), while the GP model had the lowest (67.6%). Kaveh et al.34 used an XGBoost framework to calculate the shear capacity of FRP-strengthened concrete beams; the correlation coefficient of the developed XGBoost model was 0.94, higher than that of all the empirical models.

The aim of this article is to estimate the shear strength of CRCBs using ANN, ANFIS, DT, and XGBoost algorithms. To the best of the authors' knowledge, the DT, ANFIS, and XGBoost algorithms have not previously been used to estimate the shear strength of CRCBs. The influence of the individual input parameters on the predicted shear capacity of CRCBs is also determined.

Analytical models to estimate the shear capacity of CRCBs

Xu and Niu’s model

The limit equilibrium theory serves as the foundation for the formula used by Xu and Niu3 to determine the shear strength of CRCBs. The impact of reinforcing steel corrosion on shear capacity is taken into account by introducing the shear span-to-depth ratio (a/d) as well as the reduction in the Acs and yield strength of the stirrups due to corrosion. The formulation is expressed in Eqs. (1)–(4):

$${V}_{u}= {V}_{c}+{V}_{s}=\xi \left(\lambda ,{\eta }_{w,sn}\right)\times \left[\frac{\left(0.08+4{\rho }_{l}\right)}{\left(\lambda -0.3\right)}\right]\times {f}_{ck}b{h}_{o}+\alpha \left({\eta }_{w,sn}\right)\times \left(0.25+0.4\lambda \right){A}_{vc}{f}_{yv}\frac{{h}_{o}}{s}$$
(1)
$$\xi \left(\lambda ,{\eta }_{w,sn}\right)=\left\{\begin{array}{l}1, {\eta }_{w,sn}\le {\eta }_{cr,sn} \\ {\left(\frac{{\eta }_{w,sn}}{{\eta }_{cr,sn}}\right)}^{0.069\lambda -0.43}, {\eta }_{w,sn}>{\eta }_{cr,sn}\end{array}\right.$$
(2)
$${\eta }_{cr,sn}=10.4\frac{{c}_{v}}{{\phi }_{v}^{2}}+\frac{{f}_{cu,150}}{{\phi }_{v}}$$
(3)
$$\alpha \left({\eta }_{w,sn}\right)=1-1.077{\eta }_{w,sn}$$
(4)

where, \({V}_{u}\), \({V}_{c}\), and \({V}_{s}\) represent the shear resistance of the RC beam, the concrete contribution, and the stirrup contribution, respectively. \(\xi \left(\lambda ,{\eta }_{w,sn}\right)\) and \(\alpha \left({\eta }_{w,sn}\right)\) are reduction factors. \(\lambda \), \({f}_{ck}\), \({f}_{yv}\), \({A}_{vc}\), \({f}_{cu,150}\), and \({\phi }_{v}\) are the a/d, CS of concrete, stirrup yield strength, residual area of the stirrups, CS of concrete (150 mm cube), and diameter of the stirrups, respectively. \({\rho }_{l}\), \({\eta }_{w,sn}\), and \({\eta }_{cr,sn}\) are the percentage of longitudinal steel, the stirrup corrosion ratio, and the stirrup corrosion ratio at crack initiation, respectively. b, \({h}_{o}\), \(s\), and \({c}_{v}\) are the beam width, effective depth, stirrup spacing, and concrete cover of the stirrups, respectively.
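For readers who wish to apply the model numerically, the following sketch transcribes Eqs. (1)–(4) into Python. Consistent units (N, mm, MPa) are assumed, and the function name and argument order are ours, not from the original paper:

```python
def xu_niu_shear(lam, rho_l, f_ck, b, h0, A_vc, f_yv, s,
                 eta_w_sn, c_v, phi_v, f_cu150):
    """Shear capacity of a CRCB per Xu and Niu's model, Eqs. (1)-(4)."""
    # Eq. (3): stirrup corrosion ratio at crack initiation
    eta_cr_sn = 10.4 * c_v / phi_v ** 2 + f_cu150 / phi_v
    # Eq. (2): reduction factor on the concrete term
    if eta_w_sn <= eta_cr_sn:
        xi = 1.0
    else:
        xi = (eta_w_sn / eta_cr_sn) ** (0.069 * lam - 0.43)
    alpha = 1 - 1.077 * eta_w_sn        # Eq. (4): stirrup-term reduction
    # Eq. (1): concrete contribution Vc plus stirrup contribution Vs
    Vc = xi * (0.08 + 4 * rho_l) / (lam - 0.3) * f_ck * b * h0
    Vs = alpha * (0.25 + 0.4 * lam) * A_vc * f_yv * h0 / s
    return Vc + Vs
```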

Yu’s model

Yu4 suggested a small modification to the GB50010-200235 standard guideline, proposing a new coefficient to capture the impact of corrosion on the longitudinal steel. The model also accounts for how corrosion affects the Acs and yield strength of the stirrups4. The formulation to forecast the shear capacity of CRCBs is expressed in Eqs. (5)–(7):

$${V}_{u}=\frac{1.75\varphi {f}_{t}b{h}_{o}}{\left(\lambda +1\right)}+\frac{1.25{f}_{yc}{A}_{vc}{h}_{o}}{s}$$
(5)
$$\varphi = -0.0354{\eta }_{l,sn}^{2}+0.6256{\eta }_{l,sn}-1.2349$$
(6)
$${f}_{yc}=\frac{\left(0.985-1.028{\eta }_{w,sn}\right){f}_{yv}}{\left(1-{\eta }_{w,sn}\right)}$$
(7)

where, \(\varphi \), \({f}_{yc}\), \({f}_{t}\), and \({\eta }_{l,sn}\) are the reduction factor, the yield strength of the corroded stirrups, the tensile strength of concrete, and the section loss ratio of the main steel, respectively.

Huo’s model

Huo5 introduced two reduction factors, based on the analytical shear strength model for un-corroded RC beams proposed by the China Academy of Building Research in 198536, to account for corrosion of the stirrups and longitudinal steel of CRCBs. Both reduction factors were determined by regression analysis of experimental tests on CRCBs5. The formulation to estimate the shear strength of CRCBs is given in Eqs. (8)–(10):

$${V}_{u}=\varphi {f}_{ck}b{h}_{o}\left[\frac{0.08}{\left(\lambda -0.3\right)}+\frac{100{\rho }_{l}}{\left(\lambda . {f}_{ck}\right)}\right]+\frac{\alpha \left(0.4+0.3\lambda \right){A}_{v}{f}_{yv}{h}_{o}}{s}$$
(8)
$$\varphi =\left\{\begin{array}{l}1.0, {\eta }_{l,wt}\le 5\% \\ 1.098-1.96{\eta }_{l,wt}, { \eta }_{l,wt}>5\%\end{array}\right.$$
(9)
$$\alpha =1-1.059{\eta }_{w,wt}$$
(10)

where, \(\varphi \) and \(\alpha \) are the reduction factors expressed in terms of the weight-loss corrosion ratios of the longitudinal steel and stirrups (\({\eta }_{l,wt}\) and \({\eta }_{w,wt}\)), and \({A}_{v}\) is the un-corroded area of the stirrup.

Zhao and Jin’s model

A methodology for estimating the shear capacity of un-corroded RC beams under two-point loading was provided by Zararis37. Zhao and Jin6 suggested a variant of Zararis's model to calculate the shear strength of CRCBs, introducing a reduction factor that lumps together all of the effects of corrosion on the stirrups. The formulation to forecast the shear capacity of CRCBs is expressed in Eqs. (11)–(13):

$${V}_{u}= \alpha {V}_{u0}= \alpha b{h}_{o}\left[\frac{{C}_{s}{f}_{cyl,150}\left(1-\frac{0.5{C}_{s}}{{h}_{o}}\right)}{{h}_{o}}+\frac{0.5{\rho }_{v}{f}_{yv}{\left(1-\frac{{C}_{s}}{{h}_{o}}\right)}^{2}{\left(\frac{a}{{h}_{o}}\right)}^{2}}{\left(\frac{a}{{h}_{o}}\right)}\right]$$
(11)
$$\alpha =\left\{\begin{array}{l}1.0, {\eta }_{w,sn}\le 10\%\\ 1.17-1.7{\eta }_{w,sn}, {\eta }_{w,sn}>10\%\end{array}\right.$$
(12)
$$\frac{{C}_{s}}{{h}_{o}}=\frac{1+0.27\left[{ \left(\frac{a}{{h}_{o}}\right)}^{2}+\frac{{\rho }_{v}}{{\rho }_{l}}\right]}{2\left[1+{\left(\frac{a}{{h}_{o}}\right)}^{2}+\frac{{\rho }_{v}}{{\rho }_{l}}\right]}\left[\sqrt{{\left(\frac{600{\rho }_{l}}{{f}_{cyl,150}}\right)}^{2}+4\left(\frac{600{\rho }_{l}}{{f}_{cyl,150}}\right)}-\frac{600{\rho }_{l}}{{f}_{cyl,150}}\right]$$
(13)

where, \({V}_{u0}\), \(\alpha \), \(a\), \({C}_{s}\), \({\rho }_{v}\), and \({f}_{cyl,150}\) are the ultimate shear resistance of the un-corroded beam, the reduction factor, the shear span, the compression zone depth, the percentage of stirrup steel, and the CS of concrete (150 × 300 mm specimens), respectively.

Li et al.’s model

Li et al.7 also proposed an equation, based on the Chinese guideline GB50010-200235, for estimating the shear capacity of CRCBs. The equation accounts for the change in height and width of the corrosion-damaged cross-section; additionally, the corrosion and yield strength of the stirrups are taken into consideration. The formulation to evaluate the shear capacity of CRCBs is expressed in Eqs. (14)–(17):

$${V}_{u}=\frac{1.75{f}_{t}{b}_{c}{h}_{oc}}{\left(\lambda +1\right)}+\frac{{f}_{yc}{A}_{vc}{h}_{oc}}{s}$$
(14)
$${f}_{yc}=\frac{\left(1-1.1219{\eta }_{w,wt}\right){f}_{yv}}{\left(1-{\eta }_{w,wt}\right)}$$
(15)
$${b}_{c}=b-{C}_{v1}-{C}_{v2}$$
(16)
$${h}_{0c}={h}_{0}-{C}_{sc}$$
(17)

where, \({C}_{v1}\) and \({C}_{v2}\) are the concrete covers on the two sides across the cross-sectional width, and \({C}_{sc}\) is the corresponding reduction in the effective depth.

Lu et al.’s model

Lu et al.38 incorporated the effects of the stirrups, the corrosion level of the longitudinal reinforcement, and the a/d in estimating the shear strength of CRCBs, since overestimating the residual shear strength of CRCBs with corroded stirrups and longitudinal reinforcement would be risky. In addition, both diagonal tension failure and shear-compression failure are considered, and the larger value is taken for the final calculation. The shear strength of CRCBs is given in Eqs. (18)–(23):

$${V}_{u}=\phi {V}_{c}+{V}_{s}$$
(18)

where, \(\phi \) is a reduction coefficient linked with a/d.

Shear resistance of concrete (\({V}_{c}\)) can be calculated using Eq. (19)39:

$${V}_{c}=Max\left({V}_{c1},{V}_{c2}\right)$$
(19)

where, \({V}_{c1}\) is the shear resistance of concrete at diagonal tension failure and is expressed in Eq. (20)40:

$${V}_{c1}=0.2\sqrt[3]{100{f}_{cyl,150}{\rho }_{lc}} \sqrt[4]{\frac{{10}^{3}}{{h}_{0}}}\left(0.75+\frac{1.4{h}_{0}}{a}\right)b{h}_{0}$$
(20)

where, \({\rho }_{lc}\) is the percentage of corroded longitudinal steel, and \({V}_{c2}\) represents the shear resistance at shear-compression failure, determined by Eq. (21)41:

$${V}_{c2}=\frac{0.24\sqrt[3]{{f}_{cyl,150}^{2}}\left(1+\sqrt{100{\rho }_{lc}}\right)\left(1+\frac{3.33r}{{h}_{0}}\right)b{h}_{0}}{\left[1+{\left(\frac{a}{{h}_{0}}\right)}^{2}\right]}$$
(21)

where, \(r\) is the width of the loading plate (87.2 mm).

Shear resistance of the stirrups (\({V}_{s}\)) is expressed in Eq. (22)10.

$${V}_{s}=\frac{{f}_{yv}{A}_{vc}j{h}_{0}}{s}$$
(22)

where, \(j\) is a coefficient and is generally taken as 1/1.1510.

$$\phi =\left\{\begin{array}{c}0.008{e}^{\left(-0.122\lambda \right)}-0.003{\eta }_{w,sn}+1.01, \lambda <2.5\\ 0.1{e}^{\left(-0.122\lambda \right)}-0.003{\eta }_{w,sn}+1.38, \lambda \ge 2.5\end{array}\right.$$
(23)
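As a second worked example, the multi-branch model of Lu et al. (Eqs. (18)–(23)) can be coded in the same way. This is a sketch under the same unit assumptions; the defaults r = 87.2 mm and j = 1/1.15 follow the text above:

```python
import math

def lu_shear(lam, f_cyl150, rho_lc, b, h0, a, f_yv, A_vc, s,
             eta_w_sn, r=87.2, j=1/1.15):
    """Residual shear capacity per Lu et al.'s model, Eqs. (18)-(23)."""
    # Eq. (20): diagonal-tension resistance of concrete
    Vc1 = (0.2 * (100 * f_cyl150 * rho_lc) ** (1/3)
           * (1e3 / h0) ** 0.25 * (0.75 + 1.4 * h0 / a) * b * h0)
    # Eq. (21): shear-compression resistance of concrete
    Vc2 = (0.24 * f_cyl150 ** (2/3) * (1 + math.sqrt(100 * rho_lc))
           * (1 + 3.33 * r / h0) * b * h0 / (1 + (a / h0) ** 2))
    Vc = max(Vc1, Vc2)                  # Eq. (19): governing concrete term
    Vs = f_yv * A_vc * j * h0 / s       # Eq. (22): stirrup contribution
    # Eq. (23): reduction coefficient linked with a/d
    if lam < 2.5:
        phi = 0.008 * math.exp(-0.122 * lam) - 0.003 * eta_w_sn + 1.01
    else:
        phi = 0.1 * math.exp(-0.122 * lam) - 0.003 * eta_w_sn + 1.38
    return phi * Vc + Vs                # Eq. (18)
```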

Establishment of machine learning models

Database setup

A literature survey was conducted to collect experimental data on the shear capacity of CRCBs, and 140 datasets were collected from the reviewed literature6,10,42,43,44,45,46,47. The parameters that affect the shear strength of CRCBs are: (i) width of the beam (b), (ii) effective depth of the beam (d), (iii) CS of concrete (fck), (iv) yield strength of reinforcement (fy), (v) percentage of longitudinal reinforcement (\({\rho }_{l}\)), (vi) percentage of stirrup reinforcement (\({\rho }_{v}\)), (vii) yield strength of stirrups (fyv), (viii) stirrup spacing (s), (ix) a/d, (x) corrosion degree of the longitudinal reinforcement (\({\eta }_{l}\)), and (xi) corrosion degree of the stirrups (\({\eta }_{w}\)). The same parameters have been used to develop the ML models. The complete methodology followed in this work is depicted in Fig. 1 and explained in the subsequent sections. Table 1 lists the statistical properties of the amassed database, and Fig. 2 displays the distribution of the collected parameters.

Figure 1. Methodology chart.

Table 1 Statistical parameters.
Figure 2. Summary of statistical analysis of the CRCBs.

To determine the relationship between the input parameters and shear capacity, and to show the dot distribution, a marginal plot is used, as shown in Fig. 3. A marginal plot is a scatterplot with histograms, boxplots, or dot-plots in the margins of the x- and y-axes. Figure 3a–k shows the marginal plots of all the input parameters, i.e., b, d, fck, fy, \({\rho }_{l}\), \({\rho }_{v}\), fyv, s, \(\lambda \), \({\eta }_{l}\), and \({\eta }_{w}\), respectively.

Figure 3. Marginal box chart of input parameters with shear capacity of CRCBs: (a) b, (b) d, (c) fck, (d) fy, (e) \({\rho }_{l}\), (f) \({\rho }_{v}\), (g) fyv, (h) s, (i) \(\lambda \), (j) \({\eta }_{l}\), and (k) \({\eta }_{w}\).

Data preparation

In all ML algorithms, it is necessary to standardize the dataset into a certain form; standardization increases the efficiency and accuracy of ML algorithms. The commonly used standardization ranges are: (i) 0 to 1, (ii) −1 to +1, and (iii) 0.1 to 0.9. In this study, the −1 to +1 range has been adopted to normalize the collected parameters. The formulation used for normalization is expressed in Eq. (24)48:

$${Z}_{normalized}=\left[2\times \frac{\left(z-{z}_{min}\right)}{\left({z}_{max}- {z}_{min}\right)}\right]-1$$
(24)

where, \({Z}_{normalized}\) is the normalized outcome, z is the value to be standardized in the selected dataset, and \({z}_{min}\) and \({z}_{max}\) are the minimum and maximum values in the selected dataset, respectively.

After normalization, the dataset was divided for the subsequent training and testing phases. In the ANN, ANFIS, DT, and XGBoost models, the dataset has been split into training and testing sets with percentages of 70% and 30%, respectively (Fig. 1).
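A minimal sketch of the normalization of Eq. (24) and the 70/30 split follows, assuming the database is loaded as NumPy arrays; the seed is arbitrary and ours:

```python
import numpy as np
from sklearn.model_selection import train_test_split

def normalize(z):
    """Scale each column of z to [-1, +1] per Eq. (24)."""
    z_min, z_max = z.min(axis=0), z.max(axis=0)
    return 2 * (z - z_min) / (z_max - z_min) - 1

# X (140 x 11 inputs) and y (140 shear capacities) are assumed to be
# loaded from the collected database beforehand.
# X_train, X_test, y_train, y_test = train_test_split(
#     normalize(X), normalize(y), test_size=0.30, random_state=42)
```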

Model evaluation

To estimate the performance of the analytical and ML models, the following performance metrics are used: correlation coefficient (R), root mean square error (RMSE), Nash–Sutcliffe efficiency index (NSEI), mean absolute percentage error (MAPE), and mean absolute error (MAE). In addition, a performance index (Pi) and an over-fitting analysis (OFA) have been carried out to check the fitting of the ML algorithms. The formulation of all the performance metrics is given in Table 249,50.

Table 2 Description of performance metrics.
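The common formulations of these metrics can be computed as below; the exact expressions in Table 2 should be consulted for any minor differences (the a20-index here is taken as the fraction of predictions within ±20% of the measured value):

```python
import numpy as np

def evaluate(y_true, y_pred):
    """Common formulations of the Table 2 performance metrics."""
    err = y_true - y_pred
    ratio = y_pred / y_true
    return {
        'R': np.corrcoef(y_true, y_pred)[0, 1],
        'RMSE': np.sqrt(np.mean(err ** 2)),
        'MAE': np.mean(np.abs(err)),
        'MAPE': 100 * np.mean(np.abs(err / y_true)),
        'NSEI': 1 - np.sum(err ** 2) / np.sum((y_true - y_true.mean()) ** 2),
        'a20': np.mean((ratio >= 0.8) & (ratio <= 1.2)),
    }
```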

Artificial neural network

The study of biological neural connections served as inspiration for ANN research. An ANN is a "black box" housing a massively parallel system with numerous processing components that are very good at information mining. The procedure inside the box enables careful selection of the variables and a detailed investigation of their relationships51,52. Among the different ANN algorithms, the back-propagation network (BPN) is frequently used to solve engineering problems, employing the gradient descent technique to reduce errors. A typical BPN contains three layers: (a) an input layer (IL), (b) a hidden layer (HL), and (c) an output layer (OL), as presented in Fig. 4a. The HL neurons are connected to each input neuron, each of which represents an individual input parameter. Depending on the kind of operation (linear or non-linear), these neurons sum the weighted values or apply an activation function after receiving information from the IL to produce the desired output. An extra node, known as the bias, is present in both the HL and the OL.

Figure 4. Architecture of ML models: (a) ANN, (b) XGBoost, and (c) ANFIS.

The three layers of neurons are linked together by connections known as weights. To estimate the output of an ANN for a specific pattern, the biases and weights must be adequate. Each neuron receives a numerical input from the preceding layer, and its relevance is determined by the weighting variables. The shear strength of the CRCBs can be evaluated using Eq. (25):

$${V}_{u}= {f}_{\left(H-O\right)}\left(\sum_{i=1}^{N}{W}_{i\left(H-O\right)}{N}_{i}+{B}_{\left(H-O\right)}\right)$$
(25)

where, \({f}_{\left(H-O\right)}\) is the OL activation function as expressed in Eq. (26), \({W}_{i\left(H-O\right)}\) are the OL weights, \({N}_{i}\) are the input variables and can be obtained from Eq. (27), and \({B}_{\left(H-O\right)}\) is the output bias.

$${f}_{\left(H-O\right)}=purelin=f\left(x\right)=x$$
(26)
$${N}_{i}={f}_{\left(I-H\right)}\left(\sum_{i=1}^{N}{W}_{i\left(I-H\right)}{X}_{i}+{B}_{\left(I-H\right)}\right)$$
(27)

where, \({f}_{\left(I-H\right)}\) is activation function in the HL as expressed in Eq. (28), \({W}_{i\left(I-H\right)}\) is the HL weights, \({X}_{i}\) is the normalized input values, and \({B}_{\left(I-H\right)}\) is the HL biases.

$${f}_{\left(I-H\right)}= TanSig= \frac{2}{1+{e}^{-2z}}-1$$
(28)

The ANN has been trained with three to eleven hidden neurons, and a trial-and-error process has been adopted to select the optimum number on the basis of the R-value and MSE value, as shown in Fig. 5. In the range of three to eleven neurons, ten neurons give the highest R-value and the lowest MSE value, and the overall evaluation of ten neurons is acceptable. The correlation coefficients and MSE values of the training phase, testing phase, and whole dataset are shown in Fig. 5.
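The original BPN was trained in a MATLAB-style environment (tansig/purelin); a rough scikit-learn analogue of this trial-and-error neuron search, reusing the split arrays from the data-preparation sketch, might look like the following (solver settings and seed are assumptions):

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.metrics import mean_squared_error

best = None
for n in range(3, 12):                       # three to eleven hidden neurons
    ann = MLPRegressor(hidden_layer_sizes=(n,), activation='tanh',
                       max_iter=5000, random_state=0)
    ann.fit(X_train, y_train)
    y_hat = ann.predict(X_test)
    mse = mean_squared_error(y_test, y_hat)
    R = np.corrcoef(y_test, y_hat)[0, 1]
    if best is None or mse < best[1]:        # keep the lowest-MSE network
        best = (n, mse, R)
print('optimum number of neurons:', best[0])
```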

Figure 5. Selection of the optimum neuron.

The parrot-green rectangular box marks the location of the best neuron. The formulation to predict the shear strength of CRCBs is expressed in Eqs. (29) and (30).

$$\left[\begin{array}{c}{d}_{1}\\ {d}_{2}\\ {d}_{3}\\ {d}_{4}\\ {d}_{5}\\ {d}_{6}\\ {d}_{7}\\ {d}_{8}\\ {d}_{9}\\ {d}_{10}\end{array}\right]=tansig\left[\left(\begin{array}{ccccccccccc}-0.8071& 0.0620& -0.9229& 0.2073& 0.5096& -1.3520& 1.9673& 0.0969& -0.0353& 1.2739& 0.4922\\ 0.4208& -0.1356& -0.3379& -0.1556& 0.2532& -0.4656& 0.8207& -0.7836& 2.0054& -0.1553& -0.2239\\ -1.9789& -0.8337& 0.5156& -0.3473& 0.4251& -0.5391& -0.6441& 0.7573& -1.2279& 0.4585& 0.5222\\ 0.0980& -0.0709& 0.2388& 0.7224& -0.7378& -0.3546& -0.3905& 1.6784& -2.9510& 1.0597& -1.4663\\ -2.2629& -0.3706& 0.9284& 0.5114& 0.7825& -0.1382& -1.4820& 1.7856& -0.1469& -0.6652& 0.5620\\ 1.7002& -0.0002& -0.2672& 1.5503& 1.0243& 0.1634& 0.3901& 0.3129& 1.9112& -0.4058& 0.8284\\ 1.5318& -0.9390& 0.0097& -0.3094& -0.9207& 0.9510& -0.7813& 0.2750& -1.0490& -0.9694& 2.0092\\ 0.3404& -0.2200& 0.7990& -1.7213& 0.6330& 0.8612& -0.2832& -0.8524& -2.2531& -1.7455& 1.1284\\ 0.2358& 0.8619& 0.2756& 0.4233& 1.1082& -1.8456& 0.5682& -0.0741& 2.7171& 1.8584& -0.6170\\ -0.9259& -0.4344& -0.9707& 0.4044& 0.0502& -0.2438& 0.6536& 0.0834& -0.9066& -1.6718& -0.1058\end{array}\right)\times \left(\begin{array}{c}b\\ d\\ {f}_{ck}\\ {f}_{y}\\ {\rho }_{l}\\ {\rho }_{v}\\ {f}_{yv}\\ s\\ \lambda \\ {\eta }_{l}\\ {\eta }_{w}\end{array}\right)\right]+\left[\begin{array}{c}-1.9573\\ -1.2184\\ 0.9513\\ -1.8569\\ 1.7187\\ 0.4107\\ -1.7598\\ -1.3854\\ 1.0573\\ -1.9978\end{array}\right]$$
(29)
$${V}_{u}=-0.1885{d}_{1}-0.4275{d}_{2}+0.4295{d}_{3}+0.5789{d}_{4}-0.7536{d}_{5}+0.9335{d}_{6}-0.4282{d}_{7}+0.5503{d}_{8}+0.4771{d}_{9}+0.4468{d}_{10}-0.3566$$
(30)

The values of \({d}_{1}\) to \({d}_{10}\) can be calculated using Eq. (29).
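Since tansig(z) = 2/(1 + e−2z) − 1 is identical to tanh(z), the trained network of Eqs. (25)–(30) reduces to two lines of NumPy. The weight and bias arrays are those printed in Eqs. (29) and (30); the function name is ours:

```python
import numpy as np

def ann_predict(x_norm, W_ih, B_ih, w_ho, b_ho):
    """Forward pass of Eqs. (25)-(30). x_norm is the normalized 11-input
    vector; W_ih (10x11) and B_ih (10,) come from Eq. (29); w_ho (10,) and
    b_ho (scalar) from Eq. (30)."""
    d = np.tanh(W_ih @ x_norm + B_ih)   # hidden outputs d1..d10, Eq. (29)
    return w_ho @ d + b_ho              # normalized Vu, Eq. (30); invert
                                        # Eq. (24) to recover kN afterwards
```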

Adaptive neuro-fuzzy inference system

ANFIS is a hybrid neuro-fuzzy network used to simulate complex systems. The ANFIS model essentially combines a fuzzy inference system (FIS) of the Takagi–Sugeno rule type with an ANN. An adaptive, feed-forward network derives fuzzy rules from the inputs using the ANFIS technique. A hybrid learning approach adjusts the fuzzy membership function (MF) parameters and looks for connections between the inputs and outputs based on expert-system knowledge. The basic architecture of the ANFIS model is presented in Fig. 4c. The ANFIS structure consists of five layers, namely the "fuzzy layer", "product layer", "normalized layer", "de-fuzzy layer", and "total OL" (Fig. 4c). The formulation and complete description of each layer are available in the literature53,54,55.

Layer 1: All nodes in this layer are adaptive nodes. MFs like the Gaussian MF and generalized bell MF are employed as node functions.

Layer 2: Each node output in this layer displays the firing rate of a rule.

Layer 3: The normalized firing strength of each rule is represented by each node.

Layer 4: Each node in this layer is adaptive and has a node function that describes how the rules contributed to the final output.

Layer 5: The sum of all the rules outputs is computed by a single node.

Either the subtractive clustering approach or the grid partitioning method can be used to choose the initial fuzzy model based on the specified fuzzy rules; the subtractive clustering approach is adopted in the development of the ANFIS model here. The cluster estimation approach facilitates locating the cluster centres of the input–output data pairs. This in turn aids in identifying the rules dispersed across the input–output space, as each cluster centre denotes the existence of a rule. Additionally, it helps determine the initial values of the premise parameters. This is crucial because, during neural network training, a starting value very near the ultimate value forces the model to converge quickly to that value56. In this clustering approach, the potentials of all the input and output data points are determined using the Euclidean distances between them and the other data points.

In the subtractive clustering approach, the squash factor, reject ratio, and accept ratio are taken as constants with values of 1.25, 0.5, and 0.15, respectively. The cluster radius (r) is varied from 0.9 to 0.2, and the best value is chosen based on the performance indicators: within this range, a radius of 0.45 gives the highest R-value and the lowest RMSE, MAPE, and MAE values, and its overall evaluation is acceptable. In Fig. 6, the performance at each cluster radius is shown together with the number of rules (n). The rules and MFs are shown in Figs. 7 and 8, respectively.

Figure 6. Selection of the optimum cluster centre (r) based on R, RMSE, MAPE, and MAE values.

Figure 7. Rules of the established ANFIS model.

Figure 8. MFs of the selected ANFIS model: (a) b, (b) d, (c) fck, (d) fy, (e) \({\rho }_{l}\), (f) \({\rho }_{v}\), (g) fyv, (h) s, (i) \(\lambda \), (j) \({\eta }_{l}\), and (k) \({\eta }_{w}\).

The shear prediction using the ANFIS model is expressed in Eq. (31).

$${V}_{u,pred.}= \frac{\sum_{i=1}^{n}{W}_{i}{Y}_{i}}{{\sum }_{i=1}^{n}{W}_{i}}$$
(31)

The values \({W}_{i}\) and \({Y}_{i}\) are expressed in Eqs. (32) and (33), respectively.

$${W}_{i}= \left[exp\left(-\frac{1}{2}{\left(\frac{b- {c}_{1}}{{\sigma }_{1}}\right)}^{2}\right)\right] \times \left[exp\left(-\frac{1}{2}{\left(\frac{d- {c}_{2}}{{\sigma }_{2}}\right)}^{2}\right)\right] \times \left[exp\left(-\frac{1}{2}{\left(\frac{{f}_{ck}- {c}_{3}}{{\sigma }_{3}}\right)}^{2}\right)\right] \times \left[exp\left(-\frac{1}{2}{\left(\frac{{f}_{y}- {c}_{4}}{{\sigma }_{4}}\right)}^{2}\right)\right]\times \left[exp\left(-\frac{1}{2}{\left(\frac{{\rho }_{l}- {c}_{5}}{{\sigma }_{5}}\right)}^{2}\right)\right] \times \left[exp\left(-\frac{1}{2}{\left(\frac{{\rho }_{v}- {c}_{6}}{{\sigma }_{6}}\right)}^{2}\right)\right] \times \left[exp\left(-\frac{1}{2}{\left(\frac{{f}_{yv}- {c}_{7}}{{\sigma }_{7}}\right)}^{2}\right)\right] \times \left[exp\left(-\frac{1}{2}{\left(\frac{s- {c}_{8}}{{\sigma }_{8}}\right)}^{2}\right)\right]\times \left[exp\left(-\frac{1}{2}{\left(\frac{\lambda - {c}_{9}}{{\sigma }_{9}}\right)}^{2}\right)\right]\times \left[exp\left(-\frac{1}{2}{\left(\frac{{\eta }_{l}- {c}_{10}}{{\sigma }_{10}}\right)}^{2}\right)\right]\times \left[exp\left(-\frac{1}{2}{\left(\frac{{\eta }_{w}- {c}_{11}}{{\sigma }_{11}}\right)}^{2}\right)\right]$$
(32)

where, \(b\), \(d\), \({f}_{ck}\), \({f}_{y}\), \({\rho }_{l}\), \({\rho }_{v}\), \({f}_{yv}\), \(s\), \(\lambda \), \({\eta }_{l}\), and \({\eta }_{w}\) are the input variables (normalized values), and \(\sigma \) and c are the Gaussian MF parameters.

$${Y}_{i}=\boldsymbol{ }{k}_{1} b+ {k}_{2} d+ {k}_{3} {f}_{ck}+{k}_{4} {f}_{y}+ {k}_{5} {\rho }_{l}+ {k}_{6} {\rho }_{v}+ {k}_{7} {f}_{yv}+{k}_{8} s+ {k}_{9} \lambda + {k}_{10} {\eta }_{l}+ {k}_{11} {\eta }_{w}+{k}_{12}$$
(33)

The values of k1 to k12 are given in Table 3, and the c and \(\sigma \) values of the input parameters are given in Table 4.

Table 3 Coefficients of membership cluster plot (normalized values).
Table 4 Parameters of Gaussian MF (normalized values).
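A compact sketch of the Sugeno inference in Eqs. (31)–(33) follows, assuming the premise parameters of Table 4 and the consequent coefficients of Table 3 are loaded as arrays (the array shapes are our assumption):

```python
import numpy as np

def anfis_predict(x_norm, c, sigma, k):
    """Sugeno inference of Eqs. (31)-(33). x_norm: normalized inputs (11,);
    c, sigma: Gaussian MF centres/widths per rule, shape (n, 11), Table 4;
    k: consequent coefficients per rule, shape (n, 12), Table 3 (k12 last)."""
    # Eq. (32): rule firing strengths as products of Gaussian memberships
    W = np.prod(np.exp(-0.5 * ((x_norm - c) / sigma) ** 2), axis=1)
    # Eq. (33): linear (first-order Sugeno) consequent of each rule
    Y = k[:, :-1] @ x_norm + k[:, -1]
    # Eq. (31): firing-strength-weighted average of the rule outputs
    return np.sum(W * Y) / np.sum(W)
```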

Decision tree (DT)

A decision tree is a flowchart-like structure used to illustrate a decision-making process. It is a supervised learning technique that may be applied to both classification and regression tasks. The dataset is recursively split into subsets based on the values of the input features, in order to create subsets (or "leaves") that are as homogeneous as possible with respect to the target variable.

The "root" node of the tree, which represents the complete dataset, is the first node in the tree. Then, based on a selected feature and a threshold value, the root node is divided into two or more child nodes. Recursive splitting occurs on each child node until a halting requirement is satisfied. For instance, the tree can be terminated when a node reaches a predetermined threshold for data points or when all of the data points in a node belong to the same class.

Each leaf node of the tree represents a class label (in the case of a classification problem) or a predicted value (in the case of a regression problem), and each internal node represents a test on an input feature57. The route from the root to a leaf node represents a set of choices that result in a particular conclusion, so a decision tree can be used for prediction by traversing from the root to a leaf node and selecting the class label or predicted value linked to that leaf. One benefit of decision trees is their readability and comprehensibility, owing to the clear and logical representation of the decision-making process; however, they may be prone to overfitting if they are not appropriately pruned or regularized.

The simplicity of understanding and visualization, ease of data pre-processing, and insensitivity to outliers are all advantages of the DT over other ML models58. In this study, tenfold cross-validation along with a grid search is used to optimize the decision tree; a sketch of this tuning is given below. The selected hyper-parameter values are auto, 10, 2, 5, and 100 for max features, max depth, min samples leaf, min samples split, and max leaf nodes, respectively.
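The tuning described above could be reproduced with scikit-learn roughly as follows; the candidate grids are illustrative, chosen to bracket the reported optimum ('auto' for max_features is deprecated in recent scikit-learn, so named options are used instead):

```python
from sklearn.tree import DecisionTreeRegressor
from sklearn.model_selection import GridSearchCV

param_grid = {
    'max_features': [None, 'sqrt', 'log2'],
    'max_depth': [5, 10, 15],
    'min_samples_leaf': [1, 2, 4],
    'min_samples_split': [2, 5, 10],
    'max_leaf_nodes': [50, 100, 200],
}
search = GridSearchCV(DecisionTreeRegressor(random_state=0), param_grid,
                      cv=10, scoring='neg_root_mean_squared_error')
search.fit(X_train, y_train)
# Reported optimum: max_depth=10, min_samples_leaf=2, min_samples_split=5,
# max_leaf_nodes=100
print(search.best_params_)
```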

eXtreme gradient boosting (XGBoost)

XGBoost is a highly effective and scalable ML algorithm for tree boosting and has been widely employed in various domains to produce cutting-edge outcomes on difficult data problems. The gradient boosting framework is optimized in XGBoost, which is designed to be highly effective, adaptable, and portable59. The basic task of the XGBoost method is to optimize the value of the objective function, which consists of a regularization term and a loss function. The regularization term smooths the final learned weights to limit overfitting, while the loss function, which measures the difference between the estimated and actual label for a given training sample, minimizes the error of the whole model. A few XGBoost tuning settings have a significant impact on the model's performance and training efficiency; the learning rate, maximum tree depth, minimum sum of instance weights, subsample ratio, and number of boosting iterations are some of these hyper-parameters. The basic architecture of the XGBoost model is depicted in Fig. 4b. The pseudocode of the XGBoost algorithm is outlined below:


1. Compute the gradient \({g}_{i}\) and hessian \({h}_{i}\) of the loss function \(l\left({y}_{i},{\widehat{y}}_{i}^{\left(t-1\right)}\right)\) with respect to the current model's predictions \({\widehat{y}}_{i}^{\left(t-1\right)}\).
2. Solve for the optimal value of the new tree \({f}_{t}\left({x}_{i}\right)\) by minimizing an approximation of the negative gradient; this step is known as the line search. It is done by computing the function \({\widetilde{L}}^{(t)}\), which combines the gradient and hessian of the loss function with the number of leaves T.
3. Update the current model by adding the new tree \({f}_{t}\left({x}_{i}\right)\) to the ensemble with a step size of \(\epsilon \): \({\widehat{y}}_{i}^{\left(t\right)}={\widehat{y}}_{i}^{\left(t-1\right)}+\epsilon {f}_{t}\left({x}_{i}\right)\).
4. Iterate until the stopping criteria are met, such as a maximum number of trees or a minimum improvement in the error; the final ensemble of decision trees is then used to make predictions on new data.
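These steps can be illustrated for the simple squared-error loss, for which \({g}_{i}={\widehat{y}}_{i}-{y}_{i}\) and \({h}_{i}=1\); this stripped-down sketch omits XGBoost's regularized leaf-weight solution and column subsampling:

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

def boost_step(X, y, y_hat, eps=0.4, max_depth=3):
    """One boosting iteration for the squared-error loss (steps 1-3 above)."""
    g = y_hat - y                    # step 1: gradients
    h = np.ones_like(y)              # step 1: hessians (constant here)
    tree = DecisionTreeRegressor(max_depth=max_depth)
    tree.fit(X, -g / h)              # step 2: fit the new tree to -g/h
    return y_hat + eps * tree.predict(X)   # step 3: shrunken additive update
```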

Grid search, randomized search, and tenfold cross-validation are used to optimize the XGBoost hyperparameters. The initial hyperparameters are obtained through a random search and then optimized using a grid-search approach. The training dataset is randomly divided into ten folds; nine folds are employed for model training and one fold for performance evaluation. The cross-validation procedure is carried out ten times, using the validation data from each of the ten subsamples exactly once60. The hyperparameter values adopted for the XGBoost model are 100, 10, 0.9, 0.4, and 123 for n estimators, alpha, subsample, learning rate, and random state, respectively.
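A sketch of this two-stage search with the xgboost scikit-learn wrapper follows (reg_alpha is the sklearn-API name for alpha; the candidate values and the stage-2 grid are illustrative):

```python
from xgboost import XGBRegressor
from sklearn.model_selection import RandomizedSearchCV, GridSearchCV

# Stage 1: coarse randomized search with tenfold cross-validation
space = {'n_estimators': [50, 100, 200], 'reg_alpha': [1, 10, 20],
         'subsample': [0.7, 0.8, 0.9], 'learning_rate': [0.1, 0.2, 0.4]}
rnd = RandomizedSearchCV(XGBRegressor(random_state=123), space,
                         n_iter=20, cv=10, random_state=123)
rnd.fit(X_train, y_train)

# Stage 2: refine around the stage-1 winner with a grid search; the reported
# optimum was n_estimators=100, alpha=10, subsample=0.9, learning_rate=0.4
grid = GridSearchCV(XGBRegressor(random_state=123, **rnd.best_params_),
                    {'max_depth': [3, 4, 5, 6]}, cv=10)
grid.fit(X_train, y_train)
```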

Results and discussion

The results and discussion section is divided into four subsections. In the first subsection, the results of the analytical models are explained with scatter plots, 2D kernel plots, and absolute error plots. The findings of the ANN, ANFIS, DT, and XGBoost models are then discussed graphically in the second subsection, where line plots of the experimental, predicted, and error data are also used to make the produced models more transparent. The comparison between the analytical and ML-based models is explained in the discussion subsection with violin and Taylor plots. The influence of the individual parameters on the shear strength of the CRCBs is explained in the last subsection.

Outcomes of analytical models

Six analytical models and one existing ML model (Gradient Boosted Regression Trees, GBRT) have been used to benchmark the performance of the ML models. Among the analytical models, the correlation coefficient of Lu et al.'s model is the highest, with the values of the models by Xu and Niu, Huo, Zhao and Jin, Yu, and Li et al. decreasing in that order. However, Huo's model has the lowest MAPE value, 14.61% lower than that of Lu et al.'s model. The other performance metrics of Lu et al.'s model confirm its accuracy compared with the other analytical models. Table 5 shows the values of each performance metric.

Table 5 Results of analytical models and existing ML model.

The scatter plots (Fig. 9, left side), 2D kernel density plots (Fig. 9, middle), and absolute error plots (Fig. 9, right side) of all the analytical models are shown in Fig. 9. In the Huo, Lu et al., Yu, Xu and Niu, Li et al., and Zhao and Jin models, 55%, 52.14%, 49.29%, 46.42%, 36.42%, and 22.89% of the data lie within the error range of −30 to +30 kN, respectively. The error ranges of these analytical models are −100 to 201.28 kN, −99.50 to 246.13 kN, −71.41 to 207.89 kN, −268.43 to 195.69 kN, −62.42 to 273.36 kN, and −129.27 to 144.10 kN, respectively. According to the absolute error plots (Fig. 9, right side), approximately 80% of the dataset lies within the 50 kN error limit for the Lu et al. model. It can therefore be inferred that the Lu et al. model performs well in comparison with the other analytical models.

Figure 9. Results of the analytical models: (a) Xu and Niu, (b) Yu, (c) Huo, (d) Zhao and Jin, (e) Li et al., and (f) Lu et al.

Outcomes of XGBoost, DT, ANFIS, and ANN models

In all the models, the dataset is divided into two categories: (i) training (70%) and (ii) testing (30%). The R-values of the ANN training and testing datasets are 0.9908 and 0.9962, respectively. The MAPE, RMSE, MAE, NSEI, and a20-index of the whole dataset (ANN) are 7.4703%, 12.2962 kN, 7.0135 kN, 0.9809, and 0.9357, respectively. The overfitting value of the ANN model is 0.0567, as shown in Table 6. In the ANFIS model, the R-values of the training and testing datasets are 0.9987 and 0.9894, respectively. The MAPE value of the ANFIS model, 5.4623%, is lower than that of the ANN model; similarly, it is 14.58% less than that of the DT model. The overall MAE and RMSE values of the ANFIS model are also lower than those of the ANN and DT models, and the ANFIS model has a higher NSEI and a20-index than both. The overfitting values of the ANFIS and ANN models are very close to each other. The MAPE value of the XGBoost model for the whole dataset is minimal at 0.0459%, its R-value is approximately equal to one, and the a20-index and NSEI also approach one. The overfitting value of the XGBoost model is 0.0021, as shown in Table 6.

Table 6 Results of ANN, ANFIS, DT, and XGBoost model.

The scatter plot, 2D kernel plot, absolute error plot, and line plot of the ANN model are presented in Fig. 10a–d, respectively. According to the scatter plot (Fig. 10a), only 25.72% of the values lie directly on the fitting line, whereas 80% of the values are inside the 10 kN absolute error limit. As per Fig. 10b,c, the error range is between −36.39 and 61.60 kN. The line plot of the measured and predicted values with the distribution of the errors is presented in Fig. 10d.

Figure 10. Results of the ANN model: (a) scatter plot, (b) 2D kernel density plot, (c) absolute error plot, and (d) line plot of the experimental and predicted values with errors.

Figure 11 shows the scatter plot, 2D kernel plot, absolute error plot, and line plot of the ANFIS model. According to the scatter plot (Fig. 11a), 45.72% of the dataset lies on the fitting line, and 87.14% of the values are inside the 10 kN absolute error limit. The error range of −40.24 to 80.19 kN is shown in Fig. 11b,c. The line plot of the predicted and experimental values for the training and testing phases is displayed in Fig. 11d.

Figure 11. Results of the ANFIS model: (a) scatter plot, (b) 2D kernel density plot, (c) absolute error plot, and (d) line plot of the experimental and predicted values with errors.

The scatter plot, 2D kernel plot, absolute error plot, and line plot of the DT model are shown in Fig. 12. According to Fig. 12a, 13.57% of the dataset lies on the fitting line, and 82.14% of the values are inside the 10 kN absolute error limit. According to the 2D kernel plot (Fig. 12b), the range of the errors is between −29.13 and 120.18 kN. Figure 12c,d shows the absolute error and line plots of the developed DT model.

Figure 12. Results of the DT model: (a) scatter plot, (b) 2D kernel density plot, (c) absolute error plot, and (d) line plot of the predicted and experimental values with errors.

Similarly, the scatter plot, 2D kernel plot, absolute error plot, and line plot of the XGBoost model are shown in Fig. 13. According to Fig. 13a, 97.86% of the dataset lies on the fitting line, and 99.29% of the values are inside the 2.88 kN absolute error limit. According to the 2D kernel plot (Fig. 13b), the range of the errors is between −0.49 and 4.80 kN. Figure 13c,d shows the absolute error and line plots of the developed XGBoost model.

Figure 13. Results of the XGBoost model: (a) scatter plot, (b) 2D kernel density plot, (c) absolute error plot, and (d) line plot of the predicted and experimental values with errors.

According to the performance metrics and graphical representations, the developed XGBoost model has greater performance and reliability than the ANN, ANFIS, and DT models.

Discussion

The results of the developed ML models have been compared with the analytical models and an existing ML-based model (Fu and Feng61). The R-value of the XGBoost model is 15.07%, 1.11%, 0.95%, 0.66%, and 1.01% higher than those of the Lu et al., Fu and Feng, ANN, ANFIS, and DT models, respectively. Similarly, the NSEI and a20-index of the XGBoost model are 40.06%, 1.94%, 1.47%, and 3.24%, and 185.71%, 6.87%, 4.48%, and 5.26%, higher than those of the Lu et al., ANN, ANFIS, and DT models, respectively. On the other hand, the MAPE, RMSE, and MAE values of the XGBoost model are the lowest, as shown in the last row of Table 7. The overfitting value of the XGBoost model is 97.88%, 97.82%, and 98.46% lower than those of the ANN, ANFIS, and DT models, respectively.

Table 7 Comparison of analytical and existing ML model with developed ML models.

The violin plots and multi-histograms of all the analytical and developed ML models are shown in Fig. 14, which clearly depicts that the accuracy of the XGBoost model is higher than that of the other models (analytical, ANN, ANFIS, and DT). The Taylor diagrams of the analytical and ML models are shown in Fig. 15a,b, respectively. A Taylor diagram is a graphical representation of the predicted values in relation to the original data, plotted in terms of R, RMSE, and standard deviation. In Fig. 15a, two analytical models (Zhao and Jin, and Lu et al.) cross the reference line of the standard deviation, and the Xu and Niu, Huo, and Lu et al. models lie below the 60 kN RMSE value. On the other hand, in Fig. 15b, the XGBoost model lies directly on the reference line of the original dataset. This confirms the reliability and precision of the XGBoost model among all the analytical and ML models.

Figure 14. Performance comparison of analytical and ML-based models with violin and multi-histogram plots.

Figure 15. Taylor graph of (a) analytical models and (b) ANN, ANFIS, DT, and XGBoost models.

Feature importance

Lundberg and Lee developed a technique called SHAP (SHapley Additive exPlanations) to interpret black-box models62. The SHAP method uses game theory (Shapley values) to attribute the output of a machine-learning model to its input features.

It is crucial to carry out a variety of studies that are AI-based, adaptable, and capable of performing well on a variety of data. By assessing its dependence on physical processes, sensitivity analysis (SA) and parametric studies help to confirm the robustness, effectiveness, and reliability of the generated ML models. The influence of the individual parameters is shown in Fig. 16. According to the best-fitted model, the width of the beam, the stirrup spacing, and the a/d are the most influential factors, affecting the shear capacity by 60.86%, 21.67%, and 12.33%, respectively. The degree of corrosion of the stirrups has only a 1.59% impact on the shear strength of the CRCBs.
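A typical SHAP workflow for a fitted XGBoost model is sketched below; the variable names follow the tuning sketch in the XGBoost section, and the feature-name list matches the eleven inputs:

```python
import shap

# TreeExplainer computes exact SHAP values efficiently for tree ensembles
explainer = shap.TreeExplainer(grid.best_estimator_)   # fitted XGBoost model
shap_values = explainer.shap_values(X_train)
shap.summary_plot(shap_values, X_train,
                  feature_names=['b', 'd', 'fck', 'fy', 'rho_l', 'rho_v',
                                 'fyv', 's', 'a/d', 'eta_l', 'eta_w'])
```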

Figure 16. Feature importance.

Conclusions

Estimating the shear capacity of CRCBs is a very challenging issue in the civil engineering sector. To address this issue, four ML-based algorithms (ANN, ANFIS, DT, and XGBoost) have been developed. The parameters considered to influence the shear strength of the CRCBs are the width of the beam, the effective depth of the beam, the CS of concrete, the yield strength of steel, the percentage of longitudinal steel, the percentage of stirrup steel, the yield strength of stirrups, the stirrup spacing, the a/d, the corrosion degree of the longitudinal steel, and the corrosion degree of the stirrups. The conclusions drawn from the results of the analysis are summarized below:

  • Among the analytical models, the prediction accuracy of Lu et al.'s model is the highest based on the performance metrics; its R-value and MAE are 0.8689 and 36.54 kN, respectively.

  • A single hidden layer with ten neurons has been used in the ANN, and the developed model shows good accuracy. The R-values of the training and testing data are 0.9908 and 0.9962, respectively. The MAPE value of the whole dataset is 7.47%.

  • With a cluster radius of 0.45 and eighteen rules, the performance of the ANFIS model is good. The R-values of the training and testing datasets are 0.9987 and 0.9894, respectively. The MAPE, MAE, and RMSE of the whole dataset are 5.46%, 4.97 kN, and 10.76 kN, respectively.

  • The correlation coefficient of the DT model for the whole dataset is 0.9899, and the error metrics of the DT model are higher than those of the ANFIS model.

  • The correlation coefficients of the training and testing datasets of the XGBoost model are both 0.9999. The MAPE, MAE, and RMSE values of the whole dataset are 0.05%, 0.10 kN, and 0.49 kN, respectively.

  • The excellent effectiveness of the XGBoost model in calculating the shear capacity of CRCBs was also demonstrated graphically through the Taylor diagram and violin plot.

  • The developed model is very flexible and robust for engineers, requiring relatively few trial experiments; as a result, it saves time and money throughout the CRCB strengthening process.

In addition to the experimental data gathered for this study, further research should utilize larger datasets. A GUI that enables interactive, button-based task execution would also help users with the practical and design interpretation of the shear capacity estimation28. Therefore, to develop multi-dimensional validation and improve the methodology employed in this work, the aforementioned elements should be taken into account in later investigations.