1 Introduction

Reinforced concrete walls (RCW) are widely employed in structures that are subjected to significant lateral loadings, such as wind and earthquake loads (Hadzima-Nyarko 2015) (Nikoo et al. 2017). RCWs frequently function as cantilever beams with a rigid base, bearing loads down to the foundations (Darwin et al. 2016) (Al-Furjan et al. 2022). These structural elements are susceptible to fluctuating shear, which normally achieves its greatest value at the bottom, bending moments with tensile stress located near the loaded tip and compressive stress towards the distant extremity, and vertical compression owing to gravity loads. RCWs with a low height-to-length proportion are mostly governed by shear load, whereas those with a high height are primarily influenced by bending design specifications.

The design principle of RCW is typically identical to that of beams. These conventional approaches depend heavily on imprecise inductive calculations, which have several drawbacks. The ACI318 design code (Committee 1995), for example, excludes the influence of longitudinal shear reinforcement, whereas the Eurocode (Code 2005) implies that the shear capacity of RCW with a height-to-length proportion of 2 or more is exclusively dependent on transverse shear reinforcement. RCW must be constructed to be malleable in nature. As a result, its mechanical integrity should be determined by flexure rather than shear. As a result, precise modelling of the shear strength of RCW is critical (Asgarpoor et al. 2021) (Keshtegar et al. 2022) (Tabrizikahou et al. 2022).

Nonetheless, given the plethora of structural implementations and load combos that may occur on an RCW, comprehensive knowledge of its structural behaviour is indeed very complicated (Teng and Chandra 2016) (Piri et al. 2023). For example, low-rise RCW may have a height-to-length ratio of less than 2, but taller structures may have a proportion significantly more than 2 Chandra et al. (2018), (Zhu et al. 2022). RCW may also have a variety of geometrical layouts, reinforcing schemes, and gravitational and seismic loads. Given the complexities of RCW, many laboratory investigations have been undertaken to fully understand its structural behaviour (Al-Furjan et al. 2022) (Al-Furjan et al. 2022).

One of the fundamental phases in the design of structural components of reinforced concrete (RC) structures is the assessment of shear capability. Up to now, many academics and structural design codes have suggested multiple mechanical approaches to analyse the shear strength of RCW (Hadzima-Nyarko et al. 2021) (Kolahchi et al. 2022) (Tabrizikahou et al. 2022). A simple solution to the problem involves the structural code or a scientific investigation that considers numerous factors; nevertheless, both techniques provide significantly varied shear strength findings (Armaghani et al. 2019) (Marzok et al. 2020) (Luo et al. 2022). The absence of appropriate and dependable experimental or theoretical relationships for evaluating the shear capacity of RCWs has caused in the last two decades to pique the attention of academics working with non-deterministic approaches.

Various researchers used different methods to model the shear strength of RCWs. Some of the studies are summarized in Table 1.

Table 1 Algorithm used in modelling the compressive strength of rubberized concrete and shear concrete wall

The findings of the preceding investigations are undoubtedly promising, particularly given the applications of soft computing models to predict the shear strength of RCWs is still in their early stages. As a result, this study investigates the use of fuzzy-based techniques (GAFIS, PSOFIS, and ANFIS) for estimating the shear strength of RCWs. For network training, a research database is employed that is accessible in the literature and is concerned with the shear resistance of RCWs specimens of varied sizes, materials, and geometric characteristics. Since RF is a powerful prediction approach that has not been studied in forecasting the shear strength of RCWs, it is beneficial to compare its capabilities with GAFIS, PSOFIS, and ANFIS to see which way is best.

The purpose of this research is to create the RF technique as a reliable and efficient machine-learning model for predicting the shear strength of RCWs. The application of the RF model in the prediction of the shear strength of RCWs might help catch patterns in huge numbers of data obtained from empirical studies. The results of the created RF are thoroughly compared with three other robust artificial intelligence techniques, namely ANFIS, GAFIS, and PSOFIS. To the authors’ knowledge, this is the very first time the RF technique has been used to determine the shear strength of RCWs.

This paper’s novelty stems from its revolutionary way of evaluating the shear strength of reinforced concrete walls using soft computing-based methodologies. Traditional methods for constructing these structures usually rely on empirical formulations, which have limitations and produce low forecast precision. This work, on the other hand, overcomes these difficulties by utilizing the capabilities of soft computing models.

To increase shear capacity estimation reliability, the fuzzy model’s membership function range is improved by utilizing ANFIS, GA, and PSO techniques. Furthermore, the results of the developed models are compared to those of the RF model, revealing a novel use of RF in forecasting the shear strength of reinforced concrete walls.

The research also contributes to resolving the limitations of standard empirical approaches by providing an analysis of the key parameters of shear strength and investigating the use of soft computing models. The findings highlight the significance of components such as wall thickness and concrete compressive strength, supplying significant information to engineers and designers.

Section 2 presents and explains the main parameters which influence the shear strength of RCWs. Section 3 provides a comprehensive analysis of the state-of-the-art in modelling RCWs. Section 4 introduces the hybrid and RF methodologies, including the formulas and modelling layers, as well as the numerous statistical measures used to compare the different models. Section 5 then describes the characteristics of the innovative modelling techniques used in this work. Section 6 delves deeply into the model outputs, taking into account reliability, propensity, and variability. Section 7 discusses and compares the outcomes achieved by various methodologies. Finally, Sect. 8 illustrates the implications that may be taken from this research.

2 Influencing parameters

Physical, experimental, numerical, and statistical models are used to determine the lateral load-bearing capacity of RCWs in many building codes, specifications, and regulations. Most of these design regulations give a variety of layouts and statistical approaches for determining the shear capacity of RCWs. In these design codes, the shear strength of the RCWs is generally ascribed to the concrete (\(V_{\text {c}}\)) and reinforcement (\(V_{\text {s}}\)) shear strength and is determined using Eq. 1.

$$\begin{aligned} V_{\text {u}}=V_{c}+V_{\text {s}} \end{aligned}$$
(1)

However, to define the concrete and reinforcement shear strength, there are various ideas. ACI318-14-11 (American Concrete Institute 2014) proposed Eqs. 2 and 3 to calculate the values of (\(V_{\text {s}}\)) and (\(V_{\text {c}}\)).

$$\begin{aligned} V_{\text {s}}= & {} \frac{A_{v}f_{\text {yt}}{d}}{{s}}\end{aligned}$$
(2)
$$\begin{aligned} V_{\text {c}}= & {} min{\left\{ \begin{array}{ll}\left( 0.274\acute{{\lambda }}\sqrt{{\acute{f}_{\text {c}}}}+\frac{{N_{\text {u}}}}{4l_{w}d}\right) hd &{} \\ \left[ 0.05{\acute{\lambda }}\sqrt{\acute{f}_{\text {c}}}+\frac{\left( 0.104\lambda \sqrt{\acute{f}_{\text {c}}}+0.2 \frac{N_{\text {u}}}{l_{\text {w}}h}\right) }{\frac{{M_{\text {u}}}}{V_{\text {u}}l_{\text {w}}}-\frac{1}{2}}\right] hd &{} \end{array}\right. } \end{aligned}$$
(3)

Additionally, based on ACI318-14-18 (American Concrete Institute 2014) the values of (\(V_{\text {s}}\)) and (\(V_{\text {c}}\)) can be defined by using Eqs. 4 and 5.

$$\begin{aligned} V_{\text {s}}= & {} A_{\text {cw}}\times \rho _{t}f_{\text {yt}}\end{aligned}$$
(4)
$$\begin{aligned} V_{\text {c}}= & {} {A_{\text {cv}}}\times {\alpha _{\text {c}}}\acute{\lambda }\sqrt{\acute{f}_{\text {c}}} \end{aligned}$$
(5)

Eurocode 8 (Code 2005) introduces Eqs. 6 and 7 to calculate the concrete and reinforcement shear strength.

$$\begin{aligned} V_{\text {s}}{} & {} =\frac{{A_{\text {SW}}}}{s} {z} {f_{\text {ywd}}}\cot {\alpha }\end{aligned}$$
(6)
$$\begin{aligned} V_{\text {c}}{} & {} =\left[ {C_{Rd,c}}k\left( 100\rho _{1} {f_{\text {ck}}}\right) ^{\frac{1}{3}}+ {k_{1}} {\sigma _{cp}}\right] b_{w}d\nonumber \\{} & {} \ge \left( 0.035k^\frac{2}{3}f_{\text {ck}}^\frac{1}{2}+k_{1}\sigma _{cp}\right) b_{w}d \end{aligned}$$
(7)

Figure 1 illustrates the schematic of the concrete wall and the influencing parameters on the shear capacity of RCW such as concrete’s compressive strength, dimension proportion, applied load, transverse and longitudinal reinforcement proportions, and the wall’s gross area.

Fig. 1
figure 1

Schematic of the concrete wall and the influencing parameters studied in this paper

According to the formulae provided, the shear strength of RCWs is determined by the concrete’s compressive strength, applied load, azimuth and elevation reinforcement proportions, the wall’s cross-sectional area, the ratio of horizontal and vertical reinforcement and yield strength of the reinforcements. However, investigations by Wood (1990) and, Gulec et al. (2007) found that the lateral load-carrying capacity of RCWs results based on these empirical-based equations can be varied significantly based on different specifications. Therefore, in this study, different feasible methods are used to predict the shear capacity of RCWs.

3 Review of RCW modelling

Chandra et al. (2018) based on 84 different concrete walls developed a model for estimating the shear capacity of RCSW. Their results demonstrated that the model predicted the shear strength with about 0.36 difference \(({V_{\text {exp}}}/V_{\text {n}}=1.36)\) with a coefficient of variation (CoV) of 0.20. The suggested technique can also estimate the shear strength of RC walls with high accuracy over a wide variety of wall height-to-length ratios, concrete compressive strengths, and reinforcing percentages in the boundary components.

Baghi and Siavashi (2019) introduced a particle swarm optimization method for developing a quantitative formula for estimating the shear strength of RCSW that attributed to variations in the flexural stress criterion and the propensity of the essential perpendicular fracture, the impact of vertical and horizontal reinforcement capacity, the boundary column reinforcement throughout the length, and the dowel action. The model made use of a dataset that comprises 209 experimental data from the literature. The suggested model produced an average value of 1.13 with a COV of 33% by assessing the ratio between the experimental findings and the analytical predictions (\(V_{\text {exp}}\)/\(V_{\text {n}}\)). The average values of \(V_{\text {exp}}\)/\(V_{\text {n}}\) for the models related to ACI318-14 and Chandra et al. were 1.08 and 1.17, respectively, with COVs of 52.0 and 47.0%.

Kusunoki et al. (2019) investigated pushover analysis, which is frequently used to assess failure modes and stress distribution at the risk ultimate limit. To validate the accuracy of these functions, they evaluated a large empirical record of 507 RCW trials. The anticipated initial stiffness was discovered to be underestimated, the final flexural and shear strengths were typically lower than the equivalent actual numbers, and reliability was dependent on the existence of boundary columns and the presence of apertures. Likewise, the anticipated yield and maximum shear strength deformations appeared to be lower than the empirical values. These findings underscore the necessity for more precise and powerful prediction techniques.

Nguyen et al. (2021) scrutinized a machine-learning-based method for estimating the shear resistance of squat grooved RCW. They contended that available experimental models in current design codes and documented research result in considerable discrepancies in estimating shear capacity. As a consequence, they presented an ANN-based on 369 tests and 13 input factors gathered from the literature to more appropriately estimate the shear strength of squat flanged RCW than current algorithms. Furthermore, a prediction technique relying on the ANN framework was suggested to estimate the shear capacity of squat flanged walls, and a graphical interface system was developed to aid in functional design.

Mangalathu et al. (2020) researched the inadequate safety profitability of RCSW as well as the inadequacy of experimental models for instant failure mode assessment of current shear walls. They developed an appropriate statistical method using eight machine-learning methods based on a database of 393 empirical findings for shear walls with varied geometric arrangements. The random forest technique was devised and shown to be 86% accurate in determining the failure mechanism of shear walls. The important characteristics determining the failure mode of shear walls were discovered to be the wall aspect ratio, boundary element reinforcement coefficients, and wall length-to-wall thickness proportion.

Gondia et al. (2020) compared RCSW’s squat with boundary functions to empirical data from several situations, accounting for the purported inaccuracy of relevant current shear strength prediction expressions like ASCE/SEI 43-05. By employing a dataset of 254 shear walls, genetic programming (GP) is used to suggest an expression of shear strength prediction. The RCSW’s squat with fronting components, including suggested inexactness in critical current shear resistance forecasting terms such as ASCE/ SEI 43-05, was evaluated using practical data from various circumstances.

Fig. 2
figure 2

Steps in the RF method

Keshtegar et al. (2021) presented a hybrid machine intelligence system based on an ANN and an adaptive harmony search optimization (AHS) method for estimating the final shear strength of RCWs. Various quantitative measures were utilized to evaluate the effectiveness of the ANN model integrated with AHS (ANN-AHS) to three known experimental correlations and two ANN models linked with harmony search (ANN-HS) and global-best harmony search (ANN-GHS). Their findings showed that the suggested ANN-AHS model outperformed the ANN-HS and ANN-GHS models in estimating the shear strength of RCW and was more precise than the recognized experimental equations.

Keshtegar et al. (2021) examined a hybrid model to estimate the shear capacity of RCSW. The proposed modelling technique (RSM-SVR) is comprised of the integration of the support vector regression (SVR) and response surface model (RSM) techniques. It was demonstrated that the suggested RSM-SVR modelling technique produced more precise estimates of RCSW shear strength. The method accurately appreciated the effect of the main design factors, exhibiting a consistent trend and significantly decreased ambiguity. As a result, the suggested model might be used in intelligent creative models and to improve relevant requirements in design codes.

4 Methodology

4.1 Random Forests (RF)

Breiman’s RF model was proposed as a powerful learning algorithm utilizing an assembly of decision trees (Breiman 2001). The RF is applied for both regression and classification issues. In the RF algorithm, each tree is grown on a separate training set that is a bootstrap replicate of the original data (Goudarzi and Shahsavani 2012). The suggested technique is built with a mix of decision trees created from arbitrary bootstrap samples of the input datasets. In comparison with a single regression tree, RF can reduce the risk of overfitting while improving prediction performance. The number of trees (\({\textbf {n}}_{\text {tree}}\)) and the predictor parameters (\({\textbf {m}}_{\text {try}}\)) are two essential factors for executing the RF method that may be used to modify the model. The following are the major processes for creating an RF model, as seen in Fig. 2:

  1. (1)

    Considering (\({\textbf {n}}_{\text {tree}}\)) in the forest.

  2. (2)

    Attributing to each tree a bootstrap sample of size \({\textbf {n}}_{\text {tree}}\) from the input or training dataset.

  3. (3)

    Choosing a predictor (\({\textbf {m}}_{\text {try}}\)) from a training dataset with p randomly selected predictors for each node’s split point.

  4. (4)

    Distinguishing the split point and the best parameter from among predictors and dividing each node into two sub-nodes.

  5. (5)

    Anticipate the new data, i.e. the predictions, by summing the random forest since it is a strong model that can cope with a high-dimensional database with numerous predictors.

To minimize overfitting in random forest prediction, the major tuning parameters (\({\textbf {n}}_{\text {tree}}\); \({\textbf {m}}_{\text {try}}\)) should be chosen carefully. The grid-search approach was used to optimize the RF values for the parameters \({\textbf {n}}_{\text {tree}}\) and \({\textbf {m}}_{\text {try}}\) at the same time, based on the root mean squared error (RMSE) of the out-of-bag (OOB) data for each parameter setting.

Fig. 3
figure 3

A simple structure of ANFIS with two inputs

In this research, the values of \({\textbf {n}}_{\text {tree}}\)=40 and \({\textbf {m}}_{\text {try}}\)=7 was used to construct the final model.

4.2 Fuzzy inference system (FIS)

To compare the results of ANFIS, PSOFIS, and GAFIS algorithms based on fuzzy rules, it is necessary to develop a fuzzy inference system. Fuzzification, fuzzy database, and defuzzification are the main sections of FIS. To build the Sugeno FIS structure, grid partitioning (GP) and subtractive clustering (SC) are used. Since the grid partitioning needs more computational effort compared to the subtractive clustering (Ansari et al. 2020), in this research genfis3 generates a FIS using fuzzy clustering by extracting a set of rules that models the data behaviour used. The number of clusters is equal to 5 with 100 iterations used for FIS development. The fuzzy rules in the Takagi and Sugeno fuzzy inference system (Takagi and Sugeno 1985) were used and are shown as follows:

$$\begin{aligned}{} & {} f_{i}=p_{i}x+q_{i}y+r_{i} \ , \ \ x=A_{i} \ , \ \ y=B_{i}\ , \ \ \nonumber \\{} & {} i=1, 2, 3, ..., n \end{aligned}$$
(8)

4.3 Adaptive neuro-fuzzy inference system (ANFIS)

ANFIS is a multilayer network fuzzy inference system that provides a connectivity structure of Jang’s Sugeno fuzzy system (Jang 1993). ANFIS functions in an equivalent way as artificial neural networks (ANN) and fuzzy inference systems (FIS).

To optimize the membership functions parameter, the approach employs a mix of back-propagation and least square (hybrid) or back-propagation (Chen 2013). Figure 3 depicts the ANFIS’s overall design, which includes two inputs and one output. The ANFIS algorithm has five computation levels. Each layer’s job is briefly discussed below.

Layer 1: Assigns a linguistic value to each input, such as small, medium, big, and so on. Every node is a node function that adapts to its surroundings.

$$\begin{aligned} O_i^1=\mu _{A_{i}}\left( X\right) i=1,2,3, \ \ \ O_i^1=\mu _{B_{i-3}}\left( Y\right) i=4,5,6\nonumber \\ \end{aligned}$$
(9)

The linguistic value with their member functions (\(\mu _{Ai}\) and \(\mu _{Bi-3}\)) is the output of the first layer where X and Y are the node i model inputs, \(A_{i}\) and \(A_{i-3}\) are the linguist value.

Layer 2: Amplifies the input nodes. The outcome of this layer reflects firing strength, which indicates how well the norms match the inputs.

$$\begin{aligned}{} & {} O_k^2=w_{k}=\mu _{A_{i}}\left( X\right) \times \mu _{B_{i-3}}\left( Y\right) , \ \ \ i=1,2,...,6, \nonumber \\{} & {} k=1,2,3 \end{aligned}$$
(10)

where \(O_k^2\) or \(w_{k}\) is the firing strength of node k.

Layer 3: Determines each node’s normalized firing strength. This strength’s ratio of the \(k_{th}\) regulates to the sum of the firing strength principles.

$$\begin{aligned} O_k^3=\overline{w}_{k}=\frac{w_{k}}{\sum w_{k}}=1,2,3 \end{aligned}$$
(11)

where \(O_k^3\) is the normalized firing strength of node k.

Layer 4: Determines the node function of each node as well as the effect of each node on the overall outcome.

$$\begin{aligned} O_k^4=\overline{w}_{k}f_{k}=\overline{w}_{k}\left( p_{k}X+q_{k}Y+r_{k}\right) =1,2,3 \end{aligned}$$
(12)

Layer 5: Denotes the total output as a sum of all input nodes.

$$\begin{aligned} O_k^5=Z=\sum \overline{w}_{k}f_{k}k=1,2,3 \end{aligned}$$
(13)

The Gaussian membership function is utilized for model creation in this study, and the hybrid optimization approach is applied for membership function (MF) parameters. The results of parameters used in the ANFIS model are summarized in Table 2.

Table 2 Summary of ANFIS model

4.4 Integrated genetic algorithms with FIS (GAFIS)

Holland et al. (1992) established the search genetic algorithm (GA), which is a probabilistic and population-based optimization approach. Natural processes including heredity, mutation, crossover, and natural selection influenced the model’s concept (Rani and Moreira 2010) (Al-Obaidi et al. 2017) (Koza 1994). In each iteration, the optimizer strives for the best possible results by analysing and choosing the parent population from the triggered population using the roulette wheel method and the implementation of evolutionary operators, such as mutation and crossover, on the parent to generate the next generation.

The method begins by creating the starting population at random. The fitness function is then used to measure the individual’s fitness. Then, during the selecting step, techniques such as the Roulette Wheel are used (Goldberg 2006). The fitness of each population is determined using Eq. 14 in this technique.

$$\begin{aligned} p_{i}=\frac{f_{i}}{\sum _{i=1}^nf_{i}} \end{aligned}$$
(14)

where \(p_{i}\) is the probability of individual, \(f_{i}\) is the fitness of individual, and n is the population size.

Crossover and mutation are two processors employed to produce offspring (Sharafati et al. 2020).

Table 3 provides an overview of the GA parameters that were employed in this study.

Table 3 Summary of GA parameters

4.5 Integrated particle swarm optimization with FIS (PSOFIS)

Particle swarm optimization (PSO) is a bio-inspired population-based algorithm that generates and uses random variables driven by the intelligent behaviour of wildlife swarms like birds or fish (Kennedy and Eberhart 1995).

First, the algorithm distributes the particles randomly within a given space of solutions. Then, the positions of the particles are assessed as the global best locations following their own experience as well as of the neighbours. Particles’ speed is computed since each particle has its velocity, and it does its own repeated selection for the optimum location. Eqs. 15 and 16 calculate velocity and new locations.

$$\begin{aligned} V_{i,(t+1)}{} & {} =wV_{i,(t)} + c_{1}r_{1}\left( Bp_{i,(t)}-X_{i,(t)}\right) \nonumber \\{} & {} +c_{2}r_{2}\left( Bg_{t}-X_{i,(t)}\right) \end{aligned}$$
(15)
$$\begin{aligned} X_{i,(t+1)}{} & {} =X_{i,(t+1)}+V_{i,(t+1)} \end{aligned}$$
(16)

where w is inertial weight representing the impact of the velocity vector (\(V_{i}\)) on the new vector, t is the velocity of particle i at iteration t, \(c_{1}\) and \(c_{2}\) are acceleration constants, \(Bp_{i}\), (t) and \(Bg_{t}\) are particle i and global best positions, respectively, and \(X_{i}\), t is the present position of particle i.

The optimum location is referred to as the result or forecast of the outcome parameter. Locating a newer location is influenced by two criteria throughout the finding phase of every single unit; the first is the particle’s best experience up to that iteration. The steps are repeated as long as the stopping criteria are fulfilled.

PSO and GA algorithm was used in this study to optimize the parameters of the fuzzy inference system. In contrast to the GA method, the PSO algorithm may function effectively with a tiny population. The best local and global outcomes were identified by RMSE for each iteration. Table 4 lists all of the PSO parameters used in this study.

Table 4 Summary of PSO parameters

Figure 4 depicts the approaches utilizing GA and PSO.

Fig. 4
figure 4

Steps in PSO and GA model

5 Dataset for modelling process

The total number of data used in this dataset comprised of 143 experimentally tested RCWs. The dataset was divided into train and test data sets at random. 65% (95 number) of the data was chosen for model creation, while the remaining 48 numbers (35%) was chosen to test the performance of the generated model. As a result, the model’s liability is assessed on data that is unknown to the model and was not provided during the training stage.

Table 5 shows the statistical characteristics of the input variables and the related maximum force \(({V_{\text {u}}})\) for the datasets. \(\hbox {X}_{\text {{min}}}\), \(\hbox {X}_{\text {{max}}}\), mean, and standard deviation (STD) are the variables’ minimum, maximum, average, and standard deviation, respectively.

Table 5 Statistical properties of datasets for shear strength of RC wall

\({L_{\text {w}}}\), \({h_{\text {w}}}\), and \(b_{\text {w}}\) represent length, height, and thickness of the wall, respectively. \({A_{\text {g}}}\) is the gross area of the wall, \({h_{\text {w}}/l_{\text {w}}}\) is height-to-length ratio, \({\rho _{x}}\) is ratio of reinforcement by longitudinal reinforcement, \({\rho _{y}}\) is transverse reinforcement ratio, \(f_{\text {c}}\) is concrete compressive strength, \({F_{\text {yv}}}\) is yield strength of bars at vertical direction of wall, \({F_{\text {yt}}}\) is strength of horizontal bars, \(N_{\text {u}}\) is applied load during experimental test, and \(V_{\text {u}}\) is shear strength capacity of the RCW.

5.1 Comparative metrics

Some statistical indices, such as root mean square error (RMSE), mean absolute error (MAE), \(\hbox {R}^2\), Willmott index of agreement (WI), performance index (PI), and discrepancy ratio (DR), are employed and examined to assess predictive model adequacy in both training and testing phases:

$$\begin{aligned}{} & {} \!\!\!R^{2}=\frac{\left[ \sum _{i=1}^n\left( o_{i}-\overline{o}\right) \times \left( p_{i}-\overline{p}\right) \right] ^{2}}{\sum _{i=1}^n\left( o_{i}-\overline{o}\right) ^{2}\times \sum _{i=1}^n\left( p_{i}-\overline{p}\right) ^{2}}\end{aligned}$$
(17)
$$\begin{aligned}{} & {} \!\!\!RMSE=\sqrt{\frac{\sum _1^n\left( o_{i}-p_{i}\right) ^{2}}{N}}\end{aligned}$$
(18)
$$\begin{aligned}{} & {} \!\!\!MAE=\frac{1}{n}\sum _{i=1}^n|\left( o_{i}-p_{i}\right) |\end{aligned}$$
(19)
$$\begin{aligned}{} & {} \!\!\!WI{=}1{-}\frac{\sum _{i=1}^n|\left( o_{i}{-}p_{i}\right) |^{2}}{\sum _{i=1}^n\left[ |\left( o_{i}-\overline{o}\right) |{+}|\left( p_{i}{-}\overline{o}\right) |\right] ^{2}} \ , \ \ 0<WI\le 1\nonumber \\ \end{aligned}$$
(20)
$$\begin{aligned}{} & {} \!\!\!PI_{A}{=}\frac{1}{4}\times \left( \frac{R_{min}^2}{R_A^2}{+}\frac{WI_{min}}{WI_{A}}{+}\frac{RMSE_{A}}{RMSE_{max}}{+}\frac{MAE_{A}}{MAE_{max}}\right) \nonumber \\ \end{aligned}$$
(21)
$$\begin{aligned}{} & {} \!\!\!DR_{i}=(p_{i}/o_{i})-1 \end{aligned}$$
(22)

where \(o_{i}\) and \(p_{i}\) show observational and predicted values of the model, respectively, and \(\overline{o}\) and \(\overline{p}\) show mean observational and predicted values of the model, A and n denote the model under review and the total number of data, respectively.

MAE is an index of the difference in uncertainty among two measurements reflecting the identical phenomena (Willmott and Matsuura 2005). Variations of anticipated against reported and one analysing method versus another evaluation method are examples of \(o_{i}\) versus \(p_{i}\).

WI or index of agreement was proposed by Willmott (Willmott 1981) as a predefined indicator of model projection uncertainty that ranges from 0 to 1. The WI is the proportion of mean square error to prospective error. A score of 1 shows a flawless coincidence, whereas a value of 0 implies no compatibility of any kind.

DR distinguishes the disparity between the reported and modelled amount of instances to the difference between the anticipated population and the estimation interval’s limit.

With lower values for MAE, RMSE, and PI, higher values for \(\hbox {R}^2\), and WI, the greatest accuracy can be obtained. Among other indices, DR shows information about error distribution. The predicted and observed values are equal when DR = 0, the predicted values are underestimated when DR<0, and the predicted values are overestimated when DR>0.

6 Results and discussion

For impactful model assessment and a preliminary examination of prediction accuracy, graphical techniques and quantitative indicators are essential. The scattered plot is a common graphical approach for assessing data distribution and variation.

The scatterplots for the training and testing datasets for anticipated and experimentally observed \(V_{\text {u}}\) are shown in Fig. 5. For the projected data of all models, the perfect equity line of P = O, as well as the lines with +20% and −20% errors, are displayed.

Fig. 5
figure 5

Measured and predicted value of \(V_{u}\) dimensions for test data

According to the results shown in Fig. 5, the number of errors with predicted data in the more than +20% and less than −20% range for ANFIS is 71 (48 data in training and 23 data in testing), 44 for PSOFIS (27 data in training and 17 data in testing), 39 for GAFIS (24 data in training and 15 data in testing), and 37 data points (34 data in training and 15 data in testing) by using RF model.

Table 6 displays the performance of all examined models based on the RMSE, \(\hbox {R}^2\), MAE, and WI criteria, providing for a more detailed comparison of the approaches. The index values for the RF, PSOFIS, and GAFIS models are near to each other in the training set. PSOFIS is ranked top based on \(\hbox {R}^2\), RMSE, and MAE values, with RF, GAFIS, and ANFIS ranking in next places, respectively.

Based on \(\hbox {R}^2\) and RMSE in the testing phase, the RF with the greatest \(\hbox {R}^2\) and lowest RMSE (RMSE = 151.89 and \(\hbox {R}^2\) = 0.9351) is superior for testing data. PSOFIS is ranked second with (RMSE = 186.20 and \(\hbox {R}^2\) = 0.891), and GAFIS and ANFIS are ranked third and fourth, respectively, with (RMSE = 188.498 and \(\hbox {R}^2\) = 0.889) and (RMSE = 225.7 and \(\hbox {R}^2\) = 0.84). Based on MAE, RF is preferable with an MAE of 111.52, followed by GAFIS, PSOFIS, and ANFIS with MAEs of 114.39, 116.62, and 157.83, respectively.

A multi-index parameter (PI) is also utilized to measure the performance of the models. The index incorporates statistical indicators such as \(\hbox {R}^2\), RMSE, MAE, and WI. According to equation 1, PI is between 0 and 1, with PI closer to 0 indicating the highest model accuracy. For test data, the performance index for RF is 0.81, putting it in the first place. PSOFIS and GAFIS, both with PI = 0.87, are ranked second, while ANFIS, with PI = 1, is ranked last.

Figure 6 depicts a comparison of frequency cumulative error (FCE) versus absolute relative error for the RF, PSOFIS, GAFIS, and ANFIS models in test and train data. It is possible to calculate the percentage of data that is less or more than the required number. For example, RF and hybrid techniques (PSOFIS and GAFIS) predicted 50% of data with less than 10% error in train data and 75% of data with less than 30% error in test data. In the test phase, PSOFIS predicted 7% for data with more than 60% inaccuracy, whereas GAFIS, RF, and ANFIS predicted 10, 15, and 22%, respectively.

Table 6 Statistical metrics in the train and test phase

The RMSE/d coefficient captures low-error based on the RMSE statistic and high-tendency based on the WI index for the optimal model performance. Thus, the model with the lowest RMSE/WI has the best predictive performance of all the models. Figure 7 shows the values of RMSE/WI ratios for the examined models, which include the RF, PSOFIS, GAFIS, and ANFIS. It can be observed that the suggested RF model has the lowest RMSE/WI value, while hybrid approaches are ranked second, and ANFIS is the third place.

Taylor diagram, which is composed of three statistical variables, including root, mean squared error, index of determination, and standard deviation, depicts the performance of the methods utilized in one basic illustration (Fig. 8).

The RF, PSOFIS, GAFIS, and ANFIS models may be rated from best to worst accuracy predictions based on the test phase findings of the Taylor diagram. PSOFIS, RF, GAFIS, and ANFIS were rated best to worst during the train phase.

The effect of altering \(b_{\text {w}}\) and \(\rho _{x}\) on the proposed models’ conclusions was investigated. The discrepancy ratio (DR), defined as the ratio of predicted and actual values, was used to determine the sensitivity of the proposed model to the \(b_{\text {w}}\) and \(\rho _{x}\) parameters. A DR of one implies complete cooperation, whereas numbers greater (or less than zero) indicate over- (or under-) prediction of the value of \(V_{\text {u}}\). This is significant if the errors are negative; it indicates that the model forecasts the data value less than the true value, which has a negative performance, and large positive mistakes may raise the project’s cost.

Fig. 6
figure 6

The cumulative frequency against the absolute relative error for the models

Fig. 7
figure 7

RMSE/WI ratio for different models in testing phases

Figures 9 and 10 illustrate changes in DR values plotted against \(b_{\text {w}}\) and \(\rho _{x}\) for all models during the train and test phases. Among the models, the RF’s DR errors are within a tolerable range (−0.25 to 1). PSOFIS, GAFIS and ANFIS ranges are (−1.5 to 1.5), (−2.5 to 2.2), and (−2 to 2.5), respectively. As a result, these three models frequently under-predict and over-predict the value of \(V_{\text {u}}\).

For \(b_{\text {w}}\) parameter, PSOFIS, GAFIS, and ANFIS model for \(40< b_{\text {w}} <100\) exhibited over-prediction and under-prediction, while DR values for \(100< b_{\text {w}} <300\) are close to zero that demonstrates a good match between the observed and anticipated \(V_{\text {u}}\).

For \(\rho _{x}\) parameter, ANFIS model for all range of \(\rho _{x}\) and hybrid models for \(0< \rho _{x} < 1\), both have over-prediction and under-prediction. The RF model has consistent behaviour across all \(b_{\text {w}}\) and \(\rho _{x}\) ranges. RF outperforms the other models considered in this study in terms of DR.

Fig. 8
figure 8

Taylor diagram for different models in the train and test phase

Fig. 9
figure 9

DR values of \(b_{\text {w}}\) for different models in test and train data

Fig. 10
figure 10

DR values of \(\rho _{x}\) for different models in test and train data

6.1 Sensitivity analysis

Sensitivity analysis based on variable importance was conducted to decide the most effective parameters in the prediction of shear strength by the RF model. The significance of each variable is determined by examining the increase of prediction error when the OOB data for that variable is randomly permuted while all other variables remain constant (Goudarzi et al. 2014).

After setting the pre-selected optimal values of \({\textbf {n}}_{\text {tree}}\) and \({\textbf {m}}_{\text {try}}\) to decrease the uncertainty, 100 distinct forests were built, and the mean of significance was determined.

The sensitivity analysis for the variable’s relevance is shown in Fig. 11. It can be depicted that the wall thickness (\(b_{\text {w}}\)) and concrete compressive strength (\(f_{\text {c}}\)) have the greatest and least influence on shear strength, respectively. Rebars strength at the vertical direction (\(F_{\text {yv}}\)), the length of the wall (\(l_{\text {w}}\)), and the gross area of the wall (\(A_{\text {g}}\)) all have quite the same importance (around 8%). The next rank of varying importance is the height-to-length ratio (\(h_{\text {w}}/l_{\text {w}}\)), length of the vertical steel reinforcement, (Tran.web), rebars strength at the lateral direction (\(F_{\text {y}}t Trans.\)) and \(N_{\text {u}}\). Finally, the height of the wall (\(h_{\text {w}}\)) and the compressive strength of the concrete (\(f_{\text {c}}\)) have the least impact on shear strength.

7 Discussion and future research perspectives

The results provided therein provide substantial insight into the performance and prediction capacities of several models used to estimate the shear strength of reinforced concrete walls. This section’s goal is to thoroughly examine and compare the results of the RF, PSOFIS, GAFIS, and ANFIS models, as well as to investigate the sensitivity of the suggested models to the parameters \(b_w\) and \(\rho _x\).

Scatter plots were used to evaluate data distribution and variation to calculate prediction accuracy. The results indicate that all models forecast shear strength with varying degrees of accuracy. The perfect equity line (P=O) and lines representing +20% and −20% errors are provided for reference.

In the training set, RF, PSOFIS, GAFIS, and ANFIS all perform well, with PSOFIS having the greatest \(\hbox {R}^2\), RMSE, and MAE values. RF, on the other hand, emerges as the superior model in the testing phase, with the greatest \(\hbox {R}^2\) (0.9351) and the lowest RMSE (151.90), as well as the lowest MAE (111.53). GAFIS and ANFIS are ranked second and third in terms of total prediction accuracy, respectively.

Additionally, the FCE study sheds light on the models’ ability to anticipate various degrees of error properly. RF and hybrid approaches (PSOFIS and GAFIS) outperform in the testing phase, successfully predicting a considerable percentage of data with less than 10 and 30% inaccuracy, respectively. ANFIS, on the other hand, reveals higher percentages of data with larger errors.

Based on the RMSE/WI ratios, in terms of overall accuracy, hybrid techniques (PSOFIS and GAFIS) rank second, while ANFIS trails behind.

The Taylor diagram, which combines numerous statistical variables, gives a succinct picture of the models’ performance. Based on the test phase results, the accuracy forecasts of RF, PSOFIS, GAFIS, and ANFIS may be ordered from best to worst, with PSOFIS and RF beating the other models during the training phase.

The suggested models’ sensitivity to the parameters \(b_w\) and \(\rho _x\) is also investigated. The discrepancy ratio (DR) is used to assess how well the models respond to changes in these parameters. The range of DR values for RF is consistently smaller (-0.25 to 1), indicating more dependable predictions. PSOFIS, GAFIS, and ANFIS have wider ranges, indicating a tendency to both underestimate and overestimate shear strength values.

Regarding the parameter \(b_w\), it is observed that PSOFIS, GAFIS, and ANFIS models exhibit over-prediction and under-prediction for \(40< b_w < 100\), while DR values tend to approach zero for \(100< b_w < 300\), indicating a better match between observed and predicted shear strengths.

ANFIS reliably over-predicts and under-predicts the parameter \(\rho _x\) over the full range, whereas hybrid models exhibit comparable performance for \(0< \rho _x < 1\). In contrast, the RF model consistently outperforms the other models in terms of DR throughout all \(b_w\) and \(\rho _x\) ranges.

Fig. 11
figure 11

Sensitivity analysis of parameters

The current study on determining the shear strength of reinforced concrete walls lays a solid platform for future research in this area. The data and technique given in this paper suggest various future research areas. We may provide light on the relevance of this study and its implications for future improvements.

  • Incorporation of additional input features: To estimate shear strength, the current study relied largely on a set of input features. Additional significant elements, such as concrete mix design parameters, reinforcing details, curing conditions, and environmental influences, should be investigated in future studies. By adding these elements, the prediction capacities of the models may be improved, resulting in more accurate shear strength calculations.

  • Expansion to other types of concrete elements: This study concentrated on reinforced concrete walls. The created models, however, may be applied to different types of concrete parts such as beams, columns, slabs, and joints. Investigating the models’ applicability to these elements would help to a more thorough knowledge of shear strength prediction in diverse structural components.

  • Experimental data inclusion: The current investigation relied on both predicted and experimentally observed shear strength data. However, future studies may concentrate on gathering more experimental data from a broader range of test specimens. Integrating such experimental data into models can improve their dependability and generalizability.

  • The current study focused on reinforced concrete walls with precise material attributes and mixed proportions. Future research can look into how the established models can be used in diverse concrete materials, such as high-strength concrete, self-compacting concrete, and fibre-reinforced concrete. This would provide a more comprehensive knowledge of shear strength prediction across different concrete compositions.

  • Validation on real-life structures: To verify their practical feasibility, the created models may be validated on real-life reinforced concrete structures. Comparing projected shear strength values to actual measurements from existing structures would give useful information into the accuracy of the models and potential areas for improvement.

  • Design guidelines development: Accurate calculation of shear strength in reinforced concrete buildings is critical for design and assessment. Future research might focus on providing design guidelines or suggestions for professionals and engineers based on the findings of this study. These recommendations would make it more straightforward to choose appropriate models and give insights into the aspects that influence shear strength in reinforced concrete components.

8 Conclusions

While RCWs proved their feasibility as a most effective lateral resisting system for disturbances such as wind and seismic loads in civil structures, due to their diverse structural configurations, different loading situations, and the complex nonlinear relations between the input variables and the structural response, predicting their shear capacity is rather complicated. In this paper, the authors deploy different soft computing-based modelling approaches (RF, ANFIS, PSOFIS, and GAFIS) for predicting the shear capacity of RCWs. Based on this work, the following conclusions can be drawn:

  • Current design codes have substantial uncertainty and do not completely reflect the effect of major design factors on shear strength in a logical way. The increased model variability vs the main design variables computed findings in this study for these approaches explicitly state this.

  • The parametric analysis and different analytical measurements show that the developed innovative hybrid model addressed the impacts of the main design factors as well as the highly nonlinear relationship between the input design requirements and the projected shear capacity adequately.

  • The suggested models might be advanced further to give specifications with higher prediction reliability and resilience for this realistic albeit complicated design challenge, therefore improving design precision.

  • When compared to other techniques, the RF with the highest \(\hbox {R}^2\) (0.9351), lowest RMSE (151.89), and the best PI (0.81) was more accurate. It is also possible to conclude that RF is superior since its DR values are nearly equivalent to zero. Additionally, in terms of RMSE/WI, RF has the lowest value, making it more accurate than other models.

  • Sensitivity analysis reveals that the thickness of the wall (\(b_{\text {w}}\)) is the most important element influencing shear strength.

  • PSOFIS and GAFIS (hybrid models) are more competent than ANFIS with the conventional optimization algorithm of enhancing the MF of fuzzy inference systems.

  • There is a need to investigate new metaheuristic optimization algorithms that may be better adjusted for integration with the FIS model.