1 Introduction

Deep pile foundation has become one of the most preferable options for high-rise buildings and transport infrastructure (e.g., bridges and heavy freight rail tracks) around the world as it can carry massive loads in a relatively limited area. While it is well understood that the axial bearing capacity of a pile is mainly contributed by its shaft friction and base (or toe) resistance, immense effort has gone to establishing different methods to estimate bearing capacity of piles over the past years. Many solutions are based on complex mathematical derivations and numerical analysis [14, 27, 38, 41], while others employ empirical equations derived from field tests such as Standard Penetration Test (SPT value) [19, 30, 49], Cone Penetration Test (CPT) [22, 36, 67] and field static/dynamic load tests [10, 20, 30]. Despite these various solutions, a significant limitation that most conventional approaches commonly share is their limited consideration of past experience as well as data. Indeed, empirical parameters which are normally determined based on experience of the designers are often used to optimize predictions [11, 20, 23]; however, there is usually a lack of systematic method to compute these parameters properly. For example, the effect of displacement on the ultimate bearing capacity of piles is normally evaluated through field load tests, which are usually time-consuming and highly resource demanding, while empirical methods to ease this process have not been established very well. This context thus remains a need for novel approaches that can incorporate experience-based factors as well as the valuable existing data, and thereby generate user-friendly processes for enhanced practical designs.

In recent years, the use of data-based techniques such as machine learning (ML) to predict geotechnical issues has received increasing attentions [26, 64, 68, 72]. Of numerous emerging ML techniques, artificial neural networks (ANN) is the most common approach to develop forecasting models of various geotechnical issues such as slope stability [40], soil properties [29, 48], bearing capacity [7, 28], deep excavation [2, 71], mining [12, 57], tunnelling [59], jet grouting [58], among others. As strictly developed based on existing data, the quality and size of data play a pivotal role in building ANN models. Nevertheless, one of the major issues in the past ANN modelling of pile foundations was the use of relatively limited data (i.e., common number of trained data points < 175) covering a variety of pile shapes, materials and construction methods (see Table 1). This can lead to low accuracy and/or reliability when applying these ANN models to real projects because the new inputs can easily be out of the trained range due to data scattering. The limited data also allows only one or several subsets of training and testing data while establishing the model, thus hindering our understanding of the effect that data randomness can have on the confidence of predictions. In addition, Table 1 shows that past studies mainly addressed small and medium piles, i.e., diameters < 1.0 m and length < 45 m, whereas larger piles were hardly found. In fact, large and long piles are commonly employed in alluvial and coastal regions where soft soil is usually a significant barrier to the infrastructure development [47]. Therefore, an ANN model established based on a considerably large database of super-large and -deep piles in soft soil becomes essential.

Table 1 Typical past studies using artificial neural network (ANN) model for pile foundations

A common issue across past ANN models of pile foundations was the use of CPT data to estimate the shaft and base resistances of piles for input parameters while modelling [4, 6, 43, 45, 51]. Because the CPTs are very different from the actual working condition of piles, especially for large and long piles (e.g., the ratio of diameter and displacement, penetration speed, the stress level along the shaft), the pile shaft and base resistances computed from the sleeve and tip values measured by CPTs can deviate significantly from their real values [63]. Further, the soil deformation based on CPTs is much larger than that of piles, while failures of soil based on CPT data do not represent very well the ultimate state of piles under loading, resulting in inaccurate understanding of the mobilizations of base and shaft resistances in pile foundation. For example, Fellenius (2015) [22] indicated that the pile toe resistance calculated from CPT results can considerably overestimate the actual value measured in field tests. Besides, CPTs are normally limited up to 50 m depth [21, 62], whereas large bored piles can easily exceed this depth, causing a lack of method to estimate the shaft and base resistances for these long piles. These limitations of past ANN models require a more rigorous determination of influencing factors on the bearing capacity of piles that can be used as the key input parameters for establishing the model.

The major motive of this study is to overcome the above limitations by using a large database of pile foundations to examine and advance the application of ANN in predicting base resistance of super-large and -deep piles, which was not available in any past ANN models of pile foundation. The key innovations are the random selection of training and testing datasets derived from 1137 data points (86 different piles) across 37 real projects over the past 10 years in Mekong Delta and nearby areas in Vietnam (Fig. 1). The equivalent pile diameter varied from 1.0 up to 2.5 m, while the embedded depth of these piles was from 40.2 to 99 m with the test load up to 10,910 tons, all of which were much larger than those used in previous ANN models for piles. A simple approach based on combining multiple learning rates is also proposed to enhance training process. Further, the outcomes yielded from the established ANN models are then innovatively employed to determine the contribution that different factors can make over the base resistance, which advances our knowledge of pile foundation. The study will also provide empirical equations, which do not exist in literature as well as state of the practice, to assist practical designs. It is noteworthy that although advanced ML combined with optimization techniques can be used, this study focuses on the most fundamental concept of ANN model due to its simplicity and thus a wider range of practical implications.

Fig. 1
figure 1

Location of pile foundation collected in the current study

2 An overview of super-large and long piles used in Mekong delta

2.1 Geological condition

Mekong Delta is a weak geology region in Vietnam with a very thick mud deposit (Fig. 2a) from the Mekong River. Located at the margin region of the Mekong delta and downstream of Sai Gon River, the geology of Ho Chi Minh City is extremely complicated with a very thick layer of soft to very soft clay, ranging from 6 to 30 m below the ground surface as described in Fig. 2b, resulting in considerable challenges for designing foundation. This clayey soil has an average void ratio e0 = 2.2, water content wn = 80% and plasticity index PI = 53 [32]. The liquid and plastic limits are around 89% and 36%, respectively; thus they are well classified as CH type. The unconfined compressive strength (qu) of this soil varies from 20.3 to 49.1 kPa while the SPT value (N) is only around 0–2. Medium and dense sands are about 3 m thick and distributed widely from 10 to 60 m depth. The SPT value of this soil ranges from 8 to more than 25 blows/ft [33]. Underneath this layer is a 1.5-3 m-thick layer of stiff to very stiff clay where qu varies from 80 to 250 kPa and SPT value is between 25 and 50 blows/ft. Followed this layer is a dense fine sand with the SPT value ranging from 30 to greater 70 blows/ft.

Fig. 2
figure 2

Typical geological cross section of a Mekong Delta, b Ho Chi Minh City

2.2 Characteristics of pile foundation in Ho Chi Minh city

Most pile foundation for high rise buildings in Ho Chi Minh city is often designed to carry heavy load, i.e., from 800 to more than 3000 tons. The test load is usually required to be 2–3 times larger than the design load, resulting in a pile load in test cases of up to 9000 tons. For this very large magnitude, the bored piles are normally required to be installed up to the stiff soil layer which is located from 60 to 100 m depth. In addition to this very deep installation, the cross sections of these piles must be large enough to meet the requirement of pile slenderness. Generally, there are two common types of piles being used in this region; they are bored piles with diameter from 1 to 2 m and rectangular barrette piles in the range 0.6 m × 2.4 m to 1.2 m × 2.8 m (equivalent to 2.5 m in diameter considering the same perimeter). Given this range of diameters plus an installation depth varying from 40.2 to 99 m, these piles can be classified as super-large and -long piles compared to the normal range, i.e., 0.8–1.3 m in diameter and 10-40 m in length of large piles commonly used in past studies [1, 4, 8, 18].

Static load test (SLT) and O-cell load test (OLT) are currently the two most common field tests used to examine the design load and determine the ultimate load capacity of piles. The SLT needs large space for setup, while its test load is normally limited up to 4,500 tons. Therefore, for test loads exceeding 4,500 tons, the OLT is often adopted as it can be conducted in narrow space for very long piles. To measure the development of the shaft and base resistances according to loading point displacement, strain gauges are attached to steel reinforcement bars in the cages along the piles; however, their positions varied depending on the distribution of soil layers as depicted in Figs. 3a and 4a. Approximately, 3 to 9 strain gauges are installed at each cross section, depending on its shape and dimension. When the load is applied, displacement at the loading point is recorded over time. The axial forces are then computed based on the measured strain of steel bars under loading. The difference in axial forces between two adjacent cross sections is actually the total shaft resistance that the surrounding soils act on the pile segment. Furthermore, strain gauges are installed at the tip cross section which is 0.5–1.0 m above the pile toe to measure the base resistance. The two following bored and barrette piles undergone SLT and OLT (Figs. 3 and 4) are described as representatives for the test pile data used to develop ANN model in the current study.

Fig. 3
figure 3

Lakeside project: a soil profile and static load test of bored pile TP2, b static load–displacement, c axial load distribution, d displacement-mobilized base resistance

Fig. 4
figure 4

Landmark 81 project: a soil profile and O-cell load test of bored pile TP2, b O-cell load–displacement, c axial load distribution, d displacement-mobilized base resistance

Figure 3 shows a large diameter bored pile (1.2 m in diameter and 80 m in length) named TP1 at Lakeside project in District 7, Ho Chi Minh City. Soil profiles and strain gauge arrangement are presented in Fig. 3a. The pile toe was placed into medium dense sand with the SPT value of 31. In this pile, strain gauges were distributed at 10 layers along the pile, while the deepest strain gauge section was set up at the depth of 79.3 m. The ultimate load was designed at 1100 tons, and the test load was approximately 2750 tons. The load–displacement curves of three loading and unloading cycles are represented in Fig. 3b which shows the pile load reaches 2750 tons at the largest vertical displacement of 49.46 mm under the 3rd loading cycle. The axial load distribution and mobilized base resistance of the pile at each loading levels are represented in Fig. 3c and d, respectively. The data show that majority of the shaft friction is distributed along the upper half of the pile, i.e., around 45 m depth from the ground surface, whereas it drops apparently over the lower half of the pile. The larger the test load, the more non-uniform distribution the axial load along the pile. As the displacement reaches the largest level of 49.46 mm, the base resistance increases to 3853 kPa. These results mean that the contributions that the shaft friction and base resistance make over the total ultimate bearing capacity need to be estimated with respect to the test load and the displacement variations, especially for large-diameter and long piles.

Figure 4 shows a typical barrette pile under O-cell load test, i.e., TP2 in the 461 m-high (81 floors) Landmark 81 building which was the highest building in Vietnam (i.e., one of top 10 tallest buildings in the world constructed in 2018). This pile had a cross section of 1.0 m × 1.8 m with its toe in dense sand at a depth of 80 m. Strain gauges were installed at 12 different levels along the pile as presented in Fig. 4a. The strain gauges to measure the base resistance were installed at a distance of 1.0 m from the pile toe. It is noteworthy that unlike static load test, the loading point in this pile test was located at the O-cell which positioned at a depth of 65 m from the ground surface as shown in Fig. 4a. The design and test loads for this pile were 4000 and 9319 tons, respectively. Figure 4b shows the load–displacement curves where the largest downward and upward displacements at loading point are 17.27 mm and 18.21 mm, respectively. Figure 4c shows the axial load distribution in the pile with the applied load at the O-Cell point. Mobilization of the base resistance according to the downward displacement at the loading point is reported in Fig. 4d. The base resistance reached 2794 kPa at the displacement of 17.27 mm.

3 Artificial neural network model and data collection

3.1 Artificial neural networks (ANNs)

A typical ANN model includes the input, hidden and output layers that imitates the information processing system of the nervous system in the human brain to identify and predict characteristics of a problem based on its given dataset [9]. In all these layers, the nodes (i.e., neurons) are reasonably arranged and communicate with each other through weighted connections. Each neuron adopts an activation function that aims to receive information from prior neurons, process and transmit signals to subsequent neurons or output of the network. Performance of an ANN model can be evaluated by comparing the predicted results with real data via several assessment factors such as the coefficient of determination (R2), mean squared error and absolute errors [43, 45, 55]. Back-propagation is the most common algorithm widely used for training ANN models [16] due to its simplicity while ensuring high degree of accuracy in predicting data behaviour. The current study also adopted the back-propagation approach, while the gradient descent was employed to optimize the error and obtain the global minimum.

3.2 Data collection and features

The current ANN model was developed based on a database consisted of 1137 sets which were collected from load tests of 86 super-large and long bored piles at various projects in Mekong Delta, Vietnam. Basic information such as the location, size and loading parameters of these piles is summarized in Table 2. These collected piles had different cross-sectional dimensions, ranging from 1.0 to 2.0 m in diameter for bored piles and 0.6 m × 2.4 m to 1.2 m × 2.8 m in rectangular shape for barrette piles. The embedded length varied from 40.2 to 99 m and the test load was up to 10,910 tons by O-cell load test and 4513 tons by static load test. The distribution of pile dimension including the cross section and embedded length is depicted in Fig. 5, while a summary of the data range of the piles is provided in Table 3. Apparently, the current database of piles is much more comprehensive and rigorous compared to most past studies using ANN models to predict bearing capacity of pile foundations [4, 6, 42, 43, 51, 54, 61]. Moreover, all the piles are ranged from large to super-large size including only two types, i.e., 75% bored piles and 25% barrette piles (Fig. 5a). These unique features of the current database will ensure a more robust and reliable development of ANN model to predict behaviour of pile foundation.

Table 2 Summary of pile data used in the current ANN model development
Fig. 5
figure 5

Distribution of dimensional range of the piles: a the cross section, and b the embedded length

Table 3 The range of pile features

The average SPT values at pile toe were used to feature soil properties. It is important to consider the failure zone of soil below the pile toe to obtain the average SPT values. Meyerhof (1976) [41] recommended the failure zone to be 10D above and 4D below the pile toe (D is the equivalent diameter of pile). According to [19], the failure zone under pile toe of small diameter piles was in the range of 2D to 8D depending on physical properties of soil layers above the pile toe. For lager diameter piles, the failure zone extends to 1D below pile toe as proposed by [5]. In this study, the average SPT values in 1D zone below the pile toe was used because of the large diameter bored piles. The range of SPT values is shown in Table 3.

4 Model development

4.1 Model inputs and outputs

As the main objective of this study is to predict the mobilized base resistance of piles based on ANN approach, the input parameters are identified as those directly influence behaviour of the base resistance considering the load test data (e.g., Figs. 3 and 4). It can be determined that the base resistance mainly varies with 5 major different parameters which are the applied load (P), displacement of loading point (Dp), axial stiffness (AE), SPT values (N) of the soil beneath the pile toe and the distance from loading point to the pile toe (Lp). The pile test data (e.g., Figs. 3 and 4) show that increasing the applied load P and the corresponding displacement Dp will cause the base resistance to increase; thus they were considered as two primary input parameters in the current model. It is apparent that stiffness of the soil beneath the pile toe will play an important role in governing the mobilized base resistance, i.e., the larger the SPT value, the larger the base resistance. In other word, the SPT value must be included in the influencing factors while establishing the ANN model. Furthermore, the distance between the loading point to the pile toe is a crucial parameter because varying this distance will result in different load distribution along the pile, thus affecting the base resistance. Figure 6 shows that the mobilized base resistance (i.e., Qb/(P/A)) increases when the distance Lp decreases. In particularly this ratio rises swiftly from around 10% to more than 30% when Lp becomes less than 10 m. This observation, in fact, corroborates well previous data for large diameter bored piles [1, 10, 17, 38, 44]. The longer the distance, the harder the base resistance can be mobilized. Note also that for bored piles, the distance between loading point and pile toe is actually the length of the pile as the loading point is located at the pile head, whereas this distance is the O-Cell point to the pile toe. The distribution of Lp in the current study is shown in Fig. 7.

Fig. 6
figure 6

Mobilized base resistance of bored piles according to different distance between loading point and pile toe at the displacement of around 20 mm

Fig. 7
figure 7

Distribution of distance between loading point and pile toe

It is important to note that the effect of pile size was included through the axial stiffness which was computed based on Young’s modulus of the pile material and the cross-sectional area [35]. In addition, the pile length is reflected via the distance between the loading point and the pile toe, as described earlier. The shaft friction between soil and pile can certainly affect the load distributed at pile toe, which varies with soil properties along the pile. For simplicity, the current study only considered the contribution of base soil to the load distribution, while the effect of friction that the soils around piles can have been assumed to be included through the displacement–load characteristics incorporated in the current model. Unlike traditional methods where specific relationship between individual parameters needs to be quantified, ANN approach can capture and quantify the influence of parameters on each other via their multiple forth and back interactions designed through its network and algorithm.

4.2 Data division and reprocessing

In the current model, 1137 datasets were used for developing and testing the ANN model. The available data is randomly divided into two subsets including a training dataset for model calibration and an independent testing dataset for model verification. Specifically, 966 data points (85%) were used for training and the remanding 171 data points (15%) were used for testing. The major innovation in the current data processing was that the study constantly repeated the training and testing process for 250 different random selections from the entire database and then the probability analysis was applied to evaluate the results. Most previous studies assumed homogenous data and randomly selected training and testing datasets at only one or few times [4, 45, 46, 51, 55], resulting in incomplete understanding of how the random distribution of data can affect the prediction outcomes. Specifically, the questions of whether the established model would provide identical predictions over different subsets of training and testing data, and how this random division can affect the prediction confidence have not been clarified. Indeed, for any data collected from real construction sites, the variation in data properties is usually significant, requiring an investigation on the probability of outcome distribution while varying data selection.

The input and output variables are scaled in a unit range to eliminate their dimension and ensure that all variables receive equal attention during training [54]. The scaling range is normally chosen with respect to the ultimate limits of activation function used in the hidden layers, for instance, the scaling range between -1 and 1 when the tanh function is used, while the range from 0 to 1 is normally used for the sigmoid function. While there might be some advanced activation functions developed in recent times (Shen et al. 2022) [60], our preliminary investigations on 4 common activation functions including Sigmoid, Tanh, ReLU and Leaky ReLU showed that the Sigmoid function provided the most accurate and relevant outcomes (i.e., R2 = 0.96 in training phase), given the fundamental form of ANN. The Sigmoid function was hence adopted in this study with scaling process given by:

$${x}_{n}=\frac{x-{x}_{\mathrm{min}}}{{x}_{\mathrm{max}}-{x}_{\mathrm{min}}}$$
(1)

where xn, xmin and xmax are the normalized (scaled), the minimum and maximum values of variable x.

4.3 Determination of model architecture

The performance of an ANN model essentially depends on its designed network architecture, therefore, defining the optimum network architecture is vital to establish an effective ANN model. The network architecture includes the number of hidden layers associated with their nodes and optimum weights. Hornik, Stinchcombe, and White [31] indicated that an ANN model with one hidden layer can approximate any continuous function. In designing ANN architectures, Lawrence [37] recommended that increasing the number of hidden layers should be the last option, whereas optimizing the number of hidden nodes (Nhn) can enhance performance of the ANN model more effectively. The optimal Nhn depends on the numbers of input and output variables as proposed in previous studies (Table 4). However, the predictive performance of an ANN model is commonly evaluated through the coefficient of determination (R2) as well as the root mean squared error (RMSE), therefore, the trial-and-error approach [54] can be used to determine the best predictive performance. In other words, the current study examined various network architectures with different numbers of hidden nodes using the same input and output data to find out the best architecture. This optimum architecture was then applied to different random sets of data to investigate the probability distribution of predicted outcomes.

Table 4 The recommended numbers of hidden nodes in past studies

In this work, an ANN model with one hidden layer was adopted for network construction as presented in Fig. 8, while Nhn were varied from 5 to 40 nodes to determine the optimal network architecture. Several previous studies [2, 53] were based on the error of training datasets to select optimal network architecture; however, it is noteworthy that the error in testing or validating datasets is always the main target of a predictive model. An optimal model does not only have the capacity of minimizing the error during testing data due to overfitting or underfitting, but also has the simplest structure (i.e., less input parameters as well as nodes) for reduced computational cost. Figure 9 presents the influence of the Nhn on the predictive performance of the ANN model. For the simple networks in which the Nhn is less than 10, the largest value of R2 in training and testing datasets were relatively small, i.e., only 0.889 and 0.805, respectively. When Nhn increases and exceeds 30, the value of R2 in training datasets rises considerably to 0.969 and 0.983 for Nhn = 30 and 40, respectively. In the meantime, the value of R2 in testing datasets drops significantly from 0.899 to around 0.8 when Nhn increases from 20 to 40, indicating the over-fitting of the model when Nhn is too large. This means that the network with 20 hidden nodes where the values of R2 in the training and testing data were large and close to each other can be considered as the best performing model. In fact, the time for training the 40-hidden node system was also approximately double the time used for 20-hidden node computation; thus the network with 20 hidden nodes was selected for further analysis in this study. This, in fact, disagrees with the previous empirical equations shown in Table 4.

Fig. 8
figure 8

Architecture of the current ANN model

Fig. 9
figure 9

Variation of coefficient of determination (R2) with different hidden nodes

4.4 Criteria of termination

During training process, it is crucial to decide when to stop the training as this can critically affect the performance and reliability of an ANN model. Therefore, it requires rigorous training strategy and supporting techniques to avoid over-fitting which usually happens if model training is excessive. In contrast, the insufficient model training will lead to inaccurate outcomes and/or under-fitting. There are many techniques to determine when to stop training process as discussed in previous studies [56], meanwhile cross-validation techniques can be used to mitigate over-fitting issue [46]. In the current work, the coefficients of determination R2 calculated from the actual and predicted values of pile base resistance in both training and testing datasets were considered to terminate the training process. The training dataset was then used to modify the connection weights, while the testing dataset was used to evaluate performance of the trained network. The values of R2 at various loops in training and testing data were obtained, and the training process was terminated only when the R2 value of testing data was unchanged or begun to decrease.

4.5 Optimizing learning rate for enhanced training process

The initial connection weights of the model were randomly generated in the range from 0 to 1 and the optimal weights were obtained by training the network to reach the lowest prediction error. The training cycles to achieve termination criteria and the lowest prediction error depended on the learning rate (LR), which determined the size of the corrective steps that the model can take to optimize the error in each back-propagation. Shahin [54] shows that an LR of 0.2 is normally the minimum for an accepted error in predicting driven piles and drilled shafts. Alkroosh [3] indicated the optimum LR of 0.08 for minimal prediction errors through training their ANN model using a LR ranging from 0.05 to 0.6. A high LR can shorten the iteration number of the training process, but can be difficult to achieve the global minimum. On the other hand, a smaller LR can make the training time-consuming and become stuck in the local minimum. In this study, the trial-and-error approach was used to determine the appropriate values of LR for the network training. An LR in the range from 0.005 to 0.05 was assumed for training the ANN models consisting of 20 hidden nodes, while the initial weights matrix was kept identical across different cases.

The training results using different LR values are shown in Fig. 10. The network training did not converge when LR > 0.05. When LR was large, i.e., from 0.03 from 0.05, the convergence rate of training was great in the first cycles, resulting in an R2 value of 0.942 within the first 2 million loops. However, the convergence rate significantly dropped in later cycles, causing R2 unchanged at 0.952 despite the training running through more than 4 million additional loops. For the lower values of LR, i.e., 0.005 and 0.01, the convergence rate in the first 2 million training cycles was medium to achieve an R2 value of 0.931 and 0.907, respectively. This rate remained unchanged in the next 3 million training episodes with R2 rising to 0.959 and 0.957, respectively. In the later loops, R2 increased slowly and reached the largest value of 0.966 after around 10 million loops. It is noteworthy that the smaller the LR value, the larger the ultimate level of R2. Figure 10 shows that the LR of 0.01 (blue line) was the optimal value for training the current network as it needed less training loops to reach the largest value of R2.

Fig. 10
figure 10

Variation of coefficient of determination (R2) with the number of training loops by various learning rate (LR) values

Many past studies [3, 45, 46, 54] used only one value of LR in the network training, which can lead to the aforementioned limitations such as excessive training time and local minimum. A recent review study [70] shows that a fixed value of LR does not always work well. To overcome these problems, a relevant combination of multiple LR values in the training process was investigated in this study. Specifically, high LR values (i.e., 0.05 and 0.03) were employed in the first training episodes to quickly achieve the near global minimum on the error surface. In the next cycles, lower LR values (i.e., 0.01 and 0.005) were used to reach the lowest global minimal. The results shown in Fig. 10 indicated that the proposed combination significantly improved the training time. The network training obtained the largest value of R2 = 0.966 only after 6 million loops compared to 10 million loops needed for training the network with a constant LR value (i.e., LR = 0.01 and 0.005). This approach is more effective with the slope of error reduction maintained at large corrective steps and thus can be well considered while designing an ANN model.

5 Results and discussion

5.1 Performance of ANN model

The reliability of ANN models in predicting the mobilized base resistance of the current piles is presented in Fig. 11. It can be seen that the random division of database into training and testing sets has a huge influence on the obtained results. Different cases of random data division resulted in different degrees of accuracy despite the same model features (e.g., model architecture, multiple learning rates of 0.05–0.03–0.01–0.005 and criteria of termination) being used. In 250 investigated random cases, the optimal R2 values of training and testing datasets varied in a wide range, i.e., from 0.91 to 0.98 and 0.80 to 0.96, respectively. This proves that the investigation on the prospective range of predicted outcomes that the random data division can result in is vital to the confidence of using ANN model in practice. One cannot conclude about the predicted results with only one or few random cases, for example, R2 in the testing cases can drop to low level, i.e., 0.8 that requires a serious attention.

Fig. 11
figure 11

Variation of coefficient of determination (R2) of training and testing datasets according to random data division

Figure 12 shows the probability distribution of the computed R2. In 250 random cases, the most common values of R2 in the training and testing datasets were around 0.96 and 0.89, respectively, which can be used as representative to evaluate the ANN model performance. The obtained random results indicate that the probability distribution of the R2 error in training and testing datasets is similar to a left (negative)—skewed distribution [15, 65] where the left side of the peak R2 receives more distribution compared to the right side. This is understandable as it would certainly become harder to achieve a precision that is larger than the peak level (e.g., 0.96 in training), especially for the present real and large database, resulting in a drop in distribution on the right side. The distribution of R2 is more uniform in the testing data (Fig. 12b); in fact, most R2 values are allocated around 0.89 (from 0.87 to 0.91). However, the number of predictions with R2 value > 0.91 decreases more apparently compared to the opposite side (i.e., R2 < 0.87). Besides, the red solid lines in Fig. 12 demonstrate the cumulative probability of the R2 value, which shows that R2 develops more steeply in training compared to testing data. Overall, for a required R2 > 0.85 with a confidence of 80% in testing cases, the current ANN model can be considered acceptable, however for a higher degree of confidence, more effort is certainly required.

Fig. 12
figure 12

Distribution of R2 and cumulative probability of 250 random cases: a training dataset, b testing dataset

Figure 13 shows representative predicted base resistance using the current ANN model (one hidden layer, Nhn = 20 and multiple LR = 0.05 + 0.03 + 0.01 + 0.005 for this computation) in comparison with the measured values. The obtained R2 values are approximately 0.96 for training data and 0.89 for testing, indicating an acceptable accuracy of the model. Most of the predicted points agree relatively well with the measured values, while a few data points (i.e., around Qb = 1000–1200 kPa) show significant deviation. This can be attributed to a certain data inconsistency that usually happens with data collection at real sites, especially a large amount of data used in the current study. The results proved that the proposed 5 input parameters, i.e., the load (P), displacement of loading point (Dp), axial stiffness (AE), SPT values (N) of the soil, distance from loading point to the pile toe (Lp) were relevant to develop an ANN model to estimate the mobilized base resistance of bored piles.

Fig. 13
figure 13

Base resistance of piles predicted by ANN model and measured in the field: a training dataset, b testing dataset

5.2 Influencing factors to the base resistance of piles based on the prediction outcomes

Sensitivity analysis was carried out to identify the influence of individual input variables on the predicted results as well as the performance of ANN model. Garson [24] firstly, proposed a simple technique, namely weights method, to determine the relative importance (RI) of the input variables in one-hidden layer neural network by examining the connection weights of the trained network. The weights method fundamentally involves a procedure of partitioning the hidden—output connection weights of each hidden neuron h into components associated with each input neuron i as shown in Eq. (2). The detailed procedures and algorithm of this method can be found elsewhere [25], which can be described by:

$${RI\left(\%\right)}_{i}=\frac{\sum_{h=1}^{nh}{Q}_{ih}}{\sum_{h=1}^{nh}\sum_{i=1}^{ni}{Q}_{ih}}\times 100$$
(2)

in which

$${Q}_{ih}=\frac{\left|{W}_{ih}\right|}{\sum_{i=1}^{ni}\left|{W}_{ih}\right|} \mathrm{with} \,{W}_{ih}={w}_{ih}\times {w}_{ho}$$

where o denoted the output neuron and w is the connection weights.

Figure 14 shows the results obtained from the sensitivity analysis. It is interesting that the displacement has the most dominant effect (i.e., 28.3% relative importance to the predicted outcome) on the mobilized base resistance followed by the load-pile toe distance and the SPT value of soil beneath the pile. The load P has the least significant influence with only about 10.5% relative importance. This can be explained by the fact that the magnitude of displacement had included both the effects of the load P and the soil–pile interaction, thus its role was exacerbated, whereas the contribution of the load became less significant. It can be seen that the distance Lp also has a great impact on the base resistance, which suggested the importance of considering this parameter when estimating the pile base resistance. The deeper the pile is, the harder the base resistance can increase. SPT value of the soil at the pile toe has a medium effect on the base resistance compared to other factors. This is understandable as most of large diameter bored piles were based on stiff soil layers where the SPT values were large and in relatively similar range.

Fig. 14
figure 14

The relative importance of input variables for the base resistance

5.3 Empirical estimate of displacement and mobilized base resistance

In order to facilitate the practical application of the current model outcomes, all the data points representing the relationship between the mobilized base resistance and the loading point displacement were combined and their empirical relationships were developed. Specifically, empirical relationships between the loading point displacement Dp (mm) and pressure P/A (ton/m2) were established differently for the static and O-cell load tests based on the measured data. The hyperbolic function which is widely used to describe the nonlinear behaviours of pile load–displacement curve [38, 69] was adopted. The fitting curves are shown in Fig. 15a for static load tests and Fig. 15b for O-cell load tests. These expressions of the pile pressure (P/A) are depicted in Eqs. (3) and (4) for static load and O-cell load tests as follows.

Fig. 15
figure 15

Empirical correlations between the loading point displacement Dp (mm) and pressure P/A (ton/m2) of large diameter bored pile: a for static load tests, b O-cell load tests

$$\frac{P}{A}=\frac{2175{D}_{p}}{20.5+0.39{D}_{p}}$$
(3)
$$\frac{P}{A}=\frac{4000{D}_{p}}{19.5+0.81{D}_{p}}$$
(4)

Although there is a certain deviation between the results estimated by the proposed equations and field data, they are a fast approach that enables practical engineers to roughly estimate the pile load with respect to a given pile displacement.

Consequently, the above empirical relationships were employed to estimate the vertical displacement of the load point which was then be used to estimate the base resistance using the validated ANN model. The resulting charts representing the variation of the base resistance with different degrees of point load displacement and the distance Lp are shown in Figs. 16 and 17 for bored piles and barrette piles tested by the static and O-cell loading methods. It is noted that these charts describe the common practical context of piles where 3 typical parameters including the embedded length, cross section and SPT values (N) of pile are usually considered. For piles tested by static load, the displacement less than 50 mm was employed to investigate the mobilized base resistance by the trained network. The number of data points with the displacement higher than 50 mm were relatively small (Fig. 15a), so the prediction using the trained network for displacement larger than 50 mm can reduce the reliability. Similar consideration was also applied to the piles tested by O-cell load where the vertical displacement of loading points was less than 20 mm (Fig. 15b).

Fig. 16
figure 16

Base resistance mobilized according to loading point displacement in cases of static load test: a bored pile D = 1.0 m, b bored pile D = 1.2 m, c bored pile D = 1.5 m

Fig. 17
figure 17

Base resistance mobilized according to loading point displacement in cases of O-cell load test: a bored pile D = 1.5 m, b barrette pile 0.8 × 2.8 m, c barrette pile 1.0 × 2 m

The obtained results in Figs. 16 and indicate that the mobilized base resistance considerably depends on the displacement and embedded length of piles. There are nonlinear relationships of the pile base resistance with the pile head displacement. For those piles tested by static load and the pile head displacement lower than 20 mm during the innital loading steps, the base resistance was hardly mobilized; in fact, its incremental rate only begins to increase when the displacement > 20 mm. The mobilized base resistance increases rapidly as the displacement exceeds 25 mm. Besides, the longer the pile, the slower the mobilization of base resistance considering the same magnitude of displacement. At a similar displacement of 40 mm, the piles installed at 10 m deeper would result in 10%-20% lower mobilization of base resistance. For the piles tested by O-Cell load, the distance from loading point to pile toe was significantly shorter, the base resistance begins to increase signficantly when the displacement > 5 mm, which occured much earlier compared to the cases under static loading. Figure 16c and 17a show that for bored piles with D = 1.5 m, the base resistance at a vertical displacement of 16 mm in O-cell load test equals to that of 50 mm displacement in static load test. This indicates a huge influence that the distance betweet the loading point and pile toe can have on the mobilized base resistance. Moreover, Figs. 16 and 17 show that the dimension of pile cross section also influences the mobilized base resistance. At a vertical displacement of 20 mm, the base resistance of barrette piles with the cross-sectional dimension of 0.8 × 2.8 m and 1.0 × 2.8 m was 10%-15% lower than that of bored piles having a diameter of 1.5 m.

6 Conclusions and practical implications

The current study used 1,137 datasets of super-large and long bored piles collected across 37 real projects to develop an ANN model to predict base resistance of piles. Salient findings in terms of model development and prediction results can be highlighted as follows.

  1. 1.

    A relevant combination of multiple leaning rates, e.g., descending order of LR (i.e., fast-to-slow) from 0.05 to 0.005 in the training process was proved to improve the training time as well as avoiding the issue with local minimum significantly. For example, using the combined different values of LR can save 40% the time for training compared to constant LR approach.

  2. 2.

    The random selection of training and testing datasets can affect considerably the predicted results, for example, R2 in the testing cases can drop from 0.95 to 0.8 using the same model features. Therefore, one cannot conclude well about the confidence of the predicted results if only one or few random cases are selected. The model results based on 250 random divisions of data showed that the probability distribution of R2 was likely to follow a left (negative)-skewed form with much more dispersion in training processes.

  3. 3.

    The sensitivity analysis of the input parameters indicated that the displacement Dp has the most dominant effect on the mobilized base resistance (28.3%) followed by the load-pile toe distance (Lp) and the SPT value (N) of soil beneath the pile. The load P has the least impact with only about 10.5% relative importance. These five different input parameters could be considered relevant to develop an ANN model to predict the base resistance of super-large and long piles.

  4. 4.

    The mobilized base resistance at the displacement of 20 mm rose swiftly from around 10% to more than 30% when Lp became less than 10 m. The base resistance of the piles in O-Cell tests begun to develop much earlier compared to static tests. Empirical equations and design charts were also proposed for practical engineers to estimate the settlement and base resistance of piles based on the outcomes from the current ANN model

It is important to note that while the current findings were established based on the data of bored and barrette piles, they can bring significant values to a broader context of pilling foundations such as cement columns and driven piles where the base resistance and bearing capacity share the same mechanism. For example, the 5 key parameters and their contributions to the base resistance of piles can be applied to different types of pile foundation, whereas the effect of data randomness should be considered in general application of ANN techniques.