A soft ground micro TBM’s specific energy prediction using an eXplainable neural network through Shapley additive explanation and Optuna

Kilic, Kursat; Ikeda, Hajime; Narihiro, Owada; Adachi, Tsuyoshi; Kawamura, Youhei

doi:10.1007/s10064-024-03670-5

A soft ground micro TBM’s specific energy prediction using an eXplainable neural network through Shapley additive explanation and Optuna

Original Paper
Open access
Published: 15 April 2024

Volume 83, article number 175, (2024)
Cite this article

Download PDF

You have full access to this open access article

Bulletin of Engineering Geology and the Environment Aims and scope Submit manuscript

A soft ground micro TBM’s specific energy prediction using an eXplainable neural network through Shapley additive explanation and Optuna

Download PDF

Kursat Kilic ORCID: orcid.org/0000-0003-4362-0704¹,
Hajime Ikeda¹,
Owada Narihiro¹,
Tsuyoshi Adachi¹ &
…
Youhei Kawamura²

Abstract

In tunnel construction, efficiently predicting the energy usage of tunnel boring machines (TBMs) is critical for optimizing operations and reducing costs. This research proposes a novel method for predicting the specific energy of micro slurry tunnel boring machines (MSTBMs) using an explainable neural network (xNN) that leverages operator-monitored data. The xNN model provides transparency and interpretability by integrating the Shapley additive explanation (SHAP) technique, enabling tunneling engineers and operators to gain valuable insights into the prediction process. Extensive data from MSTBM umbrella pipe support excavation are the foundation for training, testing, and unseen data in the xNN model. The specific energy formula derived from the operational parameters of the MSTBM defines the dependent variable for the xNN model. The test dataset evaluates the model’s performance with an R² of 98.7%, an MSE of 2.40, and an MAE of 0.003, demonstrating its accuracy and reliability. Ten percent of the dataset was reserved as unseen data to assess the model’s generalization capabilities. Upon evaluation, the model achieved an R² value of 89%, an MAE of 0.01, and a root mean squared error (RMSE) of 0.01. The xNN empowers operators to optimize operational parameters and promote more efficient and sustainable tunneling practices by identifying influential factors affecting energy consumption through its interpretable nature. This research has significant implications for the future of underground construction, paving the way for improved resource management.

Soft ground micro TBM jack speed and torque prediction using machine learning models through operator data and micro TBM-log data synchronization

Article Open access 27 April 2024

Optimizing sustainable building retrofits with Emperor Penguin Optimization: a machine-learning approach for energy consumption prediction

Article 13 February 2024

Quantification of Energy Consumption and Carbon Dioxide Emissions During Excavator Operations

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The tunnel boring machine (TBM) represents an advanced engineering solution that automates the process of tunnel excavation. It has a rotating cutter head, and cutting tools effectively bore through various geological formations, ranging from soft soil and clay to hard rock and abrasive materials. By employing state-of-the-art technology and engineering principles, TBMs can tunnel through the most challenging terrains with precision and efficiency. In contrast to conventional tunneling techniques such as drilling and blasting, mechanized tunneling with TBMs minimizes the need for human involvement within the tunnel face. TBMs operate in a closed environment, providing a controlled working environment that further enhances safety standards (Acaroglu et al. 2008; Hartlieb et al. 2017; Zhang et al. 2017; Ren et al. 2018; Feng et al. 2021). However, there are drawbacks to the impact of TBM tunnelling on excavation efficiency. The impacts can be classified into two main types: ground-machine interactions and human‒machine interactions. Numerous studies have been conducted to identify and predict the reason for the inefficiency of TBM tunnelling. These studies can be categorized into empirical/theoretical and artificial intelligence applications. Empirical/theoretical models have been introduced to examine the predominant characteristics of TBMs (Roxborough and Phillips 1975; Farmer and Glossop 1980; Wijk 1992; Rostami and Ozdemir 1993; Barton 2000; Cardu and Oreste 2011; Macias et al. 2016; Cardu et al. 2017). Empirical/theoretical models (She et al. 2024) have considered various rock parameters, including the cerchar abrasivity index (CAI), uniaxial compressive strength (UCS), quartz content, Vickers hardness number (VHNR), fracturing degree, porosity, drilling rate index, and rock mass classification, as well as machine properties such as the thrust force (MN), torque (MN.m), penetration rate (m/h), revolutions per minute (rpm), and TBM diameter, to estimate cutter tool life. Nevertheless, this approach has limitations, such as the need for specialized laboratory equipment, reliance on a limited set of input parameters, and high computational time due to the complexity of the experimental setup process.

Many researchers have developed artificial intelligence models for predicting TBM performance to address the limitations of empirical methods. Artificial intelligence models have been used to predict TBM performance based on cutter wearing and penetration rate (Alvarez Grima et al. 2000; Benardos and Kaliampakos 2004; Mahdevari et al. 2012, 2014; Ghasemi et al. 2014; Jahed Armaghani et al. 2018; Salimi et al. 2018, 2022; Koopialipoor et al. 2019; Feng et al. 2021; Kilic et al. 2022; Yang et al. 2022). However, while artificial intelligence models have demonstrated their effectiveness in predicting TBM penetration rates, accurately forecasting specific energy remains a considerable challenge within mechanized tunnelling. The specific energy is a critical parameter that directly impacts the TBM performance and energy consumption throughout tunnel excavation. Understanding and accurately predicting specific energy is vital for optimizing tunnelling operations, enhancing overall efficiency, and minimizing construction expenses (Mirahmadi and Dehkordi 2019; Tang et al. 2023). Several researchers have developed approaches such as empirical, experimental, and intelligent models to analyze the specific energy consumption of mechanized tunneling. Snowdon et al. (1982) Balci and Tumac (2012) Cho et al. (2013) Copur et al. (2014), and Pan et al. (2019) used a linear cutting machine (LCM) to analyze the penetration rate and specific energy relationship. Nevertheless, conducting full-scale linear cutting tests has certain limitations. One of the drawbacks is the challenge of acquiring large rock blocks, which can be difficult or sometimes unfeasible. Additionally, many researchers may not have access to the necessary testing equipment, such as a full-scale linear cutting machine. On the other hand, Altindag (2003) reported a significant relationship between the brittleness of a rock mass and its specific energy. Preinl et al. (2006) identified a correlation between rock mass excavability (RME) and specific energy. The impact of brittleness and destruction energy on specific energy was explored by Atici and Ersoy (2009). A method that utilizes fuzzy logic to estimate the specific energy requirements of constant cross-section disc cutters during rock cutting was proposed by Acaroglu et al. (2008). Regression analyses to explore the connection between excavation parameters and rock cutoff were performed by Tang et al. (2023). They established a prediction model for UCS and a classification model for rock cutoff based on SE. Moreover, the mechanical analysis of the shield excavating process via a nonlinear multiple regression model using on-site data, leading to the development of a diagnostic model for SE, was integrated by Zhang et al. (2012). Introduced a novel specific energy (SE) equation that accounts for variations in the disc-cutter radius, enhancing the predictive accuracy for TBM performance by Wang et al. (2012). However, simple regression models overlook bottlenecks in predicting specific energy because the models have been constructed with limited data and cannot capture reliable information. Zhou et al. (2022) developed a physics-informed machine learning model to predict the energy consumption of a TBM using the physics constraints of tunnel geology. However, due to the tailored physics formula, physics-based machine learning models are complex and unsuitable for generalization. Wang et al. (2023) emphasized that TBM energy consumption is the primary factor influencing excavation efficiency. They argued that evaluating TBM efficiency solely based on operator experience lacks reliability in optimizing excavation effectiveness. To address this concern, the authors devised a hybrid algorithm named QPSO-ILF-ANN, which combines quantum particle swarm optimization with an enhanced loss function grounded on a neural network. This model successfully predicted that increased penetration and rotational speeds increase excavation speeds while concurrently reducing energy usage. However, its complexity arises from the requirement for quantum knowledge and its involvement in traditional hyperparameter tuning methods. In summary, previous researchers did not consider optimizing the operator’s decision-making. Additionally, determining and forecasting the SE is notably more intricate in nonhard rock conditions due to the infrequent utilization of soil mechanics parameters and their inherent uncertainty, as noted by Yu and Mooney (2023). Previous studies revealed two significant gaps in SE prediction research. First, optimizing TBM operators through intelligent system feedback underscores the significance of human–machine interactions. Second, there is a call for a new equation to calculate the specific energy, considering the TBM operational parameters, soil-machine interactions, and material removal flow rate instead of the penetration rate-guided approach.

In this paper, a different point of view was evaluated for the performance evaluation of the TBM. This paper diverges from previous studies by offering data-driven and methodologically distinct approaches. The data-driven approach involves operator decisions (monitoring), which determine how human–machine interactions impact the energy consumption of a TBM. The operator-based dataset was used to develop the model instead of focusing on limited ground-machine interactions. The method will provide new insight into machine operation for the human side because the AI model can assist the operator and then concentrate on crucial parameters for energy consumption while avoiding entire control panel management. The model’s novelty lies in its methodological approach, where a new specific energy formula, tailored for MSTBMs in soft ground, incorporates TBM operational parameters, the soil-machine friction coefficient, and the sludge removal flow rate. By integrating cutting-edge automatic hyperparameter tuning, we can overcome the high computation time and time consumption of model optimization. Finally, incorporating SHAP with xANN enables the identification of critical parameters influencing energy consumption.

Data

Project description

The tunneling project is a highway bypass tunnel in Japan. The hydraulic excavator excavates the main tunnel project. The tunnel project entrance consists of soft ground; therefore, an umbrella pipe support excavation is used to increase the stability of the tunnel roof. Figure 1 illustrates the umbrella pipe support excavation with the MSTBM.

The pipe geology consists of sandy clay, sand, clay (brown), clay (blue), and sandy clay. Owing to the similarities between soil samples, they are grouped as TUC, TUS, and TC based on the site investigation report. Table 1 shows the mechanical properties of the soft ground.

Table 1 Mechanical properties of pipe geology

Full size table

Micro slurry TBM machine specifications

The micro tunnelling method was used to excavate umbrella pipe supports for excavating highway bypass tunnels in Japan. Figure 2 briefly demonstrates the working principle of the micro slurry TBM, and Table 2 provides the specifications of the MSTBM.

Table 2 Micro slurry TBM specifications

Full size table

Data preprocessing

In this research, an operator-based dataset was used to predict the specific energy consumption of the MSTBM. This dataset aimed to identify how to impact operator decisions on specific energy consumption levels. In this section, the operational parameters of the MSTBM were investigated, and the input parameters for the model were selected using a correlation matrix. The operational parameters include 27 operating parameters. Figure 3 presents the frequency distributions and values of the operational parameters.

Owing to the frequency plot distribution, normalization and feature transformation rescaled the data in the range of [0–1] and decreased the influence of magnitude on the variance (Huang et al. 2022). Equation 1 expresses the min–max normalization formula (Munkhdalai et al. 2019).

$$x_\text{scaled}=\frac{x-x_\text{min}}{x_\text{max}-x_\text{min}}$$

(1)

where x_(scaled) is the normalized data, x is the raw data, and x_max and x_min are the maximum and minimum values of the dataset, respectively.

Figure 4 shows the correlation matrix of the operational parameters used to determine the correlations among the variables. According to Fig. 4, 11 operational parameters were selected from the entire dataset due to their reasonable correlations with the specific energy.

Based on the correlation matrix in Fig. 4, the following operational parameters were selected to serve as input variables for the xNN model. The cutter torque (kN∙m), cutter current (ampere; A), cutting-edge water pressure (MPa), earth pressure (kN/m²), jack stroke (mm), original pressing force (MPa), thrust propulsion force (kN), slurry density (t/m³), sludge density (t/m³), slurry dry solids flow rate (l/min), and sludge dry solids flow rate (l/min). The selected parameters have a positive and reasonable correlation with the specific energy.

Methodology

After data preprocessing, the dataset consisted of 11 selected features for building the xANN model. The dataset was split into training (80%), test (20%), and unseen data (10%). The target variable specific energy was derived from the MTBM torque power consumption during cutting rotation with respect to the friction coefficient of the soil machine during sludge removal. As elaborated in the introduction, this derivation was necessitated by the existing specific energy formulas that are primarily applicable to laboratory-scale tests and large hard rock TBMs. Initially, operator-monitored data is collected and subjected to exploratory data analysis to understand data characteristics, with a particular focus on the frequency distribution of data. This analysis, augmented by a correlation matrix, informs feature selection, ensuring that only the most relevant features are chosen, thereby enhancing model performance, and reducing unnecessary complexity. Data is then split into distinct sets: 80% for training, 20% for testing, and 10% set aside as unseen data for final validation. Before training, min-max standardization normalizes the feature scales, promoting faster convergence during the learning phase and preventing any single feature from dominating the model’s attention due to scale differences. The training process itself is optimized through iterative hyperparameter tuning using Optuna, striking a balance between performance and computational efficiency. Post-training, the model is evaluated for accuracy using the test set. SHAP analysis provides insights into feature importance, contributing to the model’s explainability and identifying opportunities to refine the model further. Finally, validation against unseen data ensures that the model’s predictive power holds up against new, real-world data, confirming its robustness and generalization capability. This methodical process not only bolsters model transparency but also seeks to improve the overall quality of predictions, ensuring that the model is not only high-performing but also understandable and trustworthy.

The detailed process of deriving this specific energy equation is described in the “Derivation of the micro slurry TBM-specific energy formula” section. The specific energy equation is used as an output of the xNN model. The process of integrating the Optuna model into the ANN model is described in the “Optuna automatic hyperparameter tuning” section. The “Explainable neural network” section describes explainable neural networks fine-tuned using Optuna. The model has been incorporated with SHAP to address the black box concern inherent in neural networks, thereby ensuring transparency and interpretability of the model’s outcomes in the “Shapley additive explanation (SHAP)” section.

Consequently, the “Evaluation metrics” section outlines the assessment metrics, including R², MSE, and MAE. Figure 5 briefly summarizes the methodology and model description.

Derivation of the micro slurry TBM-specific energy formula

The first concept of the specific energy (J/m³⁾ in the drilling process based on the F is thrust on the bit (kN), A is hole section (m²), N is rotation speed (rps), T is rotation torque (kN.m), and V is the rate of penetration (m/s), provided by Teale (1965). Equation 2 shows the first concept of the specific energy.

$$E= \frac{F}{A}+\frac{2\pi . NT}{AV}$$

(2)

Celada et al. (2009) synthesized earlier specific energy equations. They emphasized that geomechanical parameters, namely, UCS, Young’s modulus, and the rock mass rating (RMR), and TBM-related factors, such as thrust, torque, rotation speed, and drilling fluid pressure, were predominantly employed in deriving specific energy formulas. Notably, this highlights the potential utilization of TBM operational parameters in formulating a specific energy equation tailored to micro slurry TBM excavation. Equation 3 shows the proposed specific energy equation.

$$\mathrm{SE}=\frac{P_{\mathrm{cutting\;net}}\,\left(\mathrm{kW}\right)}{\mathrm{Sludge\;removal\;flow\;rate}\,\left(\frac{\mathrm m^3}{\mathrm h}\right)}$$

(3)

The MSTBM-based specific energy formula was developed inspired by Bilgin et al. (2013) the net power requirement of the TBM cutting ground. The net power requirement formula was improved by considering the frictional coefficient of the pipe-soil interaction and sludge removal flow rate to calculate the slurry TBM energy consumption as a function of the flow rate. Bilgin et al. (2013) stated that $\eta$ (the energy factor of cutterhead motors is 0.75, and the frictional coefficient of the clay shield is 0.20).

The cutting power $P_{\mathrm{cuttingnet}}\left(\mathrm{kW}\right)$ is given in Eq. 4.

$$P_{\mathrm{cutting\;net}}=\left(\frac{2\,\pi\,.T\,.\,\mathrm{RPM}}{60}\right)\,.\,\eta$$

(4)

The previous net power requirement was improved by Eq. 5, which included the frictional coefficient ($\mu$) of the soil machine during cohesive soil excavation. $\mu$ was assumed to be 0.20 for the clay.

$$P_{\mathrm{cutting\;net}}=\left(\frac{2\pi\,.\,T\,.\,\mathrm{RPM}}{60}\right)\,.\,\eta.(1+\mu)$$

(5)

Equation 6 indicates the incorporation of the $P_{\mathrm{cuttingnet}}$ and the sludge removal flow rate (m³/h).

$$\mathrm{SE}=\frac{\left(\frac{2\pi\,.\,T\,.\,\mathrm{RPM}}{60}\right)\,.\eta\,.\,(1+\mu)}{f\left(\frac{\mathrm m^3}{\mathrm h}\right)}$$

(6)

where SE is kWh/m³, T is the cutter torque (kN∙m), revolutions per minute (RPM), η (0.75) is the mechanical efficiency of the TBM, $\mu$ is the friction coefficient of the pipe (0.20) and clay, and f is the sludge removal flow rate (m³/h).

Optuna automatic hyperparameter tuning

Optuna is a new generation of hyperparameter tuning models, as explained by Akiba et al. (2019). Artificial intelligence models have several hyperparameters that are vital for improving model performance. However, it is not easy to manually obtain the best parameters for the model structure because it is computationally expensive. In this regard, the proposed xNN model hyperparameters were determined using the Optuna hyperparameter tuning model with 100 trials. The 100 trials were used to determine the number of neurons, layers, learning rate, and weight decay for the xNN model.

Optuna iteratively selects different sets of hyperparameters θ to train the ANN model and evaluates its performance using the R² score on the test set. The aim is to find the set of hyperparameters that results in the maximum R² score, indicating the best fit between the predicted and actual values. The optimization can be mathematically conceptualized as navigating the hyperparameter space to find the optimal point θopt that yields, shown in Eq. 7.

$$\theta\mathrm{opt}={\mathrm{arg\,max}}_{\theta} \,R^2\,\left(\theta\right)$$

(7)

the process of using Optuna to optimize the hyperparameters of an ANN regressor to maximize the R² score. The process involves defining a suitable objective function, iteratively exploring the hyperparameter space, and evaluating the model’s performance based on R².

The “Explainable neural network” section describes the proposed model structure and feature importance of the hyperparameters.

Explainable neural network

The xNN model uses the Optuna automatic hyperparameter tuning algorithm described in the “Optuna automatic hyperparameter tuning” section. The number of hidden neurons, the number of layers, the learning rate, and the weight decay parameters were determined 100 times through trials with Optuna. Afterwards, the model was retrained based on the best trial parameters. The Optuna can provide the important features of the hyperparameters for the model and relationship hyperparameters with each other. Table 3 presents the selected hyperparameters and their values. Figure 6 illustrates the features and significance of the hyperparameters for the model.

Table 3 xNN model hyperparameter and hyperparameter values

Full size table

Additionally, Fig. 7 demonstrates the Optuna decision process as a parallel coordinate plot for the proposed xANN model structure. Optuna is a Bayesian method, and it allows us to pursue global optimization by progressively constructing a probabilistic model that maps hyperparameter values to the objective function. This model encapsulates assumptions regarding the function’s behavior, creating a posterior distribution over the objective function (Frazier 2018).

Shapley additive explanation (SHAP)

SHAP is based on game theory and is used to extract important features for the output (Wen et al. 2021; Kavzoglu and Teke 2022; Kilic et al. 2023). Therefore, meaningful features provide explainable neural networks. SHAP has different explainer models, such as a tree explainer for tree-based algorithms, a kernel explainer for kernel-based and neural networks, and a deep explainer for a deep neural network model (Lundberg and Lee 2017). In this research, the kernel explainer was integrated into the neural network model to explain the model black box. According to (shap. KernelExplainer), the kernel model utilizes a specific weighted linear regression to calculate the importance of each feature. The evaluated essential values are Shapley values from game theory and the coefficient from a linear regression.

Evaluation metrics

The explainable neural network model was evaluated using R², MSE, and MAE to evaluate the model performance. Equation 8 expresses the equation R². R² calculates the extent to which the model inputs account for the variability in the dependent variable (Chicco et al. 2021).

$${R}^{2}=1-\frac{{\sum }_{i=1}^{n}{({\widehat{y}}_{i}-{y}_{i})}^{2}}{{\sum }_{i=1}^{n}{\left({\overset{-}{y}}_{i}-{y}_{i}\right)}^{2}}$$

(8)

where ${\widehat{y}}_{i}$ is the estimated value of the data, ${y}_{i}$ is the actual value, ${\stackrel{-}{y}}_{i}$ is the mean of the prediction value, and n is the total dataset number.

The MSE is applicable when identifying outliers is necessary. The L2 norm is particularly effective at assigning greater importance to these data points. To elaborate, when the model generates an inferior prediction, the error amplification is intensified through the squaring mechanism in the function. Equation 9 indicates the equation of the MSE (Chicco et al. 2021).

$$\mathrm{MSE}=\frac1m\sum_{i=1}^m\left(X_i-Y_i\right)^2$$

(9)

where ${X}_{i}$ is the predicted value and ${Y}_{i }$ is the actual value.

The MAE is suitable in cases where outliers indicate flawed segments within the dataset. The MAE does not excessively penalize outliers during training, offering a comprehensive and constrained performance evaluation for the model. Conversely, when the test set contains numerous outliers, the model’s performance will be moderate. Equation 10 shows the formulation of the MAE.

$$\mathrm{MAE}=\frac1m\sum_{i=1}^m\left|X_i-Y_i\right|$$

(10)

where ${X}_{i}$ is the predicted value and ${Y}_{i }$ is the actual value.

Results

The performance of the model was evaluated using R², MSE, and MAE. The test data provided the model’s performance with an R² of 98.7%, an MSE of 2.40, and an MAE of 0.003. Ten percent of the dataset was split as unseen data to evaluate the model’s generalization capabilities. Based on the unseen data, the model achieved an R² value of 89%, an MAE of 0.01, and a root mean square error (RMSE) of 0.01. In addition, the model outcome was visualized using predicted and actual plots. Figure 8 illustrates the model prediction performance compared to the actual values. It can be observed that a majority of the data points cluster around the best-fit line, proving the model’s ability to predict specific energy values accurately.

On the other hand, the model prediction error was assessed with a prediction error histogram to indicate the model performance. Figure 9 shows the frequencies of different error ranges, where the x-axis denotes the error magnitude and the y-axis represents the frequency and density of occurrences.

A well-performing model would exhibit a symmetric and narrow distribution around zero error, indicating minimal prediction discrepancies. Figure 9 shows a central peak around zero error, which suggests that the model generally provides accurate predictions.

Figure 10 shows the model performance in terms of the learning curve. During the training processes of the 100 epochs, the model was trained on increasing amounts of data. The model learning curve includes training and validation errors based on the MSE against the number of training iterations. According to Fig. 10, the model demonstrated that training and validation errors gradually decrease and appear to stabilize; thus, the learning is effective and generalizable well.

Additionally, the xANN model was validated using an unseen dataset. The unseen data can be used to simulate a real-world scenario to prove model generalizability. Figure 11 illustrates the unseen data-predicted and actual results. The model provided robust predictions with an R² of 89%. The observed difference between the test score (98.7%) and the unseen data score (89%) can be attributed to data drift, which occurs when the distribution of the input data changes over time. If the test data fails to represent these changes, the model’s performance on unseen data may deteriorate.

Figure 12 shows the frequency distributions of the unseen data and the actual and predicted data densities. Most unseen data reasonably match the actual and predicted data densities.

In addition, the model explanation is critical in preventing the model’s black box and providing an explainable neural network. In the “Shapley additive explanation (SHAP)” section, SHAP was incorporated with a neural network to explain the contribution of operator decisions for specific energy. Figures 13 and 14 show the force and summary plot of the SHAP, respectively. Figures 13 and 14 allow us to determine the underlying reason for the operator’s energy consumption. Figure 13 shows that the original pressing force (MPa), earth pressure (kN/m²), jack stroke (mm), cutter torque (kN.m), and cutter current (A) of the MSTBM are the most critical parameters for energy prediction. In contrast, the thrust propulsion force (kN) is less important for the specific energy.

In addition to Fig. 13, Fig. 14 demonstrates the feature importance of the TBM parameters using a SHAP summary plot. The cutter torque, cutter current, and slurry density (t/m³) are the most significant parameters.

Figure 13 is related to the microlevel explanation of individual predictions, and Fig. 14 is linked to a macrolevel understanding of the model. The model is shown in Fig. 13 to illustrate the decision-making process for individual predictions.

Discussion

This research has provided significant insights into the relationship between influencing the decision of a TBM operator and the specific energy consumption during pipe excavation. Eleven operational parameters were selected among the 27 operational parameters using Fig. 4 in the “Data preprocessing” section. However, three of the chosen parameters are the most critical for the specific energy consumption in pipe excavation, as shown in Fig. 13. The cutter current is more critical than the cutter torque, and the jack speed for predicting a specific energy owing to the cutter current is directly related to the machine driving motors. Figure 13 indicates that the thrust propulsion force impact is lower than that of the other operational parameters of the MSTBM for predicting the specific energy. Figure 15 (a) illustrates the changes in the cutter current and specific energy with respect to the excavation distance. When the cutter current increases, the specific energy of the machine tends to increase. It can be interpreted that more challenging cutting conditions require more power for cutting. Figure 15 (b) illustrates the relationship between the cutter torque and specific energy. A higher torque, which indicates greater resistance against the cutter head, generally leads to higher specific energy values, suggesting that more effort (and thus energy) is needed to excavate the material. Figure 15 (c) presents the machine jack speed control and its variation and energy consumption. Mokhtari and Mooney (2020) stated that the operator attempts to control the jack speed when the TBM increases the advance rate to maintain face stability.

On the other hand, the specific energy has been investigated based on soil formation during pipe excavation. Figure 16 presents the specific energy consumption for each section of the pipe geology. The energy consumption is greater in the clay zone from 60 to 85 m of pipe excavation than in the sand and sandy clay zones. The main reason is that the operator increased the torque and tended to adjust the jack speed control in cohesive soil.

Practical applications

The insights gained from this study have practical implications for the tunnelling industry. TBM operators can leverage the relationships between the cutter torque, jack stroke, cutter current of the driving motor, and specific energy to optimize excavation processes. By carefully adjusting the cutter torque, cutter current, and jack stroke based on the specific energy requirements of different geological zones, tunneling operations can be managed for higher efficiency and cost-effectiveness. The study also highlights the importance of specific energy consumption depending on geological information. This research shows the importance of selecting a TBM based on the geological setting to reduce energy consumption while monitoring the TBM parameters. Figure 17 illustrates the practical application of the xNN model.

Despite its strengths, the proposed model has several limitations. They are summarized as follows:

(1)
The analysis focused on a specific set of micro slurry TBM parameters, but geomechanical parameters, cutter head design, and cutter tool types were not considered to derive specific energy formulas. They may influence the specific energy; therefore, further research could explore the impact of the additional features of the specific energy.
(2)
The proposed model did not predict a different case due to the data availability. Therefore, the model can be applied to different scenarios to determine its performance in some cases.
(3)
The kernel explainer computation time is high, making it a bottleneck for explainable artificial intelligence models.
(4)
The model was constructed using operational parameters from micro tunnelling TBMs, implying that utilizing operational parameters from larger TBMs or those designed for hard rock conditions would yield divergent outcomes.

Conclusions

This research thoroughly explored the use of explainable neural networks (xNNs) for predicting the specific energy consumption of mechanized soft ground tunnel boring machines (MSTBMs) under soft ground conditions. The xANN model, which uses 11 selected operational parameters, provided new insights into the prediction process and its influencing key factors. A unique aspect of this study is its focus on soft ground tunnelling, diverging from previous research that predominantly concentrated on hard rock mechanized tunnelling. This shift necessitated an innovative approach to feature extraction, using correlation analysis to identify relevant parameters for the xNN model. Moreover, the research introduced a novel specific energy formula designed explicitly for soft ground MSTBM operations, addressing the inadequacies of existing models. A significant advancement of this study is the incorporation of the advanced Optuna algorithm, which facilitates automatic hyperparameter tuning, optimizes the model’s performance, and reduces manual intervention. The SHAP technique further enhances the model by identifying crucial features that impact specific energy consumption, providing actionable insights for operators. One of the most noteworthy outcomes of this research is the model’s ability to provide feedback to the operator, enabling them to optimize machine parameters and adjust energy consumption in real time. This feature is particularly beneficial for managing the energy demands of various soil types, especially in clay zones where the energy consumption is notably high due to operational adjustments. This research presents a groundbreaking xNN-based approach for predicting the specific energy of MSTBMs operating under soft ground conditions. The model paves the way for more efficient, adaptive, and sustainable tunnelling practices by enabling real-time feedback and parameter optimization.

References

Acaroglu O, Ozdemir L, Asbury B (2008) A fuzzy logic model to predict specific energy requirement for TBM performance prediction. Tunn Undergr Space Technol 23:600–608. https://doi.org/10.1016/j.tust.2007.11.003
Article Google Scholar
Akiba T, Sano S, Yanase T et al (2019) Optuna: a next-generation hyperparameter optimization framework. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. Association for Computing Machinery, New York, pp 2623–2631
Altindag R (2003) Correlation of specific energy with rock brittleness concepts on rock cutting. J South Afr Inst Min Metall 103:163–171. https://doi.org/10.10520/AJA0038223X_2948
Article Google Scholar
Alvarez Grima M, Bruines PA, Verhoef PNW (2000) Modeling tunnel boring machine performance by neuro-fuzzy methods. Tunn Undergr Space Technol 15:259–269. https://doi.org/10.1016/S0886-7798(00)00055-9
Article Google Scholar
Atici U, Ersoy A (2009) Correlation of specific energy of cutting saws and drilling bits with rock brittleness and destruction energy. J Mater Process Technol 209:2602–2612. https://doi.org/10.1016/j.jmatprotec.2008.06.004
Article CAS Google Scholar
Balci C, Tumac D (2012) Investigation into the effects of different rocks on rock cuttability by a V-type disc cutter. Tunn Undergr Space Technol 30:183–193. https://doi.org/10.1016/j.tust.2012.02.018
Article Google Scholar
Barton NR (2000) TBM tunnelling in jointed and faulted Rock. CRC
Google Scholar
Benardos AG, Kaliampakos DC (2004) Modelling TBM performance with artificial neural networks. Tunn Undergr Space Technol 19:597–605. https://doi.org/10.1016/j.tust.2004.02.128
Article Google Scholar
Bilgin N, Copur H, Balci C (2013) Mechanical excavation in mining and civil industries. CRC
Book Google Scholar
Cardu M, Oreste P (2011) Earth Sci Res J 15:5–11. http://www.scielo.org.co/scielo.php?script=sci_abstract&pid=S1794-61902011000100001&lng=en&nrm=iso&tlng=en
Cardu M, Iabichino G, Oreste P, Rispoli A (2017) Experimental and analytical studies of the parameters influencing the action of TBM disc tools in tunnelling. Acta Geotech 12:293–304. https://doi.org/10.1007/s11440-016-0453-9
Article Google Scholar
Celada (2009) The use of the specific drilling energy for rock mass characterisation andtbm driving during tunnel construction [tunnel engineering - mechanized tunneling] - Geotechpedia. https://geotechpedia.com/Publication/Show/211/THE-USE-OF-THE-SPECIFIC-DRILLING-ENERGY-FOR-ROCKMASS-CHARACTERISATION-AND-TBM-DRIVING-DURING-TUNNEL-CONSTRUCTION. Accessed 5 Aug 2023
Chicco D, Warrens MJ, Jurman G (2021) The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Comput Sci 7:e623. https://doi.org/10.7717/peerj-cs.623
Article Google Scholar
Cho J-W, Jeon S, Jeong H-Y, Chang S-H (2013) Evaluation of cutting efficiency during TBM disc cutter excavation within a Korean granitic rock using linear-cutting-machine testing and photogrammetric measurement. Tunn Undergr Space Technol 35:37–54. https://doi.org/10.1016/j.tust.2012.08.006
Article Google Scholar
Copur H, Aydin H, Bilgin N et al (2014) Predicting performance of EPB TBMs by using a stochastic model implemented into a deterministic model. Tunn Undergr Space Technol 42:1–14. https://doi.org/10.1016/j.tust.2014.01.006
Article Google Scholar
Farmer IW, Glossop NH (1980) Mechanics of disc cutter penetration. Tunn Tunn U K 12:6
Google Scholar
Feng S, Chen Z, Luo H et al (2021) Tunnel boring machines (TBM) performance prediction: a case study using big data and deep learning. Tunn Undergr Space Technol 110:103636. https://doi.org/10.1016/j.tust.2020.103636
Article Google Scholar
Frazier PI (2018) A tutorial on Bayesian optimization. https://arxiv.org/abs/1807.02811
Ghasemi E, Yagiz S, Ataei M (2014) Predicting penetration rate of hard rock tunnel boring machine using fuzzy logic. Bull Eng Geol Environ 73:23–35. https://doi.org/10.1007/s10064-013-0497-0
Article Google Scholar
Hartlieb P, Grafe B, Shepel T et al (2017) Experimental study on artificially induced crack patterns and their consequences on mechanical excavation processes. Int J Rock Mech Min Sci 100:160–169. https://doi.org/10.1016/j.ijrmms.2017.10.024
Article Google Scholar
Huang X, Zhang Q, Liu Q et al (2022) A real-time prediction method for tunnel boring machine cutter-head torque using bidirectional long short-term memory networks optimized by multi-algorithm. J Rock Mech Geotech Eng 14:798–812. https://doi.org/10.1016/j.jrmge.2021.11.008
Article Google Scholar
Jahed Armaghani D, Faradonbeh RS, Momeni E et al (2018) Performance prediction of tunnel boring machine through developing a gene expression programming equation. Eng Comput 34:129–141. https://doi.org/10.1007/s00366-017-0526-x
Article Google Scholar
Kavzoglu T, Teke A (2022) Advanced hyperparameter optimization for improved spatial prediction of shallow landslides using extreme gradient boosting (XGBoost). Bull Eng Geol Environ 81:201. https://doi.org/10.1007/s10064-022-02708-w
Article Google Scholar
Kilic K, Toriya H, Kosugi Y et al (2022) One-dimensional convolutional neural network for pipe jacking EPB TBM cutter wear prediction. Appl Sci 12:2410. https://doi.org/10.3390/app12052410
Article CAS Google Scholar
Kilic K, Ikeda H, Adachi T, Kawamura Y (2023) Soft ground tunnel lithology classification using clustering-guided light gradient boosting machine. J Rock Mech Geotech Eng. https://doi.org/10.1016/j.jrmge.2023.02.013
Article Google Scholar
Koopialipoor M, Tootoonchi H, Jahed Armaghani D et al (2019) Application of deep neural networks in predicting the penetration rate of tunnel boring machines. Bull Eng Geol Environ 78:6347–6360. https://doi.org/10.1007/s10064-019-01538-7
Article Google Scholar
Lundberg S, Lee S-I (2017) A unified approach to interpreting model predictions. https://arxiv.org/abs/1705.07874
Macias FJ, Dahl F, Bruland A (2016) New rock abrasivity test method for tool life assessments on hard rock tunnel boring: the Rolling Indentation Abrasion Test (RIAT). Rock Mech Rock Eng 49:1679–1693. https://doi.org/10.1007/s00603-015-0854-3
Article Google Scholar
Mahdevari S, Torabi SR, Monjezi M (2012) Application of artificial intelligence algorithms in predicting tunnel convergence to avoid TBM jamming phenomenon. Int J Rock Mech Min Sci 55:33–44. https://doi.org/10.1016/j.ijrmms.2012.06.005
Article Google Scholar
Mahdevari S, Shahriar K, Yagiz S, Akbarpour Shirazi M (2014) A support vector regression model for predicting tunnel boring machine penetration rates. Int J Rock Mech Min Sci 72:214–229. https://doi.org/10.1016/j.ijrmms.2014.09.012
Article Google Scholar
Mirahmadi M, Dehkordi MS (2019) Application of the cohesion softening–friction softening and the cohesion softening–friction hardening models of rock mass behavior to estimate the specific energy of TBM, case study: Amir–Kabir water conveyance tunnel in Iran. Geotech Geol Eng 37:375–387. https://doi.org/10.1007/s10706-018-0617-5
Article Google Scholar
Mokhtari S, Mooney MA (2020) Predicting EPBM advance rate performance using support vector regression modeling. Tunn Undergr Space Technol 104:103520. https://doi.org/10.1016/j.tust.2020.103520
Article Google Scholar
Munkhdalai L, Munkhdalai T, Park KH et al (2019) Mixture of activation functions with extended min-max normalization for Forex market prediction. IEEE Access 7:183680–183691. https://doi.org/10.1109/ACCESS.2019.2959789
Article Google Scholar
Pan Y, Liu Q, Peng X et al (2019) Full-scale linear cutting tests to propose some empirical formulas for TBM disc cutter performance prediction. Rock Mech Rock Eng 52:4763–4783. https://doi.org/10.1007/s00603-019-01865-x
Article Google Scholar
Preinl Z, Tamames B, Fernández J, Hernández Álvarez M (2006) Rock mass excavability (RME) indicator: new way to selecting the optimum tunnel construction method. Tunn Undergr Space Technol 21:237–237. https://doi.org/10.1016/j.tust.2005.12.016
Article Google Scholar
Ren D-J, Shen S-L, Arulrajah A, Cheng W-C (2018) Prediction model of TBM disc cutter wear during tunnelling in heterogeneous ground. Rock Mech Rock Eng 51:3599–3611. https://doi.org/10.1007/s00603-018-1549-3
Article Google Scholar
Rostami J, Ozdemir L (1993) New model for performance production of hard rock TBMs. In: Proceedings - rapid excavation and tunneling conference 793–809. https://www.researchgate.net/publication/288383954_New_model_for_performance_production_of_hard_rock_TBMs
Roxborough FF, Phillips HR (1975) Rock excavation by disc cutter. Int J Rock Mech Min Sci Geomech Abstr 12:361–366. https://doi.org/10.1016/0148-9062(75)90547-1
Article Google Scholar
Salimi A, Faradonbeh RS, Monjezi M, Moormann C (2018) TBM performance estimation using a classification and regression tree (CART) technique. Bull Eng Geol Environ 77:429–440. https://doi.org/10.1007/s10064-016-0969-0
Article Google Scholar
Salimi A, Rostami J, Moormann C, Hassanpour J (2022) Introducing tree-based-regression models for prediction of hard rock TBM performance with consideration of rock type. Rock Mech Rock Eng 55:4869–4891. https://doi.org/10.1007/s00603-022-02868-x
Article Google Scholar
She L, Hu C, Li Y et al (2024) An empirical method for estimating TBM penetration rate using tunnelling specific energy. Tunn Undergr Space Technol 144:105525. https://doi.org/10.1016/j.tust.2023.105525
Article Google Scholar
Snowdon RA, Ryley MD, Temporal J (1982) A study of disc cutting in selected British rocks. Int J Rock Mech Min Sci Geomech Abstr 19:107–121. https://doi.org/10.1016/0148-9062(82)91151-2
Article Google Scholar
Tang Y, Yang J, Wang S, Wang S (2023) Analysis of rock cuttability based on excavation parameters of TBM. Geomech Geophys Geo-Energy Geo-Resour 9:93. https://doi.org/10.1007/s40948-023-00628-x
Article Google Scholar
Teale R (1965) The concept of specific energy in rock drilling. Int J Rock Mech Min Sci Geomech Abstr 2:57–73. https://doi.org/10.1016/0148-9062(65)90022-7
Article Google Scholar
Wang L, Kang Y, Cai Z et al (2012) The energy method to predict disc cutter wear extent for hard rock TBMs. Tunn Undergr Space Technol 28:183–191. https://doi.org/10.1016/j.tust.2011.11.001
Article CAS Google Scholar
Wang X, Wu J, Yin X et al (2023) QPSO-ILF-ANN-based optimization of TBM control parameters considering tunneling energy efficiency. Front Struct Civ Eng 17:25–36. https://doi.org/10.1007/s11709-022-0908-z
Article Google Scholar
Wen X, Xie Y, Wu L, Jiang L (2021) Quantifying and comparing the effects of key risk factors on various types of roadway segment crashes with LightGBM and SHAP. Accid Anal Prev 159:106261. https://doi.org/10.1016/j.aap.2021.106261
Article Google Scholar
Wijk G (1992) A model of tunnel boring machine performance. Geotech Geol Eng 10:19–40. https://doi.org/10.1007/BF00881969
Article Google Scholar
Yang H, Song K, Zhou J (2022) Automated recognition model of geomechanical information based on operational data of tunneling boring machines. Rock Mech Rock Eng 55:1499–1516. https://doi.org/10.1007/s00603-021-02723-5
Article Google Scholar
Yu H, Mooney M (2023) Characterizing the as-encountered ground condition with tunnel boring machine data using semi-supervised learning. Comput Geotech 154:105159. https://doi.org/10.1016/j.compgeo.2022.105159
Article Google Scholar
Zhang Q, Qu C, Cai Z, et al (2012) Modeling specific energy for shield machine by non-linear multiple regression method and mechanical analysis. In: Gaol FL, Nguyen QV (eds) Proceedings of the 2011 2nd International Congresson Computer Applications and Computational Science. Springer, Berlin, Heidelberg, pp 75–80
Zhang X, Xia Y, Zhang Y et al (2017) Experimental study on wear behaviors of TBM disc cutter ring under drying, water and seawater conditions. Wear 392–393:109–117. https://doi.org/10.1016/j.wear.2017.09.020
Article CAS Google Scholar
Zhou S, Liu S, Kang Y et al (2022) Physics-based machine learning method and the application to energy consumption prediction in tunneling construction. Adv Eng Inf 53:101642. https://doi.org/10.1016/j.aei.2022.101642
Article Google Scholar

Download references

Funding

Open Access funding provided by Akita University.

Author information

Authors and Affiliations

Department of Geosciences, Geotechnology and Materials Engineering for Resources, Graduate School of International Resource Sciences, Akita University, Akita, 010-8502, Japan
Kursat Kilic, Hajime Ikeda, Owada Narihiro & Tsuyoshi Adachi
Division of Sustainable Resources Engineering, Faculty of Engineering, Hokkaido University, Kita 13, Nishi 8, Kita-ku, Sapporo, 060-8628, Japan
Youhei Kawamura

Authors

Kursat Kilic
View author publications
You can also search for this author in PubMed Google Scholar
Hajime Ikeda
View author publications
You can also search for this author in PubMed Google Scholar
Owada Narihiro
View author publications
You can also search for this author in PubMed Google Scholar
Tsuyoshi Adachi
View author publications
You can also search for this author in PubMed Google Scholar
Youhei Kawamura
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: Kursat Kilic, Youhei Kawamura, Hajime Ikeda; methodology: Kursat Kilic; formal analysis and investigation: Kursat Kilic, Owada Narihiro; writing — original draft preparation: Kursat Kilic; writing — review and editing: Hajime Ikeda, Tsuyoshi Adachi; funding acquisition: Youhei Kawamura; resources: Hajime Ikeda, Youhei Kawamura; supervision: Hajime Ikeda, Youhei Kawamura.

Corresponding author

Correspondence to Kursat Kilic.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kilic, K., Ikeda, H., Narihiro, O. et al. A soft ground micro TBM’s specific energy prediction using an eXplainable neural network through Shapley additive explanation and Optuna. Bull Eng Geol Environ 83, 175 (2024). https://doi.org/10.1007/s10064-024-03670-5

Download citation

Received: 22 August 2023
Accepted: 03 April 2024
Published: 15 April 2024
DOI: https://doi.org/10.1007/s10064-024-03670-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A soft ground micro TBM’s specific energy prediction using an eXplainable neural network through Shapley additive explanation and Optuna

Abstract

Similar content being viewed by others

Soft ground micro TBM jack speed and torque prediction using machine learning models through operator data and micro TBM-log data synchronization

Optimizing sustainable building retrofits with Emperor Penguin Optimization: a machine-learning approach for energy consumption prediction

Quantification of Energy Consumption and Carbon Dioxide Emissions During Excavator Operations

Introduction