1 Introduction

The in situ stress field is widely known to be a fundamental factor for not only the analysis, design calculation, and stability assessment of rock engineering projects, but also the study of rock-mass deformation and failure (Guo et al., 1983; Gong et al., 2010). The accuracy of the in situ stress field obtained by numerical simulation directly influences the final results, especially for excavations in high level tectonic stress areas (González de Vallejo and Hijazo, 2008). Therefore, it is essential to know the magnitude and direction of in situ stress for any underground project before construction, requiring extensive measurements and calculations.

The in situ stress field results from a highly complicated process and is affected by many factors, including topography, geological history, the behavioral evolution of rocks, tectonic evolution, the geothermal field, and groundwater. In situ measurement is the most direct approach for determining the geo-stress field (Liu et al., 2014). However, such field tests are often costly and time-consuming. Furthermore, due to the complexity brought by the formation of in situ stress and the excessive working impact, the measured results are only a reflection of the local stress field. Thus, based on the measurement results and geological structure conditions, it is crucial to derive a reasonable stress field with wide applicability using a convenient and efficient back analysis approach coupled with an advanced numerical simulation method (Mckinnon, 2001; Saati and Mortazavi, 2011).

In numerical simulations, the in situ stress fields are generally acquired by a number of methods using limited measurement and topography data used for design and construction at various engineering sites (Zhang et al., 2006; Guo et al., 2008; Qin et al., 2008). These methods include multiple linear regression (MLR) analysis, artificial neural network (ANN) methods (Kartam et al., 1997; Xu, 2000; Li et al., 2012; Samui et al., 2015), and genetic algorithm optimization approaches, which have been adopted extensively within stress boundary conditions and displacement boundary conditions in particular. However, these methods have unique limitations, specifically in terms of shortcomings in the speed and accuracy of the inverse analysis of the in situ stress field.

Under uniform lithology and relatively simple geological conditions, the MLR analysis method for in situ stress estimation has been applied in many construction sites to assess the regional in situ stress because of its speed and convenience. However, the interaction of the main factors affecting the in situ stress field is ignored by the substantial assumptions about linearity and continuity. Moreover, the relationship between the in situ stress field and the main factors is too complicated to express by a sub-harmonic multivariate function. Therefore, the MLR analysis method usually lacks conformity with the practical situation and eventually leads to significant inaccuracy. As a typical surrogate direct inversion, the ANN approach attempts to use machine learning to map the nonlinear inverse relationship between the stress measurements and the boundary conditions (Grossberg, 1988; Kartam et al., 1997) but often has difficulty traversing the typically non-convex optimization search space because of its inherent limitations, including non-uniqueness, non-existence, and instability. Moreover, it relies mostly on a specifically designed algorithm and a high-accuracy digital computer to ensure satisfactory results, often at the expense of convergence speed. Alternatively, the genetic algorithm is a well-known optimization-based approach used to solve the inverse problem to determine the distribution of the in situ stress field. However, this method is prone to problems of premature convergence. In other words, it easily converges to a local optimal solution and requires a large number of forward analyses to estimate a solution. This means that it is typically computationally expensive with unnecessary time costs. Moreover, its solution accuracy depends strongly on the selection of the initial population. Given the limitations of these methods, one logical approach is to combine stepwise regression (SR), difference evolution (DE), support vector machine (SVM), and numerical analysis techniques to solve the inverse problem and estimate a stress field that more effectively addresses the need for both accuracy and computational efficiency (Grossberg, 1988; Kartam et al., 1997).

This paper presents the details of a particularly promising approach incorporating linear regression, SVM, and DE, and applies this approach to the problem of inversely obtaining the in situ stress field in underground projects. In the following section, considering the effect of the unloading of land surface denudation, we determine a series of parameters to be estimated related to the formation of the rock-mass in situ stress field. First, we provide an outline and the technological process of a two-stage optimization algorithm and its incorporation into an inverse solution method, namely the SR-DE-SVM algorithm, for estimating the in situ stress in a rock-mass. Then, we use this method to estimate a set of unknown parameters at the underground caverns at the Huangdeng hydropower station, using a numerical analysis to assess the efficacy of the SR-DE-SVM approach. Finally, we present validation tests.

2 Land surface denudation theory for the magnitude of in situ stress

2.1 Denudation effect of unloading

In the long term, the geological history and behavioral evolution of rocks may significantly influence the state of in situ stress fields. Some research has shown that the distribution of the rock-mass in an in situ stress field is strongly influenced by the land surface denudation effects of unloading resulting from an intense geological process when the geological environment is complicated (Hijazo and González de Vallejo, 2012). Consequently, the unloading geological behaviors stemming from land surface denudation must be considered to obtain a reasonable assessment of the in situ stress. The in situ stress field in high-mountain gorge areas is the inevitable result of various unloading geological behaviors, including land surface erosion and valley denudation, which disequilibrate the state of rock-mass stress before the formation of a valley, leading to a new balanced state through the simultaneous adjustment of its stress and strain.

To express the effect of land surface denudation, several reasonable and necessary assumptions should be made, namely, that the land was sufficiently even in ancient times and, based on geological mechanics analysis, that the in situ stress comes mainly from gravity and the geological tectonic stress field (Guo, et al., 2008) applied in ancient times. The formation of a rock-mass in situ stress field involves many related factors, such as topography, lithology, geological and structural anisotropies, sedimentary loads, tectonic evolution, thermal conditions, and groundwater. However, the thermal stress is equal to only 1/9 of the gravity stress. Thus, the thermal stress effect on the rock-mass in situ stress field is negligible for common underground projects. Conversely, groundwater has an intricate mechanism of influence on the in situ stress. Specifically, an increase in pore pressure can reduce the shear strength of the rock in the potential structural plane. Moreover, seepage pressures usually change the in situ stress field in underground projects. The thermal conditions and groundwater are usually ignored in analyses because of their comparatively small influence and difficult quantification. The last assumption is that the estimated in situ stress field is created by land denudation and erosion in the context of an ancient in situ stress field.

2.2 Land surface denudation simulation

Six types of displacement boundary conditions were considered in the simulation analysis to represent the different effects of land surface denudation and river erosion (Fig. 1) (Jiang et al., 2008), reflecting the distribution of the in situ stress field under different effects of unloading in underground projects.

Fig. 1
figure 1

Schematic diagram of the displacement boundary conditions

(a) Tectonic movement boundary in the x direction, \({U_x}\); (b) Tectonic movement boundary in the y direction, \({U_y}\); (c) Shear tectonic movement boundary in the xy plane, \({U_{xy}}\) (d) Shear tectonic movement boundary in the xz plane, \({U_{xz}}\) (e) Shear tectonic movement boundary in the yz plane, \({U_{yz}}\) (f) Boundary condition under a gravity function. γ is the unit weight of the rock mass; H is the depth of its location

3 Two-stage optimization algorithm for estimating inverse in situ stress

The two-stage optimization algorithm is a nonlinear global optimization method that combines SR and SVM with DE. It has been shown to be capable of effectively solving global optimization problems with relatively low use of computational resources (Jiang et al., 2013). SR can ascertain the set of unknown parameters that best matches the modeling prediction and determine the range of parameters to be recognized. DE is applied to determine the optimum parameters of the SVM. The SVM is used to create the DE-SVM nonlinear reflection model that maintains the global search capabilities over the whole search space to efficiently determine an optimal solution interval.

3.1 Methodology of the stepwise regression

As a prominent multivariate function, the SR method uses its specific and available efficiency to determine the importance of numerous influencing parameters to obtain the best combination of variables to establish functional relations, especially in a complex system. The selection of the initial parameters strongly influences the evolution velocity of the SVM. Improper parameter selection may lead to generation stagnancy or premature phenomena, failing to satisfy the analysis request. The proposed method introduces SR into the in situ stress field inversion analysis to define the causes of the formation of the in situ stress more clearly and to provide an important basis for underground cavern design and stability analysis.

Simulation of a tectonic stress field is usually conducted by applying normal or tangential displacement or stress boundary conditions, whose consistency relative to each other is ignored. However, the entry of one factor will lead to degradation of the others. Thus, after calculation and comparison, SR is used to exclude those factors that do not play a dominant role in the region of interest. The SR filters and inspects each tectonic stress factor that may be influential and that is consistent with the existing structural features to obtain an optimized solution matching the engineering geological investigation data.

The in situ stress value can be represented by the following equation:

$$\sigma _{jk}^0 = {C_0} + \sum\limits_{i = 1}^n {{C_i}\sigma _{jk}^i + \varepsilon },$$
(1)

where \({\sigma _{jk}}^0\) is the calculated initial geo-stress value of stress component j on point k, C0 is a constant coefficient, C i represents a stress component caused by each movement pattern, including gravity movement and sub-tectonic movement patterns, \({\sigma _{jk}}^i\) is the calculated geo-stress value of stress component j under gravity and sub-tectonic movement on point k, and ε is the model error.

First, we assume that H0: C0=C1=C2=… =Cn−1=C n =0 and check H0 using test data as a statistical sample. If H0 is rejected, the linear regression is significant, which indicates that a linear relationship exists between the independent and dependent variables. If not, the regression is not significant, which indicates that the regression model does not work.

The core concept of SR is to introduce impact factors one by one under the condition that the factor has an obvious sum of squares of partial regression. The previous factors are then inspected individually after adding new factors to exclude those without obvious relationships, ensuring that the optimal subset of variables can be obtained in several steps. The method is simple and fast because there is no need to determine the partial correlation coefficients.

The least square SR method is adopted to conduct a statistical analysis of the effect of different parameters to be estimated on target parameters. The F-test is applied to check whether the regression is significant, where the statistical magnitude of F has the computational formula shown in Eq. (2). Furthermore, the ratio of the regression sum of squares and the residual sum of squares obeys the distribution F, whose degree of freedom is m and nm−1:

$$F = {{U{{/}}m} \over {Q{{/}}(n - m - 1)}} \sim F(m,\;n - m - 1),$$
(2)

where \(U = \sum\nolimits_{k = 1}^n {{{({{\hat y}_k} - \bar y)}^2}}\) is the regression sum of squares, which is the sum of squares of the difference between the regression value ŷk and the average value \({\bar y}\), m is the number of variables, n is the sample number, and \(Q = \sum\nolimits_{k = 1}^n {{{({y_k} - {{\hat y}_k})}^2}}\) is the residual sum of squares, namely, the sum of squares of the difference between the measured value yk and the regression value ŷk.

The F statistic can be used to quantify the overall effect of the regression; the larger the F value is, the better the regression equation is. For the given confidence level λ (λ=95% is commonly adopted), the F statistic should satisfy

$$P(F \leq {F_\lambda }(m,\;n - m - 1)) \leq 1 - \lambda {.}$$
(3)

If Eq. (3) does not hold, the assumption fails.

The procedure for SR is as follows:

  1. 1.

    Independent variables are adopted to conduct the linear regression statistics for dependent variables and to check the F value.

  2. 2.

    The effect on the dependent variables of the independent variables that are sequentially added to the regression equation is observed. The effect can be determined by both the coefficient of partial correlation between the two variables and the F value regressed by them.

  3. 3.

    If the independent variables introduced earlier become less significant due to the introduction of latter variables, the earlier variables are eliminated.

  4. 4.

    Steps 2 and 3 are repeated until no additional insignificant variables are present in the equation and no additional significant variables are added into the equation. Each time that steps 2 and 3 are repeated, the F value is checked to confirm the significance of the effect of the independent variables on the dependent variables.

3.2 Nonlinear model based on DE-SVM

The SVM is a preeminent machine learning tool that minimizes an upper bound on its expected error. It attempts to convert low-dimensional nonlinear problems into high-dimensional linear problems and then replace inner product computation with a kernel function. More specifically, given data points x1, x2, …, xn and class labels y1, y2, …, yn, the SVM has the form:

$${y_i} = f(x) = (w \cdot x) + {b{\ast}} = \sum\limits_{i = 1}^n {(\alpha _i^{\ast} - {\alpha _i})} ({x_i} \cdot x) + {b{{\ast}}}{.}$$
(4)

Let w be the weighted vector and x i ∈ ℝd, yi∈ℝ. Furthermore, let (α i and \({\alpha _i}^\ast\) be the support vectors, on which all of the classifications performed by the SVM solely depend. Finally, b* represents the classification threshold. Thus, an identical SVM would result from a training set that omits all of the remaining examples. The data representation in this feature space need not be explicitly calculated if there is an appropriate Mercer kernel operator for which

$$k(x,\;{x_i}) = \exp \left({ - {{{{\left\vert {x - {x_i}} \right\vert }^2}} \over {2{\sigma ^2}}}} \right){.}$$
(5)

A radial basis function is adopted as the typical Mercer kernel operator due to its excellent performance, meaning that the data that are not linearly separable in the original space may become separable in this feature space. For data sets that are non-separable, slack variables ξi and \({\xi _i}^ \ast\) are introduced to relax the constraint as

$$\min \left[ {{1 \over 2}{{\left\Vert w \right\Vert }^2} + c\sum\limits_{i = 1}^n {({\xi _i} + \xi _i^{\ast})} } \right],$$
(6)

subject to ξi≥0, \({\xi _i}^ \ast \geq 0\), and c≥0, representing a constant called the penalty parameter. An optimal model can usually be achieved by seeking the best values of c and k, which play an important role in its prediction accuracy. However, the SVM never provides systematic theoretical approaches for the two parameters.

A DE algorithm is designed to fulfill user requirements, as its control variables are robust and easy to select and it has good convergence to the global minimum in consecutive independent trials (Xing et al., 2015). In this study, we used the advantages of the DE algorithm to explore the best parameters of c and k in SVM modeling due to its simple but efficient global optimization performance (Cheng et al., 2013). The improved DE-SVM algorithm includes the following steps:

  1. 1.

    Initializing of the DE-SVM parameters, which entails randomly generating Np bivectors about c and k in SVM modeling that should be satisfied to restrain the upper and lower bound independent variables. Because the DE algorithm is designed to be a stochastic real parameter optimization algorithm, its formula is as follows:

    $${x_{ij}} = x_{ij}^{\rm{L}} + {\rm{rand}}(x_{ij}^{\rm{U}} - x_{ij}^{\rm{L}}),$$
    (7)

    subject to i=1, 2, …, Np, where Np represents the population size, j=1, 2 is the dimension of the vectors, \({x_{ij}}^{\rm{U}}\) and \({x_{ij}}^{\rm{L}}\) are the upper and lower bound independent variables, respectively, and the function rand is random and rand∈[0, 1].

  2. 2.

    In mutation, the difference between two independent vectors is minimized and then merged with a third independent vector in its population, forming a new variation vector. For the target vector x ij , the mutant vector is obtained according to

    $${v_{ij}}(G + 1) = {x_{{r_1}j}}(G) + F({x_{{r_2}j}}(G) - {x_{{r_3}j}}(G)),$$
    (8)

    where r1, r2, r3∈[1, n], which are random indexes. r1, r2, and r3 are chosen to be different from the running index i; thus, n≥4 to allow for this condition. F∈[0, 2], where F determines the amplification of the differential variation \({x_{{r_{2j}}}}(G) - {x_{{r_{3j}}}}(G)\), and G stands for generation.

  3. 3.

    Crossover increases the diversity of the perturbed parameter vectors. Each vector in the current population is combined with a mutant to produce a trial population in this step (Jiang et al., 2013). The trial vectors are expressed as

    $${u_{ji,G + 1}} = \left\{ {\begin{array}{*{20}c} {{v_{ji,G + 1}},} & {{r_j} \leq {\rm{CR}}\;{\rm{or}}\;j = r{n_i},}\\ {{x_{ji,G}},} & {{r_j} > {\rm{CR}}\;{\rm{and}}\;j \neq r{n_i},} \end{array}} \right.$$
    (9)

    where j=1, 2, …, n; rj is a random number between 0 and 1; CR∈[0, 1], called a crossover constant, is determined by the user, and rni∈[1, n], which ensures that uji,G+1 obtains at least one vector parameter from νji,G+1.

  4. 4.

    Selection is then performed to determine whether the target or trial vector survives to become a member of the next generation. If the trial vector uji,G+1 yields a smaller cost function value than the target vector xji, G, then the target vector x ji,G can replace the trial vector uji, G+1. That is, the target vector xji, G is not itself replaced.

  5. 5.

    Loop iteration is conducted, repeating steps 2 to 4, until the end condition for the optimization process is reached. This end condition is set as a maximum generation Gmax or maximum number of function evaluations, which can be specified by the user to avoid over-fitting. Therefore, the optimal parameters will be obtained when the process reaches the convergence stage, indicating that the optimum parameters are provided for the SVM, allowing it to create an excellent model.

3.3 SR-DE-SVM algorithm

To describe the SR-DE-SVM algorithm as specifically implemented here for the in situ stress inverse problem, in which the boundary conditions are set to be unknown parameters and the objective function minimizes the error between the measured stress and the calculated stress at several measurement points, a nonlinear model based on the SVM is defined as follows:

$$\begin{array}{*{20}c} {Y = {\rm{SVM}}(X),}\\ {X = ({x_1},\;{x_2},\;\; \cdots, \;{x_n}),}\\ {Y = ({y_1},\;{y_2},\;\; \cdots, \;{y_m}),} \end{array}$$
(10)

where Y is the measured stress of the engineering site, X is the estimated boundary condition, and m and n are the numbers of input variables and output variables, respectively.

Thus, this constrained optimization problem can be expressed as

$$\begin{array}{*{20}c} {\min E({x_1},\;{x_2},\; \cdots, \;{x_n}) = {1 \over m}\sum\limits_{i = 1}^m {{{({y_i} - {\rm{SV}}{{\rm{M}}_i})}^2}},}\\ {\;\;x_i^{\rm{a}} \leq {x_i} \leq x_i^{\rm{b}},\;\;\;\;i = 1,\;2,\; \cdots, \; n,\quad } \end{array}$$
(11)

where SVM i represents the stress predicted by the SVM algorithm, and \({x_i}^{\rm{a}}\) and \({x_i}^{\rm{b}}\) are the upper and lower bounds of x i , respectively.

Therefore, the objective of the algorithm is to minimize the function min E(X). Fig. 2 shows an overview of the SR-DE-SVM algorithm. The recognition process of the boundary conditions is summarized briefly in the following steps:

  • Step 1: The stepwise regression algorithm attempts to determine the range of parameters to be recognized and the genetic parameters (Feng et al., 2000). This avoids the blindness of the sample generation based on Eq. (1).

  • Step 2: Inversion parameter samples are created based on uniform design.

  • Step 3: The 3D geological mechanics model of the engineering site is established completely. After the numerical calculation of each combination of boundary conditions, the corresponding calculated stress is obtained at several measurement points. Each combination of boundary conditions with its corresponding calculated stress comprises a so-called learning sample.

  • Step 4: Based on the object function, Eqs. (5) and (6), the DE approach is typically used to discover optimal parameters for SVM modeling.

  • Step 5: A nonlinear model based on DE-SVM is created to describe the relationship between the boundary conditions to be estimated and the actual measured stress.

  • Step 6: Based on the object function, Eq. (11), the SR-DE-SVM algorithm is used to refine the search using the input-output information generated by the random search. Using the optimal inversion parameters, the distribution of the in situ stress field can then be obtained for the engineering site by computerized simulation.

Fig. 2
figure 2

Flow chart of the SR-DE-SVM algorithm

4 Application to the Huangdeng hydropower station

4.1 On-site in situ stress measurement in underground cavern zones

4.1.1 Brief description of geological features

The Huangdeng hydropower station is located in Lanping County, Nujiang, Yunnan Province, China. It is part of the fifth stage of a reservoir with seven stations planned along the Lantsang River, and connects the Toba hydropower station upstream to the Dahuaqiao station downstream. Its power generation system is located on the left bank of the station hydropower complex, consisting of the power house, auxiliary plant, transformer chamber, tailrace surge chamber, overhaul gate chamber, pressure tunnels, tailrace branch tunnels, and tailrace tunnels, with large-scale caverns arranged in a crisscross pattern.

Tectonically, the Huangdeng hydropower station is situated on the east side of overturned synclines and on the west side of overturned anticlines in Kedengjian. The rock stratum runs perpendicular to the river at attitudes of N10°-20°E, NW∠75°-90°. Generally, the rock stratum in the power house, auxiliary plant room, transformer chamber, maintenance chamber of the unit tail water, and the surge tank is composed of metamorphic volcanic breccias, a metamorphic volcanic granule conglomerate mixed with metamorphic tuff, whose rock quality designations (RQDs) are 75%–90% and 60%–75%, respectively, and in the corresponding Q-system for rock mass classification Q=10–40 and Q=1–10, respectively, in moderately weathered, slightly weathered, and fresh states. Fig. 3 shows the main faults crossed over by the Huangdeng underground cavern group at an elevation of 1480 m, in which five main geological faults, F9, F230-1, F14, f20, and f230-1, and tuff interlayers tp230-1 are a key focus of attention. The rock is relatively intact and slightly permeable, meaning that although the geological conditions are relatively complex in this region, the lithology is rather uniform.

Fig. 3
figure 3

Main faults crossed over by the Huangdeng un-derground cavern group at an elevation of 1480 m

Furthermore, the Huangdeng engineering area lies in the secondary tectonic units from the central part of the geosynclinal fold system in Tanggula, Lanping, and Simao, which is located in the southwestern part of the Sichuan-Yunnan rhombic block, and the northern parts of the Indochina and Yunnan-Bruma blocks. The crustal blocks show the characteristics of heterogeneous and asymmetrical uplifting in the whole engineering area. This means that, in general, the terrain is higher in the north and lower in the south. Note that it is part of active structure zones. The direction of fault structures is NNW or trending towards a NS direction, and the faults can be divided into two major types, the Early and Middle Pleistocene and the Quaternary faults. However, no Late Pleistocene faults are found in the underground cavern group zone, and there are no fault structures in the region that might induce a strong destructive earthquake. Moreover, the Huangdeng project is located in the interior of active blocks, and no large or deep active faults cross over the region. Furthermore, it does not lie at the boundary of geotectonic and neo-tectonic areas, and no moderate or intense earthquakes have been recorded. Therefore, on the whole, the Huangdeng engineering region belongs to an area with good structural stability.

4.1.2 Test results and discussion

To study the in situ stress distribution of the engineering area, in situ stress measurements were made in the underground power house with high-pressure branch pipes using the aperture deforming method with three boreholes. This approach can provide information for the design of the arrangement of the engineering structures, the selection of the excavation methods, and the stability analysis and supporting design of every building. For the disposed in situ stress, the aperture distortion needs to be measured, and the 3D in situ stress state of the measuring points is obtained according to the relationship between the pitch deformation and the intrinsic in situ stress. In the feasibility phase, this project sets four groups of testing points on the left bank of the PD230 exploration audit of the power house and its inferior branch, which are all located at an elevation of 1527.3 m. A sketch map of the in situ stress measuring points is shown in Fig. 4. Using the method of aperture deformation with three boreholes, four measured stresses could be obtained: σ230-1, σ230-2, σ230-3, and σ230-4. The results are shown in Table 1.

Fig. 4
figure 4

Layout of the in situ stress measuring points in the underground cavern zones

Table 1 Principal stress results from in situ stress measurements in the underground cavern zones

Both Fig. 4 and Table 1 show that, although the maximum, middle, and minimum principal stresses present significantly dispersed features, they still show a certain regularity. As observed from the maximum measured in situ stress, the σ1 values for all measured locations are actually in the pressure stress state, with magnitudes between 7.0 MPa and 15.0 MPa. Furthermore, σ1 increases as the measuring point depth increases. The values of σ2 for all measured locations ranged from 2.5 MPa to 7.5 MPa, and those of σ3 ranged from 1.5 MPa to 5.0 MPa. Moreover, the measured maximum principal stress σ1 was consistent in its direction, which was south-north or east-west, with a dip angle of S15°-40°E with a positive gradient slope. Its dip angle was slightly smaller than its slope angle, which decreased as the point depth increased.

Generally, the angle ϕ between the direction of the measured maximum principal stress σ1 and the longitudinal axis of the power house should be kept within 15°–30°, according to statistical data from several in-built underground cavern projects in China (Table 2). As Table 1 shows, the direction of the measured maximum principal stress σ1 is S39.7°E. Therefore, the direction of the longitudinal axis of the power house is determined as NW294.5°. Considering the influence of the geological conditions of the engineering area and the overall coordination of the project, this means that the angle ϕ is equal to 25.8°. We conclude that the angles between the direction of the measured maximum principal stress σ1 and the longitudinal axis of the transformer chamber and the tailrace surge chamber are acute angles, which is beneficial to the stability of the whole underground cavern group.

Table 2 Statistical data from several in-built underground cavern projects in China

Fig. 5 plots a large dataset of stress magnitudes versus depth based on national data from typical hydropower projects in China, including the Huangdeng hydropower station, and shows the minimum and maximum envelope lines reported by different researchers (Feng et al., 2000; Xue and Chen, 2006; Zhao et al., 2007; Zhang, 2011; Li et al., 2012; Liu et al., 2014). The K values have been calculated as defined by Brown and Hoek (1978), K=(σHh)/(2σV), to compare the envelope lines expressed by scatter points in Fig. 5, and σH and σh represent horizontal stresses in two directions of a particular plane, while σV represents the vertical stress. The four solid scatter points represent the actual in situ stress measurements of the Huangdeng hydropower station. Table 3 shows the relationship among KH, Kh, and depth for the other main hydropower projects in China, defining KH=σH/σV and Kh=σh/σV. The maximum envelope lines are generally similar, with less data scatter for depths greater than 1000 m, with K tending toward 1.0 with depth. The minimum envelope lines show greater discrepancies for depths below 500 m. These results show that at typical depths for most underground excavations, K-depth relationships cannot be used in practice.

Fig. 5
figure 5

K -depth relationships based on national data for China

Table 3 K -depth relationships for typical hydropower projects in China

4.2 Numerical calculation model

Having comprehensively considered many factors, such as the characteristics of topography and geomorphology, an elaborate numerical simulation model was built based on the characteristics of the layout and structure of the actual engineering project. Particular attention was paid to the modeling range, affected zones, and boundary constraint conditions of the Huangdeng underground cavern group. The scope of the computational model was 478 m in the x-direction, 450 m in the y-direction, and 596 m from the isolated rock-mass, to eliminate the effect of artificial boundary errors on the structure of the underground caverns (Fig. 6). There are more than 104 583 nodes and 618 027 quadrilateral elements in this numerical calculation model, including five main geological faults, F9, F230-1, F14, f20, and f230-1, and tuff interlayers tp230-1 located in the underground cavern area.

Fig. 6
figure 6

Numerical simulation model of the Huangdeng underground cavern group

The Mohr-Coulomb model was adopted in the Abaqus analysis. In the inversion of the in situ stress field, the simulation of land surface denudation mainly entails taking the highest point of the calculated region as a baseline, which is necessary to choose an appropriate region. Gravity stress revises the coefficient parameter of the volume-weight to serve as an influence parameter that is expected to be output; tectonic stress is created by applying the displacement boundary in the computing area. A gradual elastoplastic excavation simulation is adopted to record the denudation effect of unloading. Because roller boundary conditions are reasonable for the pre-excavation process of the in situ stress field inversion (Zhao et al., 2012), the bottom edge constraint is applied for normal constraints, while the outer boundary of the rock formation is considered to be a free boundary (Zhang and Yin, 2014). The physical and mechanical properties of the rock-mass used in the numerical experiment, which are taken mainly from the results of previous laboratory tests, are given in Table 4. In other words, the whole process of numerical simulation follows the principle of geometry simulation, constitutive simulation, and stress state simulation.

Table 4 Physical properties of the rock-mass applied in the numerical model

4.3 Recognition of the boundary conditions to be determined

4.3.1 Process of undetermined parameter recognition

Based on the coordinate system of the numerical simulation model, the principal stress results from the in situ stress measurements were converted into stress components (Table 5). The SR-DE-SVM method is proposed to obtain optimal values of parameters that best match the modeling prediction with the measured stress data. Specifically, the appropriate boundary conditions for the numerical calculation model, such as the gravity correction coefficient and various tectonic movement patterns, are recognized. SR is used to search for the best parameter system, having a decisive role in the formation process of the in situ stress field, which is expressed using the C# programing language. Based on the results of SR analysis, the shear structure factors τxy and τyz were regarded as unsatisfactory results under the boundary conditions. In other words, five undetermined parameters were defined in the inversion analysis, including the gravity correction coefficient g* and other shear structure factors τ xx , τ yy , τ yx , and τ xz . Furthermore, the ranges of identification parameters were determined as follows: g*∈[7.85, 17.66] m/s2, τ xx ∈[0.10, 0.25] m, τ yy ∈[0.00, 0.15] m, τ yx ∈[0.25, 0.45] m, and τ xz ∈[0.10, 0.25] m. Finally, 35 training and test samples with the calculated stress components at measured points and undetermined parameters of boundary conditions were created by uniform design U35(353).

Table 5 Stress components of in situ stress measurement in the underground cavern zone

4.3.2 Process of undetermined parameter recognition

In this section, the results of the proposed DE-SVM method are verified against the data generated in Section 4.3.1. Taking 30 schemes as training samples and five schemes as test samples, the DE-SVM algorithm can be used to obtain the best parameters of c and k in SVM modeling. After conducting learning and training using the corresponding parameters input from the training samples, a nonlinear mapping model was established between the measured stress of the engineering site and the boundary conditions to be estimated. Fig. 7 shows the forecast effect tests of SVM for five test samples. The tendency is for the predictions by SVM to be close to the stress values of the testing samples, meaning that the nonlinear relationship between the measured stress and unknown boundary conditions can be reflected by a nonlinear model based on DE-SVM.

Fig. 7
figure 7

Forecast effect tests of measured points by SVM

(a) σ230-1; (b) σ230-2; (c) σ230-3; (d) σ230-4

4.3.3 Definition of boundary conditions

In this section, the optimal values of the parameters from the measured stress data obtained using the SR-DE-SVM nonlinear reflection model are compared with those obtained by an MLR analysis method and an ANN method to estimate the in situ stress field for the actual underground cavern at the Huangdeng hydropower station. Fig. 8 shows the three algorithms used to explore the boundary conditions.

Fig. 8
figure 8

Various algorithms used for boundary condition exploration

First, a nonlinear model based on DE-SVM was introduced to the objective function, Eq. (12), to explore the best matched values of the boundary conditions. Based on the results of SR, the initial values of five optimization variables were set as follows: g*= 11.78 m/s2, τ xx =0.18 m, τ yy =0.08 m, τ yx =0.35 m, and τ xz =0.15 m. The population size N p was 20, and the maximum number of iterations Gmax was 500. The value of the scaling factor F strongly influences the rate of convergence and the crossover constant CR in the optimized recognition process, both of which are determined by the user. Fortunately, the convergence value tends to be stable, which means that there is almost no difference for different values of F and CR. When the DE curve became astringed, the optimal undetermined parameters were obtained (Table 6). The inversion results of the MLR and ANN methods are also presented for comparison.

Table 6 Stress components of the in situ stress measurement in the underground cavern zone

Because of the limited number of measuring points, the MLR and ANN methods, which are widely applied, were selected to provide a reasonable comparison with the SR-DE-SVM algorithm. The procedure used in this study showed superior accuracy and efficiency in the inversion analysis of the in situ stress field. To verify the rationality of the identification parameters, the obtained boundary conditions of different algorithms were input into a numerical calculation model for elastic-plastic finite element analysis. Tables 7 and 8 show the computational stress components at four monitoring points.

Table 7 Computational stress components at four monitoring points
Table 8 Computational stress components at selected nodes

Fig. 9 compares the normal stress at each of the four monitoring points with three different results respectively, while Fig. 10 compares the normal stress at each of the six selected nodes A–F, which were chosen on Pile No. K0+070 m cross section in the x-direction (Fig. 11).

Fig. 9
figure 9

Comparison of normal stress at each of the four monitoring points: (a) σ x ; (b) σ y ; (c) σ z

Fig. 10
figure 10

Comparison of normal stress at each selected node: (a) σ x ; (b) σ y ; (c) σ z

Fig. 11
figure 11

Selected nodes on Pile No. K0+070 m cross section in the x -direction

The results of each algorithm are in good agreement with the monitored in situ stresses, especially for the ANN and SR-DE-SVM methods. Furthermore, note that for each of the six selected points, the results of all the algorithms are very consistent. Thus, the efficiency and error comparison of MLR, ANN, and SR-DE-SVM in Table 9 shows that the three methods have a similar time cost for model building. However, the computing analysis time of the SR-DE-SVM algorithm was only 4489 s, compared with 6236 s for MLR and 5816 s for ANN due to the reduction in the parameters of boundary conditions to be applied greatly improving efficiency. More importantly, the model performance of SR-DE-SVM appears to provide the best match, with a maximum absolute error of only 1.50 MPa and a maximum relative error of only 9.09%.

Table 9 Model efficiency and error comparison in the recognition of boundary conditions

4.4 Distribution analysis of the in situ stress field

The distribution of the in situ stress field at the Huangdeng engineering site was successfully obtained after optimal boundary conditions from the SR-DE-SVM algorithm were input into a numerical calculation model for elastic-plastic finite element analysis.

4.4.1 Characteristics of vertical subsidence and horizontal displacement

Based on the power house excavation from step 1 through step 6, the curves of vertical subsidence (Fig. 12) and horizontal displacements (Fig. 13) in different excavation stages are shown to facilitate the comparative study under different stress fields, including an ideal gravity stress field and the estimated and practical in situ stress field.

Fig. 12
figure 12

Curves of subsidence induced by excavation in an ideal gravity stress field (a) and the estimated and practical in situ stress field (b)

Fig. 13
figure 13

Curves of the horizontal displacement induced by excavation in an ideal gravity stress field (a) and the es-timated and practical in situ stress field (b)

In the ideal gravity stress field, the subsidence curves of the selected analysis baseline in the Huangdeng numerical calculation model are smooth and symmetrical (Fig. 12a). With increasing excavation depth, the subsidence increases markedly, and the maximum subsiding center migrates slowly to the longitudinal axis of the power house.

In the estimated and practical in situ stress field, the subsidence curves of the selected analysis baseline show different characteristics at different excavation stages (Fig. 12b). In the initial stage of single-level excavation, although the subsidence curves are similar, the maximum subsidence value is only 72.8% of those in the gravity stress field. In the subsequent excavation steps, the maximum subsiding center gradually migrates from the primary longitudinal axis to the upstream side of the power house (the left side of the coordinate) with excavation depth during the excavation process. Moreover, the subsidence values are all larger than those of each excavation step in the ideal gravity stress field. This means that the migration rate is larger in the estimated and practical in situ stress field than in the gravity stress field, and the maximum subsidence increases 21.7% compared with the latter. Thus, the tectonic stress field is closely linked to the displacement field as well as the stability of the surrounding rock-mass in the engineering site, which appears to be non-negligible.

The horizontal displacements changed at different stages of excavation in different stress fields (Fig. 13). In general, the horizontal displacements of the downstream side of the power house (the right side of the coordinate) were much larger than those of the upstream side in the ideal gravity stress field. Furthermore, the maximum horizontal displacement of the downstream side of the power house was 2.24 times that of the upstream side in the gravity stress field (Fig. 13a). In contrast, the maximum horizontal displacement of the downstream side of the power house was only 1.59 times that of the upstream side in Fig. 13b, but the horizontal displacement and its distribution range expanded prominently because of the occurrence of the practical tectonic stress field.

4.4.2 Characteristics of in situ stress in the area of the power house along its depth

Taking the power house as an example, Fig. 14 shows the distribution curves of the normal stresses σH, σh, and σV of the selected analysis baseline (the longitudinal axis of the power house) along its depth. With increasing rock-mass depth, the normal stresses σH, σh, and σV increase linearly in the area of the underground rock-mass without geological faults as well as in those with tuff interlayers incised. In detail, σH ranges from 10.5 MPa to 15.1 MPa in the buried depth range of the Huangdeng underground cavern group, while σh is between 7.45 MPa and 11.3 MPa, and σV is between 7.36 MPa and 12.0 MPa. Thus, the change rate of the vertical stress σV seems to be more sensitive to the increase in the rock-mass depth. Around the surface area of the rock-mass, the in situ stress follows the size relationship of each normal stress σV<σh<σH, but when the buried depth exceeds a certain value of 200 m, the relationship changes to σh<σV<σH.

Fig. 14
figure 14

Distribution of in situ stress in the area of the power house along its depth

Furthermore, Fig. 15 shows the distribution curves of KH and Kh of the selected analysis baseline (the longitudinal axis of the power house) along its depth. KH and Kh decrease gradually with increasing rock-mass depth, with a particularly sharp decrease near the surface of the rock-mass. The decrease rates of KH and Kh are relatively stable in the buried depth range of the Huangdeng underground cavern group, with KH∈(1.26, 1.43), and Kh∈(0.94, 1.01).

Fig. 15
figure 15

Distribution of K in the area of the power house along its depth

σH ranges from 10.5 MPa to 15.1 MPa in the buried depth range of the Huangdeng underground cavern group, while σh and σV are between 7.45 MPa and 11.3 MPa, and between 7.36 MPa and 12.0 MPa, respectively. Thus, the change rate of the vertical stress σV seems to be more sensitive to the increase in the rock-mass depth. The influence mechanism of the tectonic movements to the in situ stress field is more complicated closer to the surface of the rock-mass, which ultimately results in the instability of K. Therefore, prioritizing K for a rock-mass whose buried depth exceeds 200 m, the function of the optimum regression curve is reasonably expressed by the formula K=105/H+0.80, which is highly consistent with the relevant results of Zhao et al. (2007).

5 Validation tests: Nuozhadu underground cavern groups

Validation tests were conducted by comparing the results produced by the SR-DE-SVM algorithm and the measured in situ stresses in the Nuozhadu underground cavern groups. The in situ measurements of the Nuozhadu underground cavern groups were made by the Hydro-China Kunming Engineering Corporation. As the measured in situ stresses show (Table 10), the stress level is low in general, because the buried depth of the underground cavern is not very deep. The magnitude and direction of in situ stress are influenced mainly by the geological structure, topography of the engineering area, and the gravity stress field. The numerical analysis was simulated using the SR-DE-SVM algorithm in the Abaqus codes. Table 10 compares the measured in situ stresses with results produced by the SR-DE-SVM algorithm in the Nuozhadu underground cavern group. Fig. 16 compares the normal stress at each of the four monitoring points in the Nuozhadu underground cavern group intuitively. The results demonstrate high precision of the in situ stress field in the Nuozhadu underground cavern area, which means that the in situ stress distribution within the engineering area can be estimated effectively using the SR-DE-SVM algorithm.

Fig. 16
figure 16

Comparison of each normal stress at each of the four monitoring points in the Nuozhadu underground cavern group: (a) σ x ; (b) σ y ; (c) σ z

Table 10 Comparison between the measured in situ stresses and results produced by the SR-DE-SVM algorithm in the Nuozhadu underground cavern group

6 Conclusions

Due to the complexity and importance of the in situ stress field, it is necessary to learn about the magnitude and direction of an in situ stress field for any underground project according to its specific geological conditions. The SR-DE-SVM algorithm, which is a nonlinear global optimization method combining SR, DE, SVM, and numerical analysis techniques, was presented for estimating the in situ stress in the rock-mass. Through the example of the Huangdeng hydropower station, the SR-DE-SVM approach was shown to maintain a consistently high solution accuracy for in situ stress estimation, particularly compared with two contemporary approaches, namely, the MLR method and the ANN surrogate direct inversion approach.

With increasing excavation depth, the subsidence trough maintains a single subsiding center throughout excavation in the gravity stress field. However, in the high-level tectonic stress field, the maximum subsiding center migrates gradually from the primary longitudinal axis to the upstream side of the power house with excavation depth, and the horizontal displacement and its distribution range expand markedly because of the occurrence of the practical tectonic stress field.

In the Huangdeng underground cavern group, the in situ stress is moderate and originates mainly from gravity and the geological tectonic stress field. According to the finite element analysis, the category of the surrounding rocks does not have a serious effect on the in situ stress field at the engineering site. With increasing rock-mass depth, the normal stresses increase linearly in the area of the underground rock-mass without geological faults and tuff interlayers crossed over in areas.

Validation tests were conducted by comparing the results produced by the SR-DE-SVM algorithm with the measured in situ stresses in the Nuozhadu underground cavern groups. The tests showed that the methodology described provides attractive tools to represent the nonlinear relations between boundary conditions and in situ stresses in the rock-mass and may be applied to other rock-masses to model the in situ stress field. This method can clarify the dominant cause of the formation of in situ stress specific to various practical projects to obtain an approximate solution that can provide in situ stress information for the design of dams and tunnels.