1 Introduction

In the application and development of aero-engine, unbalance force caused by unbalance mass in the multiple discs of the low-pressure fan section and the high-pressure compressor section will cause serious vibration, which is one of the most major vibration sources of aero-engine. However, the research on the unbalanced vibration response characteristics of aero-engine often faces the following problems and challenges. Firstly, complex structure and narrow internal space of the dual-rotor system make it difficult to place sensor. Secondly, it is the commonly used method in the field of engineering practice that obtaining a reliable response values in steady state of the system running through multiple start-stops, which costs a lot in time and economy.

At present, the researches of rotor unbalance are mainly the exploration of vibration characteristics. AL-Shudeifat et al. [1] numerically and experimentally found the change in the unbalance force angle with respect to the crack opening direction significantly altered the values of the critical whirl speeds and their corresponding peak whirl amplitudes in cracked rotor-bearing-disk systems for starting up operations. The research of Gao, P.’s revealed that the increasement of corresponding critical speeds and the vibration amplitudes of rotors happened, as the unbalances in LP and HP rotors mainly increased in a force model for the inter-shaft bearing with a local defect on the surface of the outer race or the inner race [2]. Cao et al. [3] analyzed quantitatively the effect of angular speed fluctuation on vibration responses of the unbalanced rotor, the result of which showed the speed fluctuation produced apparent frequency modulation, phase distortion and amplitude error of the unbalance. The research of Ref. [4] showed the sensitive intervals of unbalance vibration in the input side and output side are respectively in lower frequency and in higher frequency in a gas turbine rotor system. Zhang [5] proposed a non-whole beat correlation method to identify the unbalance responses, which was proved to be feasible and practicable from the numerical simulation and balancing experiment.

It is a fitting technology for the surrogate model that predicts the response value in the unknown situation using those in the known situation. Its essence is to approximately express the relationship between the input and output data through establishing the mapping between them, taking the fitting precision and the prediction precision as the constraints. With the application and development of surrogate model technology for more than 40 years, it has been quite mature in the applications of optimization design [6,7,8,9,10,11,12] and parameter identification [13,14,15] of complex engineering problems, replacing the high-precision model with heavy computation and solving the problem that analytical model cannot be established in some engineering fields. The introduction of surrogate model technology in the dual-rotor system, can establish the model vibration response of by the use of limited sample data to efficiently achieve accurate prediction for the unknown.

In the field of rotor dynamics, some progress in the application of surrogate model technology has been made. The research of the Ref. [16] illustrated the effectiveness of Kriging when predicting the critical speeds and the vibration amplitudes of a single flexible rotor modelled by analytic method. The research of the Ref. [17] used a polynomial surrogate method to effectively analysis steady-state response of cracked rotors with uncertain-but-bounded parameters by numerical simulations. Gu et al. [18] realized the identification of single-point unbalance parameters of the single-rotor system model, based on the PSO-SVR model, but there are not results for the multi-point and the dual-rotor. The research of the Ref. [19] presented a new method based on an improved Kriging surrogate model and evolutionary algorithm (IKSMEA), which was proved to effectively and accurately identify the structure parameters of a nonlinear rotor-bearing system by numerical studies and experimental validation.

In this study, multi-measuring-point metamodellings of a typical dual-rotor system with double unbalance disks were established based on MARS, RBF and Kriging in sequence, and successfully predicted the vibration amplitudes under the unknown working conditions which were randomly sampled with uniform distribution, proving the application feasibility in this field. The researches applying the metamodelling methods for predicting the vibration response of the dual-rotor system are quite few, so this paper is a supplement. This is an exploratory practice of applying the relatively mature mathematical technology to the engineering field, which can provide the reference for the dynamics balance and design of dual-rotor aero-engine.

2 Object, approach and algorithms

2.1 Object

Figures 1 and 2 show the rotor structure of a certain type of aero-engine, in which the numbers, 1–6, represent six bearing supports in the whole rotor system. Based on the principles of the structural similarity (1:2 reduced scale and similar characteristics of mass and moment of inertia) and dynamic similarity (the characteristics of first three order modes and Campbell diagrams of the high-pressure rotor, low-pressure rotor and the dual-rotor system are consistent with those of the prototype), a scaled model is obtained, as shown in Fig. 3. It is mainly composed of four parts, low-pressure fan disc (LPC), low-pressure turbine disc (LPT), high-pressure compressor disc (HPC) and high-pressure turbine disc (HPT). In Fig. 3, there are five supportings, among which bearings 1#, 2# and 5# support the LP rotor, and bearings 3# and 4# support the HP rotor; there are nine measuring points deployed in the rotors, with the purpose of monitoring the vibration responses. The measuring points 1, 2, 3, 4 and 5 are at the LP part, among which, measuring point 1 is the center of supporting 1#, measuring point 2 is the center of mass of the LPC, measuring point 3 is the center of supporting 2#, measuring point 4 is the center of mass of LPT, and measuring point 5 is the center of supporting 5#. The measuring points 6, 7, 8 and 9 are at the HP part, among which, measuring point 6 is the center of supporting 3#, measuring point 7 is the center of mass of the HPC, measuring point 8 is the center of mass of the HPT, and measuring point 9 is the center of supporting 4#.

Fig. 1
figure 1

The simplified model of a dual-rotor system

Fig. 2
figure 2

3D solid model of the dual-rotor system

Fig. 3
figure 3

The simplified model of a dual-rotor system

2.2 Approach

The achievement of prediction process includes the following three steps: (1) selecting samples and sample quantity in the variable space by the experimental design method; (2) calculating the output response variables of each sample for the research object, and then, establishing the metamodellings by training points; (3) doing the prediction research and verifying the prediction accuracy of the results by testing points. The analysis process is shown in Fig. 4, in which letters, a to f, represent the orders of parameters inputted or outputted.

Fig. 4
figure 4

Schematic diagram of predictions

Design of experiment (DOE) is a scientific method to research the correlation between multiple factors and response variables [20]. Common DOE methods include Full Factorials Design, Orthogonal Experiment Design, Latin.

Hypercube Experiment Design, etc. Latin Hypercube Experimental Design is a random sampling method with the feature “space filling”, which can ensure that the entire variable space is covered by sample points, so adopted to determine mass and phase of unbalance disks of training sets in the paper. It can be taken as the principle that the number of training points is 10 times that of design variables [21]. There are four variables in the variable space: unbalance mass and phase of the LPC and HPC, so forty is the number of training samples. In order to verify the accuracy of the established models, the eleven testing points are randomly sampled with uniform distribution in the variable space. The distributions of training points and testing points are shown in Figs. 5 and 6.

Fig. 5
figure 5

The distribution of training points

Fig. 6
figure 6

The distribution of testing points

Then, unbalance parameters are set so that the dynamic simulations in ADAMS are carried out to obtain the vibration responses of multiple measuring-points under 51 working conditions. ADAMS software is the most excellent dynamic simulation software of mechanical system developed by MDI. It is one of the most authoritative and widely used dynamic analysis software of mechanical system in the world. It applies Lagrange method to calculate multi-body dynamics, which is a relative coordinate method. The form of its dynamic equation is the second order differential equations of Lagrange coordinate matrix, namely \(A\left( {q,t} \right)\ddot{q} = B\left( {q,\dot{q},t} \right)\). It was first proposed to solve the problem of spacecraft and has been widely used until now. Its advantages are that the number of equations is the least, the number of coordinates of the tree topology system is equal to the degree of freedom of the system, and the dynamics equation is easily converted into ordinary differential equations.

After this process, the time-domain signals are extracted and the response values under the excitation of rotational frequencies N1 and N2, namely the vibration amplitudes, are obtained through the Fast Fourier Transform (FFT), as shown in Fig. 7; N1 (40 Hz) represents the rotational frequency of the LP rotor, and N2 (117 Hz) represents the rotational frequency of the HP rotor; the array, (N1, N2), is the crawling state of the dual-rotor system. From Fig. 7, the law can be seen that unbalance vibration of the dual-rotor system is mainly excited by rotation speeds and it is different from the single rotor that the more complicated structure of the dual-rotor will cause coupled oscillation between LP rotor and HP rotor that amplitudes excited by N1 and N2 are distributed in the whole system. What’s more, the unbalance vibration contributions from N1 and N2 vary as there exist different unbalance distributions in LPC and HPC, so this is the core of the work.

Fig. 7
figure 7

Time domain graph, spectrum graph and graph of axle center trail at the measuring point 1

The simulation results of training points are shown in Fig. 8. The simulation results of testing points are shown in Table 1. The Pearson correlation coefficient R*,Y for training and test sets can be separately found respectively in Tables 2, 3, 4 and 5, in which Y means output parameters (amplitudes) and * represents input parameters that A and B are respectively unbalance mass and angle phase of LPC while C and D are respectively unbalance mass and phase angle of HPC. From Tables 2 and 4, it can be seen that the amplitude under N1 is strongly correlated with unbalance mass of LPC, mediumly correlated with unbalance phase angle of LPC while weakly and weakly negatively correlated with unbalance mass and phase angle of HPC respectively; from Tables 3 and 5, it can be seen that the amplitude under N2 is strongly correlated with unbalance mass of HPC, mediumly correlated with unbalance phase angle of HPC while weakly and negatively correlated with unbalance mass and phase angle of LPC respectively.

Fig. 8
figure 8

Amplitudes excited by N1 and N2 based on training points a under N1, b under N2

Table 1 The partial results based on testing points/μm
Table 2 The Pearson correlation coefficient R*,Y for training sets under N1
Table 3 The Pearson correlation coefficient R*,Y for training sets under N2
Table 4 The Pearson correlation coefficient R*,Y for testing sets under N1
Table 5 The Pearson correlation coefficient R*,Y for testing sets under N2

In this work, the mean square error (MSE) and error rate of the vibration response value are used as the error analysis standards of the surrogate models.

MSE is expressed as,

$${\text{MSE = }}\frac{1}{N}\sum\limits_{i = 1}^{N} {\left( {y_{i} - y_{i}^{{\prime }} } \right)^{2} }$$
(1)

where N is the number of testing points, yi is the simulation result, and yi′ is the corresponding predicted response values. The smaller the value of MSE is, the closer it is to 0, the higher precision of the model.

If μsimulate and \(\mu_{predicted}^{*}\) respectively represent response value through simulation and prediction of the testing samples excited by the rotate frequency, the expression of error rate, \(\varepsilon_{\mu }^{*}\) is defined as,

$$\varepsilon_{\mu }^{ *} { = }\frac{{\mu_{simulate} - \mu_{predicted}^{*} }}{{\mu_{simulate} }} \times 100\%$$
(2)

where * refers to the type of the surrogate models, which can be MARS, RBF, and Kriging.

2.3 Algorithms

In this study, the surrogate models of vibration responses at each measuring point are established based on the following three algorithms where linear functions are all important components, all of which are multiple-input and single-output. The input variables are unbalance parameters of the training samples, while the output variables are the corresponding vibration responses. The method of multivariate adaptive regression splines, shorted as MARS, is a form of non-parametric regression analysis to model nonlinearities and interactions between variables [22], and chooses piecewise truncated linear functions to estimate the Eq. (3) expressed by the input variable x.

$$y = f(x) + \varepsilon$$
(3)

where ε is the predicted error, \(x = (x_{1} ,x_{2} , \cdots ,x_{p} )^{T}\) is input variables, p is the number of training points, and the column number of matrix x is the number of response variable y.

MARS is to form reflected pairs for the each predictor variable, xj, j ∈ {1, , p} with corresponding knots at all distinct relating values, xij, i ∈ {1, , n}, where n is the sample size, expressed by the set C in Eq. (4).

$$C = \left\{ {\left( {x_{j} - t} \right)_{ + } ,\left( {t - x_{j} } \right)_{ + } \left| \begin{array}{l} t \in \left\{ {x_{1j} ,x_{2j} , \cdots ,x_{nj} } \right\}, \hfill \\ j \in \left\{ {1, \cdots ,p} \right\} \hfill \\ \end{array} \right.} \right\}$$
(4)

For a given vector of predictor variables x and the target variable f(x), the form of the MARS model approximating the function in Eq. (3) is defined as

$$f\left( x \right) = \beta_{0} + \sum\limits_{m = 1}^{M} {\beta_{m} \cdot B_{m} \left( x \right)}$$
(5)

where Bm(x) represents a truncated linear function from set C or tensor product of more than one function, and M is the number of truncated linear function in the current model [22, 23]. For multiple variable cases, multiplying an existing piecewise linear basis function (BF) with a truncated linear function involving a new variable where both components are nonzero can create the interaction terms which is nonzero only over the space of predictors in MARS [24].

For a real problem, the model building strategy of radial basis function (RBF) is to construct an approximate model by linear superposition of radial basis functions, expressed as Eq. (6) [25],

$$f^{o} \left( x \right) = \sum\limits_{k = 1}^{M} {W_{ik} \cdot \Phi_{i} \left( {\left\| {x - c_{ki} } \right\|_{2} } \right)}$$
(6)

where wik is the synaptic weight between the kth neuron of the output layer of the second hidden layer and the ith neuron of the output layer, cki is the ith hidden center vector for the kth pattern class of the first hidden layer, || · ||2 is the Euclidean norm, M denotes the number of neurons in the output layer and the second hidden layer, namely the patterns class number for the training samples set, Φi(·) is the kernel function guaranteeing the required accuracy. In this work, the Multiquadric is the choice, written as Eq. (7) [26],

$$\Phi_{\text{i}} \left( {\left\| {x - c_{ki} } \right\|_{2} } \right) = \left( {\left\| {x - c_{ki} } \right\|_{2}^{2} + \sigma_{i}^{2} } \right)^{1/2}$$
(7)

where σi is the width of the receptive field.

Kriging constructs an unbiased estimation model with the minimum estimation variance characteristics of local estimation. It consists of a linear regression model and an extra random function, which, for the input variable x = (x1, , xi, , xj, , xm)T, can be defined as Eq. (8) [27],

$$G\left( x \right) = \sum\limits_{h = 1}^{p} {\beta_{h} \cdot g_{h} \left( x \right)} + z\left( x \right) = g^{T} \left( x \right)\beta + z\left( x \right)$$
(8)

The first function of Eq. (9) is a realization of a regression function and gh(x), (h = 1, 2, , p) is the basis regression function; the second is a stochastic process (random function) whose mean is 0 and the covariance of random process z(x) is

$$Cov\left[ {z\left( {x_{i} ,x_{j} } \right)} \right] = \sigma^{2} R\left( {c,x_{i} ,x_{j} } \right)$$
(9)

where R(c, xi, xj) is the correlation coefficient between z(xi) and z(xj) with parameter c and σ2 is the process variance. The correlation function used in this study is the Gaussian, which is expressed as Eq. (10),

$$R\left( {c,x_{i} ,x_{j} } \right) = \prod\limits_{n = 1}^{N} {\exp \left( { - c_{n} \left( {x_{i}^{n} - x_{j}^{n} } \right)} \right)}$$
(10)

where x ni is the nth component of xi.

Kriging selects the Best Linear Unbiased Predictor (BLUP), which minimizes the Mean Squared Errors (MSE) of the predictor [28],

$$\min_{c} MSE\left[ {G^{o} \left( {x_{i} } \right)} \right] = \min_{c} \left[ {G^{o} \left( {x_{i} } \right) - Y\left( {x_{i} } \right)} \right].$$
(11)

3 Prediction and accuracy tests

The parameters in the training points are used to establish the prediction models and those of testing set are used as input variables to predict the vibration response amplitudes at the measuring points. The specific prediction results of surrogate models are shown in Tables 6 and 7, which are corresponding with the data in Table 1 from the first three working conditions of the testing set. The experimental results and prediction results of the other 8 working conditions are shown in Tables 9, 10, 11. It can be seen that the prediction results converge to the simulation results by comparison.

Table 6 Predicted amplitudes in N1 of 3 working conditions/μm
Table 7 Predicted amplitudes in N2 of 3 working conditions

Table 8 shows the average calculation time of each algorithm when predicting. As can be seen from it, the computing efficiencies are all very high; the most efficient is RBF, followed by MARS and Kriging.

Table 8 Average computing time of the surrogate models

In fact, ADAMS simulation can also be used to predict the vibration response, showed as Tables 1 and 9. However, compared with metamodelling methods, it has three disadvantages. Firstly, ADAMS’ calculation time is about 10 s in this paper, which has a lower prediction efficiency according to Table 8 when ran in the same computer; secondly, complex multi-body dynamics simulation procedures, including the establishment of 3D model, settings of constraint conditions and driving motions, etc. all consume much time, which can be saved by metamodelling methods; thirdly, compared with surrogate model algorithm, multi-body dynamics methods require a higher computer configuration, especially CPU and RAM. So we choose the surrogate models to predict vibration response in rotor dynamics instead of the dynamics simulation.

Table 9 The residual results based on testing points/μm

Based on the simulation values and the predicted values of the surrogate models in testing points, the error rate histograms at the 9 measurement points under N1 and N2 are obtained, as shown in Figs. 9, 10 and 11. Figure 9 shows that MARS’s error rates under N1 are mostly less than 5%, and those under N2 are all ranging from − 0.1 to − 0.1%, with a better accuracy. Figure 10 shows that RBF’s error rates under N1 and N2 are all mostly less than 10%, except testing point 4, whose predicted values are obviously bigger than the corresponding simulated at all measurement points for the error rates ranging from − 10 to − 5%, no matter what in N1 or N2, according the Eq. (2) above. Figure 11 shows that Kriging’s error rates are ranging from − 0.1 to 0.1% in N2, while − 0.92% to 0.6% in N1 except those at measuring point 6 ranging from − 7.22 to − 2.15%.

Fig. 9
figure 9

Error rate histograms of MARS a under N1, b under N2

Fig. 10
figure 10

Error rate histograms of RBF a under N1, b under N2

Fig. 11
figure 11

Error rate histograms of Kriging a under N1, b under N2

If the maximum error rate 5% is taken as a standard to evaluate the predicted accuracy, the pie charts of the error rates are drawn in Figs. 12, 13 and 14. In these three figures, the Kriging’s ratios are all 100%, the RBF’s ratios are 99% and 98% respectively, and the MARS’s ratios are 97% and 100% respectively, under N1 and N2 (Tables 10, 11).

Fig. 12
figure 12

The pie charts of MARS’s error rates a under N1, b under N2

Fig. 13
figure 13

The pie charts of RBF’s error rates a under N1, b under N2

Fig. 14
figure 14

The pie charts of Kriging’s error rates a under N1, b under N2

Table 10 Predicted amplitudes in N1 of residual working conditions/μm
Table 11 Predicted amplitudes in N2 of residual working conditions/μm

The comparison of MSE values based on testing points from MARS, RBF and Kriging can be seen in Figs. 15 and 16, where Logarithmic Coordinates are used. The larger the MSE values, the smaller the ordinates.

Fig. 15
figure 15

Comparison of MSE values of the models at the measuring points under N1 a in LP rotor, b in HP rotor

Fig. 16
figure 16

Comparison of MSE values of the models at the measuring points under N2 a in LP rotor, b in HP rotor

From Fig. 15, the MSE values of the RBF model at measuring point 2 of the LP rotor are much higher than those of MARS and Kriging under N1, so the prediction accuracy of the RBF model here is much lower; at the other measuring points of the LP rotor, MSE values of all models are less than 5e-02, with high prediction accuracy. At the measuring points of the HP rotor, MSE values of all models are less than 3e-02, so the prediction accuracy is also quite high. In general, the magnitude order from the models at measurement points of the HP section is: RBF > MARS > Kriging, so the order of precision is: Kriging > MARS > RBF.

It can be seen from Fig. 16 that, at all measuring points under N2, the MSE values of MARS and Kriging are much smaller than those of RBF, with higher prediction accuracies.

4 Conclusions

In this work, based on the research finding that the vibration of dual-rotor system is caused by the coupled excitations of rotational frequencies N1 and N2, three kinds of surrogate models, MARS, RBF and Kriging, are established under finite working conditions to predict unknown unbalance responses of a simplified dual-rotor model of aero-engine. And it is proved that the predicted results have high accuracies, according to analyzation and comparison with the selected standards.

  1. 1.

    The predicted unbalance response values of these three algorithms basically converge to the corresponding simulation results under N1 and N2;

  2. 2.

    The three algorithms are high-efficiency, among which RBF is the fastest, while Kriging is the slowest by contrast.

  3. 3.

    Compared with those under N2, the error rates under N1 are significantly bigger, so the precision order is the opposite. The reason of the phenomenon is that the vibration amplitudes under N1 are significantly smaller than those under N2, so the former is more difficult to accurately predict.

  4. 4.

    When the maximum error rate of 5% is taken as an evaluation standard, the predicted results of Kriging’s are the most precise, and under N1 and N2 the MARS and the RBF respectively have better performances by contrast.

  5. 5.

    The MSE of testing points taken as an evaluation standard, the three surrogate models all have high prediction accuracies; the MSE values of RBF are obviously larger than those of the others, which accuracy is the lowest by contrast. Under N1, the Kriging’s accuracy is a little better than that of the MARS, but under N2, the two are much the same.