1 Introduction

Due to advances in computational power and simulation efficiency, whole engine models (WEMs) have become a staple tool in the design process of gas turbine engines in industry (Voutchkov et al. 2006; Arkhipov et al. 2009; Toal et al. 2014). Analyzing WEMs allows engineers to capture both the physics of inter-component interactions and also emergent behaviour that would otherwise be lost if the analysis was done on each component in isolation. For example, WEMs facilitate studies on the vibrational response of the engine, the magnitude of which needs to be minimized to increase fatigue life and reduce cabin noise (Garcia 2008). WEMs are also useful for understanding the engine’s structural behaviour during extreme scenarios, such as in the event of a Fan-Blade-Off (Bettebghor et al. 2013). The assessment of whether the engine casing is able to contain the detached fan blade requires a WEM that is accurate yet cheap to run, due to the high computational cost of the involved non-linear transient analysis.

However, the high cost of solving a WEM that is meshed using solid elements restricts the number of simulations that can be performed in the overall design process. This means that these simulations are often only used as gatekeepers for design decisions after they have been proposed, instead of as key drivers behind these decisions. It is thus necessary to simplify the WEM such that they can take on a more active role in design. The simplification needs to reduce the cost of solving the WEM without significantly affecting its structural behaviour when compared to the original model. The simplification method used in this work is medial object transformation, which is an approach for dimensionally reducing geometry.

By reducing solid CAD geometries to their medial objects, the resulting surfaces can be meshed using shell elements. We define shell meshes of this type as a medial mesh. In general, medial meshes can be much quicker to solve than solid meshes of the same geometries as larger shell elements can be used while maintaining solution accuracy. When medial meshes are used in design studies that involve only symmetric thickness changes, the time savings are even more notable as remeshing is not required. The thickness modifications can be realized by simply overwriting the physical properties of each shell element. Voutchkov et al. (2006) demonstrated this geometry modification approach in a multi-objective optimization study on a whole jet engine which employed manually constructed shell meshes. The simulation time was reduced but at a cost of increased manual effort and time during mesh construction. The MANTLE software (Modelling and Analysis in the NeuTral LinE) has been developed in Rolls-Royce to automate the medial object transformation process (Stanley 2010), and it is used to produce the medial meshes in the current work.

Evaluations of whole engine designs using medial meshes can be used to construct a relatively inexpensive surrogate model of the overall structural behaviour. The predictive accuracy of this surrogate model, with respect to whole engine evaluations using expensive solid meshes, can be further improved by augmenting the data set of medial mesh evaluations with a number of the expensive solid mesh evaluations within a multi-fidelity surrogate modelling framework. In the current work, Kriging and its auto-regressive multi-fidelity variant, Co-Kriging (Kennedy and O’Hagan 2000), are used, as the formulation provides estimates of the prediction error which can be subsequently used to search for infill points. Toal et al. (2014) demonstrated the use of Co-Kriging models for reducing the frequency of performing expensive whole engine transient thermo-mechanical simulations in the optimization of a high-pressure compressor for minimum specific fuel consumption. The thermal results were supplemented with a larger number of relatively cheap steady-state mechanical analyses. Brooks et al. (2011) optimized a transonic compressor rotor for maximum isentropic efficiency using Co-Kriging models where the different levels of fidelities were defined by the mesh resolution used for CFD. Zhang et al. (2015) utilized single- and double-sector fluid volumes of a combustor feed arm geometry to construct Co-Kriging models which were subsequently used to minimize pressure loss. To the best of the authors’ knowledge, structural optimization of whole engine models using mixed-dimensional finite element models and multi-fidelity surrogate models has not been demonstrated in the literature.

The following paper is organized as follows. Section 2 describes the medial object transformation process and how it is implemented in the MANTLE software, as well as a study to validate the simulation results of medial meshes against expensive solid meshes. Section 3 introduces Kriging and Co-Kriging. Section 4 presents two case studies to demonstrate the effectiveness of the multi-fidelity surrogate modelling framework for reducing optimization costs. Section 5 is a summary of our conclusions.

2 Medial mesh generation

2.1 The medial object

Medial object transformation is a process whereby a geometry is converted to a compact, lower-dimensional, skeletal form which preserves useful information about the properties of the original model, such as its topology and symmetry. Blum (1967) defined the medial object as the locus of a maximally-inscribed disc (for 2D shapes) or sphere (for 3D boundary representations of geometry) as it rolls within the boundaries of an input geometry (Fig. 1). This locus can take the form of (x,r(x)), where x is the disc/sphere centre and r is the radius of the disc/sphere. The medial object can also be described in terms of minimum distances (Gürsoy and Patrikalakis 1991). For closed geometries, minimum distance-to-boundary values can be computed for all internal points. Interpolating the set of internal points that have non-unique minimum distance values also gives the medial object.

Fig. 1
figure 1

A 2D shape boundary (blue curves) and its medial object (red curves) with no artificial branches at the end-faces (adapted from Wang et al. (2017)). The black dots and the solid black circles denote the medial points and the corresponding maximal discs respectively. Each medial point is defined by its coordinate xi and the radius of its maximal disc r(xi), where i is the medial point identifier. Note that not all medial points and maximal discs are shown

These definitions of the medial object are known to be challenging to apply for arbitrary geometries in a robust manner, as the medial object is highly sensitive to perturbations on the boundary. The formulation has since undergone significant changes to increase the stability of the computed medial object, as seen in the works of Tek and Kimia (2001), Katz and Pizer (2003), and Macrini et al. (2011), amongst many others. Here, we emphasize that the proprietary transformation algorithm in MANTLE does not implement any of these stability modifications, as it is currently only applied to engineering geometries which have surfaces that are smooth and well-defined. The reader is referred to Tagliasacchi et al. (2016) for a survey of the state of the art in skeletonization algorithms, including those for medial object transformation.

The key advantage of the medial object transformation approach for dimensionally reducing solid 3D geometries is its ability to accurately represent complex junctions. This is in contrast to conventional mid-surface generation tools in commercial CAD software that are commonly based on face-pairing mechanisms (Woo 2014; Zhu et al. 2016; Kulkarni et al. 2017). Figure 2a shows a cutout of an engine casing geometry which contains multiple structural members of different shapes, sizes, and orientations that intersect at several junctions. The corresponding surfaces that were generated using the mid-surface by face-pairing algorithm in Siemens NX9 Modelling are shown in Fig. 2b. The junctions consist of disconnected mid-surface patches, and this highlights the inability of these face-pairing methods for handling geometries with no obvious face pairs. Existing techniques to automatically extend and connect these patches are also unreliable. In comparison, the surfaces that were generated by the MANTLE software for the same geometry are fully connected (Fig. 2c).

Fig. 2
figure 2

Comparisonof the b mid-surface by face-pairing algorithm in Siemens NX9 Modelling with the c medial surfaces from the medial obejct transformation algorithm in MANTLE for the complex junction shown in a. Free edges are highlighted in grey. The mid-surfaces in b have been passed through the automatic edge stitching modules in Siemens NX9 Modelling

2.2 MANTLE and the medial mesh workflow

The MANTLE software, first proposed in Stanley (2010), is currently implemented via a suite of tools consisting of MATLAB functions and Siemens NX plugins that were written using the Siemens NX9 Open API. The tools are grouped into three main packages: M-2D, M-3D, and M-Core. M-2D and M-3D are the primary tools for the medial object transformation of axisymmetric and non-axisymmetric geometries, respectively. They contain functions for medial point cloud generation, duplicate node removal, and medial surface topology preservation. M-Core contains auxiliary operations that help complete the medial mesh generation process. This includes the definition of material properties, the transfer of thickness information between medial meshes, the application of boundary conditions, and post-processing of simulation results.

The MOT algorithm in MANTLE is unique in that it computes the medial point cloud from a surface mesh of the solid geometry, instead of from its boundary representation (Wang et al. 2014, 2017). This approach has several advantages:

  1. 1.

    The problem of artificial branch generation at the end-faces is eliminated since these faces can be left unmeshed and are thus ignored in the transformation process (see Fig. 1). Parallel computation of the medial object is enabled as the subdivision of geometries into segments no longer produces these branches at the location of the split-faces.

  2. 2.

    Boundary conditions and inter-component interface definitions that were applied to geometric entities can be transferred automatically to the medial mesh by tagging the surface nodes with information about their originating geometry.

The medial mesh generation process for the engine casing geometry from Fig. 2 is illustrated in Fig. 3.

  1. 1.

    The repeating pattern of the stator vane arrangement allows us to represent the casing using only a small segment of the original geometry. Tags are applied to the relevant geometric entities. The model decomposition and tagging tasks are currently performed manually in the modelling environment.

  2. 2.

    A surface mesh of the segment is generated. An element size which allows the resulting mesh to adequately represent existing curvatures (of fillets, for example) in the solid geometry is normally used.

  3. 3.

    M-3D takes the surface mesh as input, computes the medial points, and constructs the medial mesh of the segment. Tags that were originally applied to faces and edges are transferred automatically to element faces and nodes.

  4. 4.

    M-Core processes the existing split-face tags and performs the necessary segment-duplication and node-merging operations to build the complete medial mesh of the original geometry. This mesh can be solved directly but is often converted to a shell polygon for further mesh sensitivity studies. In this case, M-Core is also able to transfer thickness information between medial meshes of different resolutions.

Fig. 3
figure 3

The medial mesh generation workflow of an engine casing geometry within the MANTLE software. (1) The casing is reduced to a single-stator-vane segment. The end-faces (shown in green) and the split-faces (shown in pink) have been tagged. (2) A surface mesh of the segment is generated. The tagged faces are observed to have been excluded. (3) M-3D takes the surface mesh as input and generates its medial mesh while preserving the tagging information. (4) M-Core uses the tags to identify split-faces and subsequently constructs the complete medial mesh of the original geometry. Here, a section view of the complete medial mesh is provided to show the duplication of the medial mesh segments from step 3

2.3 Validation of medial mesh simulations against solid mesh simulations

In this section, the simulation accuracy of medial meshes with respect to solid meshes is demonstrated through two studies involving different types of simulations. The first study is a static structural simulation using a simple whole engine model, where the deformations at several common points in both models are compared. The second study is a free-free modal simulation on the casing of a low-pressure turbine from the same engine, where comparisons are made in terms of natural frequencies and mode shapes. Here we consider the evaluations from mesh-insensitive solid finite element models to be the most accurate source of structural information available to engine designers.

Figure 4 shows a section view of the whole engine. The geometry was built such that its longitudinal axis is parallel to the global x-axis, and its transverse plane is parallel to the global y-z plane. All studies were performed on a workstation which contained a 4-core Intel Xeon E5-4640 CPU clocked at 2.40 GHz and 256 GB of installed memory.

Fig. 4
figure 4

Section view of the whole engine model, its subsystems, and the locations of the output sets that were used for post-processing in the static structural analysis. There are 4 subsystems in this engine model: the fan casing (FC), the intermediate compressor casing (IC), the high-pressure turbine casing (HPT), and the low-pressure turbine casing (LPT)

2.3.1 Static structural analysis

For the solid mesh, four-sided CTETRA tetrahedral elements are used. For the medial mesh, a combination of CTRIA3 triangular shell elements and CQUAD4 quadrilateral shell elements is used. For both cases, each of the four engine subsystems are meshed independently and then assembled using RBE3 interpolation constraint elements (Siemens 2014) that are generated by M-Core. The set of boundary conditions used are obtained from a sample simulation that had been previously performed in Rolls-Royce and they correspond to operating conditions at cruise (Toal et al. 2014). Thrust loads in the form of uni-directional forces are applied to every compressor and turbine stage, while static pressure loads act normally on every surface. The engine is constrained in the y- and z-axes at the fan casing and low-pressure turbine casing mounts, and in the x-axis at the intermediate compressor casing thrust lugs. The simulations are solved using the SOL101 static structural solver in Siemens NX9 Nastran.

The displacements at a total of 960 points are extracted. The points are grouped into 15 output sets of 64 points each, and the points in each set are evenly distributed on the circumference of the WEM at a constant x-position (see Fig. 4). The output sets are post-processed to give maximum local radial displacement values which are then used as design constraints in Section 4. This maximum local radial displacement value is an approximation of the maximum pinch point metric for tip clearance studies of gas turbine engine casings (Arkhipov et al. 2009). The calculation first involves the fitting of a least-squares circle to the deformed points in each output set on the y-z plane. The radial displacement of each point is then computed as its distance from the y-z plane origin minus the radius of the least-squares circle. The least-squares circles approach accounts for rigid body translation.

The results of the mesh sensitivity study are shown in Fig. 5. A percentage difference of < 1% over all maximum local radial displacement values that are computed by the finest mesh of the corresponding type is taken to be the criteria for attaining mesh independence. Thus, the sixth coarsest meshes tested for both the solid and medial cases are considered to have converged. Table 1 compares the maximum local radial displacement values between the converged solid and medial meshes, and Table 2 is a summary of their computational requirements. For an average loss in accuracy of 5.2%, the medial-meshed engine model can be solved in 8.6% the wall time of the solid-meshed engine model.

Fig. 5
figure 5

Mesh sensitivity study of the solid and medial whole engine models. See Fig. 4 for a description of the response locations

Table 1 Comparison of the maximum local radial displacements of all outputs between the converged tetrahedral and medial WEMs
Table 2 Comparison of the computational requirements for solving the converged tetrahedral and medial whole engine models

2.3.2 Free-free modal analysis

To validate the medial mesh for dynamic analyses, the Modal Assurance Criterion (MAC) is used as it is regarded as a more suitable measure of accuracy than displacement results from static structural simulations (Chen 2001; Pastor et al. 2012; Towner and Band 2012). The MAC is a statistical indicator that is used to represent the similarity in mode shapes between two modal data sets. Besides being used for assessing mesh quality, the MAC is also commonly used to validate simulation results against physical experiments.

The MAC value between two mode shape vectors {ϕr} and {ϕt} is a normalized scalar product of the two vectors and is calculated as follows:

$$ \text{MAC}(r_{i},t_{j}) = \frac{|\{\phi_{r}\}^{T}_{i}\{\phi_{t}\}_{j}|^{2}}{(\{\phi_{r}\}^{T}_{i}\{\phi_{r}\}_{i})(\{\phi_{t}\}^{H}_{j}\{\phi_{t}\}_{j})} $$
(1)

The subscripts r and t denote the reference and test data sets respectively, while i and j are mode numbers. {ϕt}j thus refers to the modal vector of the j th mode shape from the test data set. The superscript T is a conjugate transpose operator. The results from comparing n modal vectors from the two models can be arranged into an n × n MAC matrix. The MAC values can range from 0 (no correlation in mode shapes) to 1 (perfect modal correlation). An accurate model would thus produce a MAC matrix that has values near 1 in the main diagonal and near-zero values everywhere else.

The meshes used in this study are the low-pressure turbine portions of the converged whole engine meshes from the previous study. The meshes were solved using the SOL103 real eigenvalue solver from Siemens NX9 Nastran under free-free conditions. Figure 6 shows the MAC matrix plotted in the form of a heat map, while Table 3 lists the MAC values in the main diagonal of the matrix. Within the range of interest, all main diagonal MAC values are near 1, while all off-diagonal MAC values are approximately zero. These results show that the medial mesh is also a good replacement for the expensive solid mesh for dynamic analyses.

Fig. 6
figure 6

The matrix of Modal Assurance Criterion values for the first 40 modes of the LPT casing. The horizontal axis, R (reference model), and the vertical axis, W (working model), represent the modes for the solid and medial meshes respectively. Modes 7 to 17 are critical for the current modal evaluation (modes 1 to 6 are rigid body modes and hence have near-zero frequencies, while modes 18 and above occur at frequencies past the maximum operating rotational frequency of the low-pressure turbine)

Table 3 Comparisons of Modal Assurance Criterion values and natural frequencies in the main diagonal of the matrix in Fig. 6

3 Kriging and Co-Kriging

3.1 Kriging

Kriging (Sacks et al. (1989)) is a popular surrogate modelling approach for facilitating engineering design studies that are based on deterministic computational responses. It is particularly useful for predicting responses from numerical simulations as it only makes assumptions about the smoothness of the response surface but not the degrees of freedom. The Kriging formulation also lends itself to the derivation of a prediction error metric which is useful for gauging the quality of the surrogate when high simulation costs make it difficult to do so otherwise (Keane and Nair 2005; Forrester et al. 2008; Sóbester et al. 2014).

The smoothness assumption says that the observed values for two points that are close together in the design space should be similar. The similarity in values, or correlation, between two points x(i) and x(j) in a k-dimensional design space can be expressed as,

$$ \begin{array}{@{}rcl@{}} R_{i,j} &=&\text{Corr}[f(\textbf{x}^{(i)}),f(\textbf{x}^{(j)})]\\ &=& \exp \left( -\sum\limits_{l=1}^{k} \theta_{l} |x_{l}^{(i)}-x_{l}^{(j)}|^{p_{l}} \right). \end{array} $$
(2)

A correlation matrix, R, can then be formed from all possible pairs of points in a given training data set containing n points,

$$ \boldsymbol{R} = \left( \begin{array}{ccc} \begin{array}{l} R_{1,1} \end{array} & {\cdots} &\begin{array}{l} R_{1,n} \end{array} \\ {\vdots} & {\ddots} & \vdots\\ \begin{array}{l} R_{n,1} \end{array} &{\cdots} &\begin{array}{l} R_{n,n} \end{array} \end{array} \right). $$
(3)

θl and pl are the hyperparameters of the l th design variable which control the rate of correlation decrease and the degree of smoothness, respectively. A maximum likelihood approach is used to find the values for these hyperparameters which best return the observed function values, f, for the training data. The natural logarithm of the likelihood function can be written as,

$$ \begin{array}{@{}rcl@{}} \ln(L) &= &-\frac{n}{2}\ln(2\pi) -\frac{n}{2}\ln(\sigma^{2}) - \frac{1}{2} \ln |\boldsymbol{R}|\\ &&- \frac{(\boldsymbol{f}-\boldsymbol{1}\mu)^{T}\boldsymbol{R}^{-1}(\boldsymbol{f}-\boldsymbol{1}\mu)}{2 \sigma^{2}}. \end{array} $$
(4)

Setting the partial derivatives of ln(L) to zero gives us the maximum likelihood estimates (MLE) of μ and σ2,

$$ \hat{\mu} = \frac{\boldsymbol{1}^{T}\boldsymbol{R}^{-1}\boldsymbol{f}}{\boldsymbol{1}^{T}\boldsymbol{R}^{-1}\boldsymbol{1}}, $$
(5)
$$ \hat{\sigma}^{2} = \frac{(\boldsymbol{f}-\boldsymbol{1}\hat{\mu})^{T}\boldsymbol{R}^{-1}(\boldsymbol{f}-\boldsymbol{1}\hat{\mu})}{n}. $$
(6)

Substituting the MLEs of μ and σ2 into (4) gives us what is called the concentrated likelihood function,

$$ \ln(L) = -\frac{n}{2} \ln (\hat{\sigma}^{2}) - \frac{1}{2} \ln |\boldsymbol{R}|, $$
(7)

which is then maximized to obtain the MLEs of the hyperparameters. For all Kriging and Co-Kriging models in the current work, a hybridized particle swarm algorithm was employed by the proprietary optimization software OPTIMATv2 (Toal et al. 2008, 2009, 2011) to optimize the hyperparameters.

With the hyperparameters tuned, the Kriging model can now be used to predict the response at unsampled points. The vector of correlations between an unsampled point, x(∗), and the points in the current training set can be expressed as,

$$ \boldsymbol{r} = \left( \begin{array}{cc} \text{Corr} [f(\boldsymbol{x}^{(1)}), f(\boldsymbol{x}^{(*)})] \\ {\vdots} \\ \text{Corr} [f(\boldsymbol{x}^{(n)}), f(\boldsymbol{x}^{(*)})] \end{array}\right) = \left( \begin{array}{cc} R_{1,*} \\ {\vdots} \\ R_{n,*} \end{array}\right). $$
(8)

The augmented correlation matrix for this new data set thus becomes,

$$ \tilde{\boldsymbol{R}} = \left( \begin{array}{cc} \boldsymbol{R} & \boldsymbol{r} \\ \boldsymbol{r}^{T} & 1 \end{array}\right). $$
(9)

Substituting \(\tilde {\boldsymbol {R}}\) into the last term of (4) produces the augmented ln-likelihood function which can be subsequently maximized to give the Kriging prediction of the response value at x(∗),

$$ \hat{f}(\boldsymbol{x}^{(*)}) = \hat{\mu} + \boldsymbol{r}^{T}\boldsymbol{R}^{-1}(\boldsymbol{f}-\boldsymbol{1}\hat{\mu}). $$
(10)

This predictor can be used by a global optimizer to quickly search and exploit promising regions in the design space. For a more exploratory approach, the mean square error (MSE) of the Kriging predictor can be used to drive the optimizer towards sparsely sampled regions. The MSE estimate is,

$$ s^{2}(\boldsymbol{x}^{(*)}) = \hat{\sigma}^{2} [ 1 - \boldsymbol{r}^{T}\boldsymbol{R}^{-1}\boldsymbol{r}]. $$
(11)

The MSE allows us to model the uncertainty in the predictions as a Gaussian distribution, F(x), with mean \(\hat {f}\) and variance s2. A prediction’s improvement over the current best objective, fmin, can be defined as,

$$ I(\boldsymbol{x}^{(*)}) = f_{\min} - F(\boldsymbol{x}^{(*)}). $$
(12)

The expected value of this improvement is thus,

$$ \begin{array}{@{}rcl@{}} \text{EI} &= &\frac{f_{\min}-\hat{f}(\boldsymbol{x}^{(*)})}{2}\left[1+\text{erf}\left( \frac{f_{\min}-\hat{f}(\boldsymbol{x}^{(*)})}{s\sqrt{2}} \right) \right]\\ &&+ \frac{s}{\sqrt{2\pi}} \exp \left[\frac{-(f_{\min}-\hat{f}(\boldsymbol{x}^{(*)}))^{2}}{2s^{2}} \right], \end{array} $$
(13)

where erf() is the error function.

By progressively adding maximum EI designs to the data set, a balance between exploration and exploitation of the design space is achieved. The three metrics that have been presented (predicted optimum, maximum error, and maximum expected improvement) are used in the case studies in the present work.

3.2 Co-Kriging

When there are multiple analysis models of varying accuracies and evaluation costs, a multi-fidelity surrogate modelling approach can be used to make full use of the available data. In this work, the auto-regressive, multi-fidelity extension of Kriging, called Co-Kriging (Kennedy and O’Hagan 2000; Forrester et al. 2007), is used. The auto-regressive model says that observations of the high-fidelity response are correct and that any inaccuracies in the Co-Kriging model are caused entirely by the low-fidelity response. This is equivalent to approximating the high-fidelity response by adding a scaled Gaussian process of the low-fidelity response to a Gaussian process of the difference between the two data sets. The reader is referred to Park et al. (2017) for a survey of multi-fidelity surrogate modelling methods, in which the Co-Kriging approach used here is classified as a Bayesian discrepancy framework with a non-informative prior.

The covariance between the two data sets can be expressed using a covariance matrix, C, as,

$$ \begin{array}{@{}rcl@{}} \boldsymbol{C}& =& \left( \begin{array}{cccc} \boldsymbol{C}_{1,1} & \boldsymbol{C}_{1,2}\\ \boldsymbol{C}_{2,1} & \boldsymbol{C}_{2,2} \end{array}\right),\\ \boldsymbol{C}_{1,1}& =& {\sigma^{2}_{c}} \boldsymbol{R}_{c}(\boldsymbol{X}_{c},\boldsymbol{X}_{c}), \\ \boldsymbol{C}_{1,2}& =& {\rho\sigma^{2}_{c}} \boldsymbol{R}_{c}(\boldsymbol{X}_{c},\boldsymbol{X}_{e}), \\ \boldsymbol{C}_{2,1}& =& {\rho\sigma^{2}_{c}} \boldsymbol{R}_{c}(\boldsymbol{X}_{e},\boldsymbol{X}_{c}), \\ \boldsymbol{C}_{2,2}& =& \rho^{2}{\sigma^{2}_{c}} \boldsymbol{R}_{c}(\boldsymbol{X}_{e},\boldsymbol{X}_{e}) + {\sigma^{2}_{d}} \boldsymbol{R}_{d}(\boldsymbol{X}_{e},\boldsymbol{X}_{e}). \end{array} $$
(14)

The subscripts e, c, and d denote the expensive, cheap, and difference models respectively. ρ is the scaling factor. X is the data set of each model fidelity. The correlation matrices Rc and Rd follow the form of (2) and (3).

There are now five hyperparameters to tune (two for each correlation matrix, as well as ρ). The MLE approach from (4) to (6) can be used but with f replaced by a set of difference values between the high- and low-fidelity data sets:

$$ \boldsymbol{d} = \boldsymbol{f}_{e} - \rho\boldsymbol{f}_{c}(\boldsymbol{X}_{e}). $$
(15)

with the hyperparameters estimated, the Co-Kriging predictor can be derived as,

$$ \hat{f}_{e}(\textbf{x}^{(*)}) = \hat{\mu} + \boldsymbol{c}^{T}\boldsymbol{C}^{-1}(\boldsymbol{f}-\boldsymbol{1}\hat{\mu}), $$
(16)

where c is a column vector of covariances between the existing set of observations and a new point. The MSE estimate is given as,

$$ {s^{2}_{e}}(\boldsymbol{x}^{(*)}) = \rho^{2}\hat{\sigma}^{2}_{c} + \hat{\sigma}^{2}_{d} - \boldsymbol{c}^{T}\boldsymbol{C}^{-1}\boldsymbol{c} + \frac{1-1^{T}\boldsymbol{C}^{-1}\boldsymbol{c}}{1^{T}\boldsymbol{C}^{-1}1}. $$
(17)

4 Application of the multi-fidelity optimization with a medial mesh

In this section, two case studies are presented to demonstrate the benefits of using Co-Kriging surrogate models of solid and medial mesh evaluations (abbreviated as the CK strategy) for the structural optimization of engine geometries. The optimum designs found using this multi-fidelity approach, as well as the optimization times, are assessed with respect to a reference single-fidelity approach which is solely constructed from expensive tetrahedral meshes (the KTM strategy). The properties of a single-fidelity approach employing only medial meshes (the KMM strategy) are also examined.

4.1 Two-variable optimization of an engine intercasing

The intercasing from the whole engine geometry shown in Fig. 4 is chosen to be the subject of the first case study. The objective is to minimize the mass of the intercasing, f, while ensuring that its stiffness in terms of the maximum local radial displacement values as described in Section 2.3.1 is not reduced. Figure 7 shows the intermediate compressor casing geometry and the location of the design variables. The variable bounds are set to ± 50% of the nominal dimensions. A total of seven design constraints, IC1 to IC7 (see Fig. 4), are present in the current study. The upper limits of the constraints are the maximum local radial displacement values of the nominal design. Preliminary results showed that the noise in the FEA simulations prevented consistently accurate evaluations of the constraints, so the constraint limits were increased by a factor of 0.1% (ranging from 1.519E − 04 to 2.333E − 03 mm) of the nominal values.

Fig. 7
figure 7

Section view of the engine intercasing and the design variables x1 and x2. x1 and x2 are chosen such that their thickness modification would not affect the gas path. x1 consists of the second flange and the vertical rib which separates the two void boxes, while x2 consists of the diagonal rib which separates the second void box and the combustor region

Thickness modifications for the solid geometry are realized using the Synchronous Modelling functions in Siemens NX9 Modelling. The geometry is meshed according to the results from the mesh sensitivity studies that were performed in Section 2.3.1. The meshing parameters for generating the tetrahedral meshes are taken to be valid across the design space considered and thus remain constant throughout this study. For the medial object, the converged medial mesh from the mesh sensitivity studies is used directly. The process of realizing design changes in the medial mesh is also straightforward as it only involves overwriting the thicknesses of the groups of elements that are associated with each design variable.

A preliminary assessment of the suitability of constructing Co-Kriging models from the tetrahedral and medial meshes could be performed. This involved a comparison of simulation results for a set of 100 designs as specified by a space-filling DOE. Table 4 shows the comparison results. Here, the root mean squared error (RMSE) and maximum absolute error (MAE) are presented as a percentage of the range of response values in the observed data. Note that the mass is not compared as in both cases it is obtained directly from the solid CAD model and is thus the same.

Table 4 Comparison of simulation results using the converged medial and tetrahedral meshes for 100 intermediate compressor casing designs

The r2 correlation coefficient values for the outputs IC3 to IC7 satisfy the r2 > 0.9 recommendation in Toal (2015). The low r2 values for IC1 and IC2 are attributed to the low sensitivities of the two responses toward changes in x1 and x2, where the range of observed values are an order of magnitude smaller than the remaining outputs. The low sensitivities may be explained by how IC1 and IC2 are relatively far away from x1 and x2, and that their responses are already largely dictated by the presence of the thrust lugs. The small ranges also contributed to the large percentage error values. For this case study, a surrogate model that is constructed from only medial mesh evaluations would thus be able to predict the locations of optima but not their response values.

The contour plots in Fig. 8 show the feasible design space of each mesh type as predicted by the Kriging models that are constructed from the space-filling data sets. Differences in the constraint limits are noticeable. The output IC6 is predicted to be the active constraint at the KTM optimum, while the output IC7 is predicted to be the active constraint at the KMM optimum. x2 is expected to go to its lower bound for both of the single-fidelity strategies. However, x1 is expected to be at ≈ 2.8 and ≈ 3.3 for the KTM and KMM optimum designs, respectively. The optimum design from the KTM strategy is also expected to offer more mass reduction.

Fig. 8
figure 8

Contour plots of Kriging predictions for the 100-point space-filling DOE. The contour plots are generated using Kriging predictions from a 501 × 501 grid. The coloured area represents the feasible design space. The bold lines represent the constraint limit curves. The colour gradients for both plots have been scaled to the same range. TM, tetrahedral mesh; MM, medial mesh

The KTM and KMM strategies are initiated with the same 20-point Latin Hypercube DOE, following the 10d recommendation from Jones et al. (1998). The same DOE is also taken to be the initial low-fidelity data set for the CK strategy, while the high-fidelity data set is a 5-point subset of the low-fidelity data that is obtained through the Morris-Mitchell space-filling procedure (Morris and Mitchell 1995).

Each strategy is run for 4 update iterations, where the surrogate models are retrained at the end of each iteration. The KTM and KMM surrogates are updated with 2 maximum EI points, 2 maximum RMSE points, and 1 point at the predicted optimum per iteration, giving a total of 40 evaluations at the end of the optimization. The CK strategy is updated with 1 maximum EI point, 1 maximum RMSE point, and 1 point at the predicted optimum for each mesh fidelity per iteration, ending with a data set of 17 and 32 high- and low-fidelity evaluations respectively. Each strategy is repeated with 10 differently seeded Latin Hypercube DOEs of the same size.

Table 5 summarizes the optimization results. The optimum design parameters for the KTM and CK strategies are approximately the same across all 10 runs. The mass reduction over the nominal design is ≈ 1.10%, with an active constraint IC6 (a constraint with a satisfaction value greater than 0.9995 is taken to be active as the pre-set minimum search distance parameter of 1E − 3 in the optimizer prevented any further evaluations in the feasible design space that is close to the constraint boundaries). The KMM strategy is not able to find an equally good optimum design. The average mass reduction is 69% of the mass reduction achieved by the other two strategies. However, when evaluated using tetrahedral meshes, the KMM optimum designs are able to satisfy the constraint limits in the KTM strategy, as predicted by Fig. 8b.

Table 5 Optimization results for the three surrogate modelling strategies in the intercasing case study. The outputs of the optimum designs have been normalized against the nominal design

To assess the global accuracy of the surrogate models, the final surrogate models from each optimization run are used to generate predictions for the 100-point space-filling DOE that is used in the preliminary test (see Table 4). The predictions are compared against the observed values. Tables 67, and 8 show the comparison results. The CK strategy is able to significantly improve the r2 correlation for IC1 and IC2 and also greatly reduce the percentage errors for all response predictions.

Table 6 r2 correlation between predictions from the surrogate modelling strategies at the end of the optimizations and the observed values for the 100-point space-filling data set. The best strategy for each output is italicized
Table 7 Percentage RMSE of predictions from the surrogate modelling strategies at the end of the optimizations as compared to the observed values for the 100-point space-filling data set. The best strategy for each output is italicized
Table 8 Percentage MAE of predictions from the surrogate modelling strategies at the end of the optimizations as compared to the observed values for the 100-point space-filling data set. The best strategy for each output is italicized

The improvement in the prediction accuracy of the Co-Kriging model can also be observed in Fig. 9b where its predictions of the constraint limits in the region of the optimum design are accurate with respect to the KTM strategy (Fig. 9a). The Co-Kriging predictions of the constraint limits in the region of {1.5 < x1 < 3.0,3.0 < x2 < 4.5} are less accurate as the region is sparsely sampled even with the influence of the maximum RMSE criterion. The predictions of the constraint limits in the optimum region from the KMM surrogate model (Fig. 9c) are not as accurate, even with the high sampling frequency in the vicinity as driven by the maximum EI and predicted optimum criteria.

Fig. 9
figure 9

Contour plots of predictions from the surrogate models in each strategy at the end of the optimization runs for the first of the 10 Latin Hypercube DOEs that were used. The contour plots were generated using Kriging predictions from a 501 × 501 grid. The bold lines represent the constraint limit curves. The colour gradients for all three plots have been scaled to the same range. TM, tetrahedral mesh; MM, medial mesh

Figure 10 illustrates the optimization histories for all three strategies where the costs are plotted in units of equivalent number of cheap evaluations. The cost ratio of tetrahedral mesh to medial mesh evaluation times, averaged over all optimization runs, is 11.10. The cost savings provided by the CK strategy can be clearly observed, with an average convergence cost of 52% when compared to the KTM strategy.

Fig. 10
figure 10

Optimization histories for the three strategies in the intermediate compressor casing case study. The dashed vertical lines mark the end of the DOE evaluations as well as the start of the surrogate model updating procedure

4.2 Eight-variable optimization of a whole engine

Having clearly illustrated the benefits of the CK strategy for optimizing the engine intercasing, the multi-fidelity optimization with medial meshes approach is now applied to a whole engine optimization study to demonstrate its effectiveness in solving realistic design problems involving large-scale engine assemblies.

The objective now is to minimize the whole engine mass, and the number of constraints have been increased to include all 15 maximum local radial displacement outputs across the entire engine (see Fig. 4). Figure 11 shows the eight design variables for the current optimization, where the thicknesses of two structural members for each of the three remaining subsystems have been added. Other than the increased problem size, the parameters of the optimization, as well as the manner in which the FEA simulations are performed, are the same as in Section 4.1.

Fig. 11
figure 11

Section view of the whole engine model and the design variables x1 to x8

The KTM and KMM strategies are initiated with an 80-point Latin Hypercube DOE. This DOE is also used as the initial low-fidelity data set for the CK strategy, while a 20-point Morris-Mitchell subset is extracted to be used as the high-fidelity data set. The longer evaluation times of the whole engine models made it infeasibly expensive to generate a space-filling data set of a suitable size for calculating solid and medial mesh correlations, as was done in the first case study. As such, the medial mesh accuracy is assessed based on the initial 80-point DOEs for the two single-fidelity strategies, where although the comparison results may not be as precise, they should still provide some indication as to whether a multi-fidelity approach is suitable. Table 9 shows the comparison results.

Table 9 Comparison between the 80-point Latin Hypercube data sets of medial and solid WEM evaluations. μ is the average output over the 3 repeated runs. The confidence intervals (CI2.5 and CI97.5) were calculated through resampling with 50,000 bootstrap samples

The r2 correlation coefficient values for all outputs are sufficiently high enough to warrant the use of medial meshes for building the Co-Kriging models. The percentage error values here are also significantly lower when compared to the first case study as the larger number of variables and the presence of inter-subsystem interactions resulted in larger ranges for each of the outputs.

Each strategy is run for 16 update iterations. The surrogate models in the KTM and KMM strategies are updated with 2 maximum EI points, 2 maximum RMSE points, and 1 point at the predicted optimum per iteration, giving a total of 160 evaluations at the end of the optimization. The surrogate models in the CK strategy are updated with 1 maximum EI point, 1 maximum RMSE point, and 1 point at the predicted optimum for each mesh fidelity per iteration, ending with a data set of 68 and 128 high- and low-fidelity evaluations respectively. Each strategy is repeated for a total of times times with the Latin Hypercube DOEs that have been evaluated.

Table 10 summarizes the optimization results. The mass reductions offered by both the KTM and CK strategies are ≈ 1.40%. The output HPT1 is also the active constraint for both strategies. There is, however, some variation in the values for variables x7 and x8 in the CK strategy. The 3 sets of optimum values for the variables x7 and x8 are {2.0873, 2.8875}, {2.4615, 2.6348}, and {4.3224, 1.5000}. Two of the three optimum designs are found to have a more equal distribution of material between the two structural members. The remaining optimum design is observed to push x8 to its lower bound, similar to the optimum designs found by the KTM strategy.

Table 10 Optimization results for the three surrogate modelling strategies in the whole engine case study. The outputs of the optimum designs have been normalized against the nominal design. μ is the average output over the 3 repeated runs. The confidence intervals (CI2.5 and CI97.5) were calculated through resampling with 50,000 bootstrap samples. The constraint satisfaction values for the KMM strategy are the results from evaluations of the optimum MM designs using TMs. Constraint violations are italicized

The KMM strategy here is able to find optimum designs that gave similar mass reductions to the two higher-fidelity strategies as well. However, when these optimum designs are evaluated using tetrahedral meshes, the constraint HPT1 is found to be violated by 1.14%, or approximately 3.13E − 3 mm. The optimum designs are found to have a good balance of material between x7 and x8, similar to a majority of the CK optimum designs. Even so, the KMM surrogate models are not able to predict the constraint responses to the levels of accuracy needed to prevent small violations of the constraint limits from occurring.

Figure 12 shows the optimization histories for all three strategies where the costs are plotted in units of equivalent number of cheap evaluations. The cost ratio of tetrahedral mesh to medial mesh evaluation times, averaged over all optimization runs, is 11.65. The cost savings provided by the CK strategy can again be clearly observed, with an average convergence cost of 57% when compared to the KTM strategy.

Fig. 12
figure 12

Optimization histories for the three strategies in the whole engine case study. The dashed vertical lines mark the end of the DOE evaluations as well as the start of the surrogate model updating procedure. Note that the KMM runs converged to optimum designs that are slightly infeasible with respect to the KTM constraints when they were evaluated using tetrahedral meshes

5 Conclusions

In this paper, a novel approach which combines medial meshing and multi-fidelity surrogate modelling techniques is proposed and demonstrated on two engine design problems. For the intermediate compressor casing problem, the Co-Kriging approach is able to find feasible optimum designs that are on par with the single-fidelity Kriging approach which solely uses solid mesh evaluations. However, the Co-Kriging approach was able to converge to its optimum at 52% the cost of the expensive single-fidelity Kriging approach. A single-fidelity Kriging approach which solely uses medial mesh evaluations was also tested and it is able to find optimum designs that are feasible with respect to the constraint limits of the high-fidelity strategy, even in the presence of active constraints. For the whole engine problem, the Co-Kriging approach was also able to find feasible optimum designs but at only 57% the cost of the expensive single-fidelity Kriging approach. The single-fidelity Kriging with medial meshes approach, however, found optimum designs that were infeasible with respect to the constraint limits of the high-fidelity strategy.

These results have demonstrated that the feasibility of conducting whole engine optimization studies using finite element models can be greatly improved through a combination of medial meshing and multi-fidelity surrogate modelling techniques. The low manual effort required to generate these dimensionally-reduced whole engine geometries, as enabled by the MANTLE software, also means that the geometries are cheap and easy to update as individual subsystem designs progress throughout their design cycles. Future work aims to further reduce the cost of the multi-fidelity approach. Given the correlation and error values between two data sets of different fidelities, research for establishing the most efficient distribution of computational resources between the high-fidelity and low-fidelity evaluations is necessary. The robustness of the multi-fidelity approach also needs to be demonstrated using more complex optimization problems with multi-modal response surfaces.

6 Replication of results

The MANTLE software, as well as all finite element models generated using the software, are protected intellectual properties of Rolls-Royce. OPTIMATv2 is a Rolls-Royce proprietary optimization toolbox. The engine geometry shown in Figs. 2, 3, 4, 7, and 11 are created using a Rolls-Royce model provided through the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 234344 (www.crescendo-fp7.eu).