A hierarchical kriging approach for multi-fidelity optimization of automotive crashworthiness problems

Kaps, Arne; Czech, Catharina; Duddeck, Fabian

doi:10.1007/s00158-022-03211-2

A hierarchical kriging approach for multi-fidelity optimization of automotive crashworthiness problems

Research Paper
Open access
Published: 18 March 2022

Volume 65, article number 114, (2022)
Cite this article

Download PDF

You have full access to this open access article

Structural and Multidisciplinary Optimization Aims and scope Submit manuscript

A hierarchical kriging approach for multi-fidelity optimization of automotive crashworthiness problems

Download PDF

2299 Accesses
11 Citations
Explore all metrics

Abstract

Multi-fidelity optimization schemes enriching expensive high-fidelity functions with cheap-to-evaluate low-fidelity functions have gained popularity in recent years. In the present work, an optimization scheme based on a hierarchical kriging is proposed for large-scale and highly non-linear crashworthiness problems. After comparison to other multi-fidelity techniques an infill criterion called variable-fidelity expected improvement is applied and evaluated. This is complemented by two innovative techniques, a new approach regarding initial sampling and a novel way to generate the low-fidelity model for crash problems are suggested. For the former, a modified Latin hypercube sampling, pushing samples more towards design space boundaries, increases the quality of sampling selection. For the latter, a projection-based non-intrusive model order reduction technique accelerates and simplifies the low-fidelity model evaluation. The proposed techniques are investigated with two application problems from the field of automotive crashworthiness—a size optimization problem for lateral impact and a shape optimization problem for frontal impact. The use of a multi-fidelity scheme compared to baseline single-fidelity optimization saves computational effort while keeping an acceptable level of accuracy. Both suggested modifications, independently and especially combined, increase computational performance and result quality in the presented examples.

Multi-Objective Optimization for Structure Crashworthiness Based on Kriging Surrogate Model and Simulated Annealing Algorithm

Article 31 July 2020

Hybrid Strategy Coupling EGO and CMA-ES for Structural Topology Optimization in Statics and Crashworthiness

Data-driven models for crashworthiness optimisation: intrusive and non-intrusive model order reduction techniques

Article Open access 28 June 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

As computational power has increased exponentially in recent years, also Finite Element (FE) models reached a higher level of detail and complexity—e.g. modern day car models for crash simulations may contain more than ten million elements. This balances out such that simulation times are not significantly decreasing. Especially in multi-query analysis such as optimization or robustness applications a high number of evaluations is required, which increases the computational effort to an infeasible level.

One possible remedy is the use of specifically designed optimization approaches such as Efficient Global Optimization (EGO) that was first proposed by Jones et al. (1998). The idea is to build a surrogate model from an initial design of experiments (DoE) and adaptively improve it utilizing a so-called infill criterion (Jones 2001; Forrester and Keane 2009). In this context, mostly kriging models (Krige 1951; Matheron 1963; Sacks et al. 1989) are exploited as surrogate models as their inherent error approximation features are especially beneficial.

More recently, these types of surrogate models were integrated in a multi-fidelity scheme, whereby the high-fidelity FE-analysis is complemented with an additional low-fidelity model: The corresponding multi-fidelity kriging schemes can be categorized into two variants. One class of techniques considers correction-based methods, where a “bridge function” or “scaling function” models the differences between high- and low-fidelity models (Chang et al. 1993; Gano et al. 2006; Han et al. 2013). The second type of multi-fidelity approaches are named cokriging. The idea of the latter is to enhance the low-fidelity surrogate model by utilizing the covariance matrix between low- and high-fidelity model. Originally proposed in the geostatistics community (Journel and Huijbregts 1978), this approach was extended to computer experiments by Kennedy and O’Hagan (2000) and called KOH autoregressive model. Due to its success, numerous extensions and modifications have been added to cokriging since its introduction: Han et al. (2012) proposed an alternative approach for creation of the covariance matrix between low- and high-fidelity models. Moreover, Gratiet and Garnier (2014) reformulated the cokriging algorithm in a recursive manner to reduce computational complexity. An extension considering complex cross-correlations between varying fidelity models can be found in Perdikaris and Karniadakis (2016). The present work is based on hierarchical kriging (HK), suggested by Han and Görtz (2012), whereby the low-fidelity surrogate model represents the trend term in the multi-fidelity predictor. It is beneficial in the context of multi-fidelity optimization as it provides better error estimation capabilities compared to other cokriging models.

As surrogate models were adapted to multi-fidelity applications, so were infill criteria. A criterion named Augmented EI, capable of adaptively sampling low- and high-fidelity models by considering coefficients for cross-correlations and cost ratios between models was suggested by Huang et al. (2006). Moreover, Zhang et al. (2018a) proposed the variable-fidelity expected improvement (VF-EI) criterion that implements a similar idea but with an analytical derivation and free from external coefficients. Therefore, the latter is used in the present work.

A common approach in multi-fidelity optimization is to combine FE models with varying levels of mesh sizes for high and low-fidelity models, such as realized by Zhang et al. (2018a) for an airfoil shape optimization. In combination with a cokriging adaptation presented by Gratiet and Garnier (2014), a hydrofoil shape optimization with varying mesh size levels was performed by Bonfiglio et al. (2018). Similar approaches are investigated in the applications of crashworthiness for honeycomb structures and sheet metal forming in Sun et al. (2010) and Sun et al. (2010), respectively. Alaimo et al. (2018) proposed a multi-fidelity approach where an adaptive functional principal component analysis (PCA) model is utilized with a simulated annealing (SA) algorithm applied to linear elastic structural topology optimization. Anselma et al. (2020) published a multi-fidelity scheme for the crashworthiness discipline inside a multidisciplinary optimization. The authors use analytical equations as a low-fidelity model and propose to only evaluate the FE high-fidelity model if the former predicts infeasible results. Also a cokriging-based multi-fidelity version of EGO was exploited for inverse problems in haemodynamics (Perdikaris and Karniadakis 2016).

In automotive crashworthiness, optimization has been performed for many years (Redhe et al. 2004; Duddeck 2008; Hunkeler et al. 2013). More recently, multi-fidelity schemes have also been applied in this field (Sun et al. 2010). Acar et al. (2020) investigated a multi-fidelity optimization for a frontal impact problem of a bumper system with the multi-fidelity surrogate modeling approach suggested by Zhang et al. (2018b). Results show that multi-fidelity approaches are capable of yielding significant time-savings while maintaining acceptable accuracy. Other mechanics-based low-fidelity models available for crashworthiness applications are listed in Lange et al. (2018). The authors begin with lumped mass-spring models and subsequently motivate the introduction of the so-called component solution space approach that can be applied in early phase component development for crashworthiness analyses.

Recently, model order reduction (MOR) techniques have been introduced also for non-linear problems (Guo and Hesthaven 2017; Swischuk et al. 2019) and applied in crashworthiness (Kneifl et al. 2021). The non-intrusive approaches are based on the results of training simulations—here named snapshots—which are utilized to compute a reduced subspace. In addition, a regression model is trained that combines the basis vectors of the subspace to represent the physical behavior of the system (Guo and Hesthaven 2019). The non-intrusive MOR has been integrated into a multi-fidelity training scheme by Kast et al. (2020) and related projection-based approaches for crashworthiness applications and optimization have been conducted (Le Guennec et al. 2018; Assou et al. 2019; Gstalter et al. 2020; Ren et al. 2020). A summary of recent developments in the field of non-intrusive MOR is presented in Yu et al. (2019) for fluid mechanics application. Moreover, principal component-based surrogate models can also be found in the field of structural topology optimization (Alaimo et al. 2018; Xiao et al. 2020; Choi et al. 2019).

In the present work we aim to develop enhanced multi-fidelity optimization schemes in crashworthiness applications. To that end, we propose to integrate an incremental projection-based MOR approach as low-fidelity model into a multi-fidelity EGO algorithm. In a second step to reduce computational effort, our recently developed isovolumetric sampling approach placing samples closer to design space boundaries is adapted (Kaps et al. 2021). When assessing algorithm performance, two main criteria can be established. The primary goal is to find an optimization approach with reduced computational effort produced by the high number of expensive evaluations of the objective function during the optimization. Secondly, an acceptable level of accuracy must be maintained; i.e. a multi-fidelity optimization scheme should not lead to inferior results compared to an optimization based using only high-fidelity simulations.

This work is structured as follows. Initially, the novel design of experiments approach is introduced in Sect. 2, followed by the optimization scheme based on HK and VF-EI in Sect. 3. The MOR approach used for low-fidelity model generations is presented in Sect. 4. Subsequently, the proposed optimization scheme and its implementation are explained in Sect. 5. The performance of the complete set of methods is illustrated by a lateral impact example and a crashbox design problem in Sect. 6, and final results are summarized together with an outlook into future work in Sect. 7.

2 Isovolumetric design of experiments

The first step of any population-based optimization is to determine an initial set of sample points by means of DoE. Covering the full design space with a small amount of samples is the general aim. As there is no unique criterion for this vaguely formulated goal, DoE is still an active field of research. An overview of popular criteria and approaches is given in Garud et al. (2017).

In the present work, a modified optimal Latin hypercube (OLH) approach is used. The standard construction of a Latin hypercube design (LHD) for N samples in d dimensions is described in the following. In each dimension the space is divided into N strata of equal probability, i.e. the design space then consists of $N^d$ cells. Randomly, N cells are selected such that each stratum of a dimension may only contain a single sample. Each sample can be placed in the center of its cell or randomly located within it (Rajabi et al. 2015), whereby the centered case is considered in the present work.

Initial Latin hypercube samples are incrementally optimized with a Simulated Annealing algorithm that consists of random pairwise- and coordinate-wise swaps. In an iterative process new samples are accepted if they improve a space-filling criterion or accepted with a certain probability if they do not bring an improvement (Morris and Mitchell 1995). Other approaches with deterministic sample selection, e.g. (Ye et al. 2000) or more elaborate optimization schemes such as the Enhanced Stochastic Evolutionary algorithm (Jin et al. 2003) have been suggested.

Recently, we proposed an adaptation to Latin hypercube sampling, named isovolumetric LHD, that places samples closer to design space boundaries (Kaps et al. 2021). The idea is to rethink the uniform strata that are created in each coordinate dimension for standard Latin hypercube sampling as nested hypervolume shells in the design space. These are created as shown with the colored regions in Fig. 1. Here, all shells are required to have identical volume, which is especially advantageous for higher dimensions, i.e. higher number of design variables. Applying this condition to a d-dimensional unit hypercube, with strata boundaries $p_i = \frac{i}{N}$ and sizes $a_j = \frac{1}{N}$ for standard LHS, yields the following new equations

$$\begin{aligned} p_i&= {\left\{ \begin{array}{ll} 0.5 \left( 1 - \left( \frac{N_v + 1 - i}{N_v}\right) ^{1/d} \right) , \quad &{} i \in \{ 1, 2, ..., N_v \}\\ 0.5 \left( 1 + \left( \frac{i - (N_v + 1)}{N_v}\right) ^{1/d} \right) , \quad &{} i \in \{ N_v+1, ..., N + 1\}\\ \end{array}\right. } \end{aligned}$$

(1)

$$\begin{aligned} a_j&= p_{j+1} - p_j,\quad j \in \{ 1, 2, ..., N \}. \end{aligned}$$

(2)

Here, N is the number of samples to be drawn and $N_v = \frac{N}{2}$ the number of nested hypervolume shells. An exemplary isovolumetric Latin hypercube design for six samples in two dimensions is depicted in Fig. 1. The adaptation is easy to implement and does not increase computational requirements; however, it has been shown to increase the quality of DoEs in popular space-filling criteria such as potential energy (Audze and Eglais 1977) in mid- to high-dimensional situations, i.e. five and more dimensions (Kaps et al. 2021). One expected advantage in the current application is that training points associated to the surrogate models are closer to the design space boundaries. Thereby, the prediction of surrogate models will be more based on interpolation of samples as opposed to extrapolation.

This approach, named optimal isovolumetric Latin hypercube (OIVLH), is transferred to the context of optimization in crashworthiness applications in the present work.

3 Multi-fidelity efficient global optimization

In the following, a multi-fidelity EGO approach is introduced. In general, EGO techniques are based on kriging models, which have first been introduced by (Krige 1951; Matheron 1963; Sacks et al. 1989) and are nowadays a popular choice for surrogate models. The idea of adaptively improving an initially created kriging model by means of an infill criterion was proposed by Jones et al. (1998). Common early variants of the approach are compared in Jones (2001).

Over the years, many surrogate models of variable fidelity have been suggested. Overviews can be found for example in Forrester et al. (2007) or Park et al. (2016). In the present work, hierarchical kriging is applied as the approach has been shown to yield better error estimations than other cokriging methods (Han and Görtz 2012). The relevant aspects of creating an HK model are summarized below, readers are referred to the original publication for detailed derivations (Han and Görtz 2012).

The idea is to first create a kriging model of the low-fidelity function that is subsequently used in the hierarchical kriging model for high-fidelity prediction. Based on these surrogate models, new sample points can be evaluated. The models are then adaptively improved. To that end, an infill criterion is introduced that can be maximized to determine the best location and fidelity level for a new adaptive sample.

Following, all steps of the outlined process are explained in details. The low-fidelity function is represented by a normal kriging model. Consider a random process for the low-fidelity (LF) function

$$\begin{aligned} Y_{lf} ({\varvec{x}}) = \beta _{0,lf} + Z_{lf} ({\varvec{x}}), \end{aligned}$$

(3)

with $\beta _{0,lf}$ an unknown constant and $Z_{lf} ({\varvec{x}})$ a stationary random process. For a d-dimensional problem, the low-fidelity kriging model is based on a set of sampling data $({\varvec{S}}_{lf}, {\varvec{y}}_{S,lf})$ consisting of $m_{lf}$ samples with input variable data ${\varvec{S}}_{lf} \in {\mathbb {R}}^{m_{lf} \times d}$ and corresponding output ${\varvec{y}}_{S,lf} \in {\mathbb {R}}^{m_{lf}}$.

To be able to construct the kriging predictor for new points, a correlation function, also named kernel, is needed to model the correlation between given sample points and new points. An overview of popular choices is for example given in Rasmussen and Williams (2005). In the present work, a squared-exponential kernel, also called Gaussian radial-basis function (RBF) kernel is utilized due to its smoothness and infinite differentiability:

$$\begin{aligned} R({\varvec{x}}^{(i)}, {\varvec{x}}^{(j)}) = \prod _{k=1}^d \text {exp} \left( - \theta _k |x_k^{(i)} - x_k^{(j)}|^2\right) . \end{aligned}$$

(4)

Here, $\theta _k$ denotes the kernel length scale. The kernel is called anisotropic, if there is a separate length scale for each design space dimension as in the equation above. In the present work, an isotropic kernel is chosen, where the hyperparameter $\theta _k = \theta$ is a scalar, i.e. independent of coordinate dimension. In fact, many different RBF kernels with similar properties than the Gaussian kernel exist. The focus in the present work lies solely on the latter for the sake of clarity.

The low-fidelity predictor for a new point ${\varvec{x}}$ can then be written as

$$\begin{aligned} \begin{aligned} {\hat{y}}_{lf} ( {\varvec{x}} )&= \beta _{0,lf} + {\varvec{r}}_{lf}^T ({\varvec{x}}) {\varvec{R}}_{lf}^{-1} ( {\varvec{y}}_{S,lf} - \beta _{0,lf} {\varvec{1}} ),\\ \text {with}&\quad \beta _{0,lf} = ({\varvec{1}}^T {\varvec{R}}_{lf}^{-1} {\varvec{1}} )^{-1} {\varvec{1}}^T {\varvec{R}}_{lf}^{-1} {\varvec{y}}_{S,lf}, \\ \text {and}&\quad {\varvec{r}}_{lf} = [R({\varvec{x}}, {\varvec{x}}^{(1)}), ..., R({\varvec{x}}, {\varvec{x}}^{(m)})] \in {\mathbb {R}}^{m_{lf}} \end{aligned} \end{aligned}$$

(5)

where ${\varvec{r}}_{lf}$ is the correlation vector between the sample data and the new point, ${\varvec{R}}_{lf} \in {\mathbb {R}}^{m_{lf} \times m_{lf}}$ the correlation matrix representing correlation between sample data points and ${\varvec{1}} \in {\mathbb {R}}^{m_{lf}}$ a column vector filled with ones.

Following the calculation of the initial sample data set, the kriging model is fitted by separately optimizing the kernel hyperparameter $\theta$. Differential evolution (Storn and Price 1997) is selected for the optimization of the hyperparameters in the present work due to its simplicity and its good global search characteristics. More on hyperparameter optimizations can be found in Toal et al. (2008).

Next, the hierarchical kriging model can be constructed including the predictor of the low-fidelity model. Therefore, consider a random process corresponding to the high-fidelity function

$$\begin{aligned} Y ({\varvec{x}}) = \beta _0 {\hat{y}}_{lf} ( {\varvec{x}} ) + Z ({\varvec{x}}). \end{aligned}$$

(6)

Here, the low-fidelity predictor scaled by an unknown constant $\beta _0$ is a trend term and $Z ({\varvec{x}})$ is a stationary random process. Given a d-dimensional sampling data set $({\varvec{S}}, y_S)$ containing m samples with input variable data ${\varvec{S}} \in {\mathbb {R}}^{m \times d}$ and corresponding output $y_S \in {\mathbb {R}}^m$, the HK predictor for the high-fidelity function can be written as

$$\begin{aligned} \begin{aligned} {\hat{y}} ( {\varvec{x}} ) = \beta _{0} {\hat{y}}_{lf} ( {\varvec{x}} ) + {\varvec{r}}^T ({\varvec{x}}) {\varvec{R}}^{-1} ( {\varvec{y}}_{S} - \beta _{0} {\varvec{F}} )\\ \text {with} \quad \beta _{0} = ({\varvec{F}}^T {\varvec{R}}^{-1} {\varvec{F}} )^{-1} {\varvec{F}}^T {\varvec{R}}^{-1} {\varvec{y}}_{S}, \end{aligned} \end{aligned}$$

(7)

where $\beta _{0}$ is a scaling factor indicating the correlation between high- and low-fidelity functions and ${\varvec{F}} = [{\hat{y}}_{lf} ( {\varvec{x}}^{(1)} ) ... {\hat{y}}_{lf} ( {\varvec{x}}^{(n)} )]^T, \forall {\varvec{x}}^{(i)} \in {\varvec{S}}$ represents the low-fidelity prediction at high-fidelity sample points. ${\varvec{r}} \in {\mathbb {R}}^m$ and ${\varvec{R}} \in {\mathbb {R}}^{m \times d}$ are defined the same way as for the low-fidelity predictor above. The factor ${\varvec{R}}^{-1} ( {\varvec{y}}_{S} - \beta _{0} {\varvec{F}} )$, named ${\varvec{V}}_{HK}$ in the original publication, does not depend on the untried point ${\varvec{x}}$ and can thus be calculated at model fitting time. The mean squared error (MSE) of the HK prediction is given with $\sigma ^2$, the process variance of $Z({\varvec{x}})$

$${\text{MSE}}(\hat{y}(x)) = \sigma ^{2} (1.0 - r^{T} {\varvec{R}}^{-1} r + \left[ {r^{T} {\varvec{R}}^{-1} {\varvec{F}} - \hat{y}_{{lf}} } \right]^{2} \left( {{\varvec{F}}^{T} {\varvec{R}}^{-1} {\varvec{F}}} \right)^{{ - 1}} ).$$

(8)

Once the initial HK model is built, it is adaptively improved using an infill criterion to determine the ideal position of new samples. In multi-fidelity applications two options exist: a ‘classic’ single-fidelity infill criterion or a multi-fidelity criterion. Among the former, expected improvement (Jones et al. 1998) is the most popular method. The interested reader is referred to Jones (2001) for other common techniques. The disadvantage of a single-fidelity criterion is that only high-fidelity samples can be considered adaptively. Hence, they are not further discussed here. In the present work, the so-called variable-fidelity expected improvement criterion (Zhang et al. 2018a) is utilized. It is defined at location ${\varvec{x}}$ and fidelity level L as

$$\begin{aligned} EI_{vf} ({\varvec{x}}, L) = {\left\{ \begin{array}{ll} s({\varvec{x}}, L) \left[ u \varPhi \left( u \right) + \phi \left( u \right) \right] , &{} \text {if} \quad s({\varvec{x}}, L) >0 \\ 0, &{} \text {if} \quad s({\varvec{x}}, L) =0 \end{array}\right. }, \end{aligned}$$

(9)

where $u = \frac{y_{min} - {\hat{y}} ({\varvec{x}})}{s({\varvec{x}}, L)}$ and $y_{min}$ as the currently best observed feasible high-fidelity function value. The term $s({\varvec{x}}, L)$ denotes the uncertainty of the HK model. The previously introduced scaling factor between fidelity levels $\beta _0$ is used here to model the uncertainty in high-fidelity prediction caused by the low-fidelity predictor

$$\begin{aligned} s^2 ({\varvec{x}}, L) = {\left\{ \begin{array}{ll} \beta _0^2 \cdot {\text{MSE}}( {\hat{y}}_{lf} ( {\varvec{x}} )), &{}L=0\; \text {low-fidelity}\\ {\text{MSE}}( {\hat{y}} ( {\varvec{x}} )), &{}L=1 \;\text {high-fidelity} \end{array}\right. }. \end{aligned}$$

(10)

Here, ${\text{MSE}}( {\hat{y}} ( {\varvec{x}} ))$ and ${\text{MSE}}( {\hat{y}}_{lf} ( {\varvec{x}} ))$ are the MSEs of the high- and low-fidelity kriging predictors, respectively. $\varPhi (\bullet )$ and $\phi (\bullet )$ in Eq. (9) represent the cumulative distribution and probability density functions of the standard normal distribution, respectively. The two summands in Eq. (9) can be identified with exploration and exploitation. The first term $\left( y_{min} - {\hat{y}} ({\varvec{x}}) \right) \varPhi (u)$ is dominated by improving the solution ${\hat{y}} ({\varvec{x}})$ and thus represents exploitation, while the second term $s({\varvec{x}}, L) \phi (u)$ represents exploration because it is dominated by the solution uncertainty $s({\varvec{x}}, L)$.

Due to the highly multi-modal nature of the EI functions, differential evolution (Storn and Price 1997) is selected for optimization of the infill criterion in the present work.

Notably, the VF-EI formulation is similar to the original EI definition (Jones et al. 1998); however, in addition the dependency on the fidelity level is introduced. In terms of multi-fidelity optimization the VF-EI criterion is comparable to the augmented EI criterion proposed by Huang et al. (2006). Both describe the expected improvement of the high-fidelity function with respect to adaptive samples on both fidelity levels. To that end, augmented EI contains two factors that are multiplied with a standard EI for the high-fidelity function. A more detailed discussion about the differences between VF-EI and augmented EI can be found in Zhang et al. (2018a). Here, we use VF-EI as it is free of empirical parameters and is as such more intuitive.

4 Model order reduction

The proposed multi-fidelity approach exploits a data-driven model order reduction (MOR) technique to create the low-fidelity model. As an analytical simplification is not available for non-linear problems, a data-driven approach is commonly used (Sirovich 1987). Therefore, an online and offline phase are introduced, whereby the offline phase can be understood as the counter part to the DoE. In particular, the surrogate model is created during the offline, also called training phase. Afterwards the simplified model can be evaluated for multi-fidelity analysis in the online phase.

4.1 Training phase

Within the training phase, a set of full order simulations is created, whereby all resultants are stored as snapshots $x_i \in {\mathbb {R}}^N$ with N degrees of freedom. Combining the training simulations to a so-called snapshot matrix ${\varvec{A}} \in {\mathbb {R}}^{N \times n}$, with n the number of collected snapshots, a reduced subspace and its projection matrix can be computed. Through the Singular Value Decomposition (SVD), also referred to as thin SVD (Golub and Van Loan 2013) the snapshot matrix ${\varvec{A}}$ can be represented by the left-singular vectors ${\varvec{U}} \in {\mathbb {R}}^{N \times n}$, the diagonal matrix $\Sigma \in {\mathbb {R}}^{n \times n}$ containing non-negative singular values $\sigma _i$ in descending order and the right-singular matrix ${\varvec{Z}} \in {\mathbb {R}}^{n \times n}$. Thus, the columns of the matrix ${\varvec{U}}$ are the eigenvectors of ${\varvec{A}}{\varvec{A}}^T$.

$$\begin{aligned} {\varvec{A}} = {\varvec{U}} \Sigma {\varvec{Z}}^T \approx {\varvec{U}}_k \Sigma _k {\varvec{Z}}^T_k = {\varvec{V}} \Sigma _k {\varvec{Z}}^T_k. \end{aligned}$$

(11)

Moreover, the matrix ${\varvec{A}}$ is approximated by truncating its parts to a rank $k \le n,m$, such that ${\varvec{U}}_k \in {\mathbb {R}}^{N \times k}$, $\Sigma _k \in {\mathbb {R}}^{k \times k}$ and ${\varvec{Z}}_k \in {\mathbb {R}}^{k \times N}$, respectively. To define the reduced basis of the subspace, ${\varvec{V}} := {\varvec{U}}_k \in {\mathbb {R}}^{N \times k}$ is further utilized as the projection matrix. In practice, the optimal rank k is not known beforehand and k = min ${\tilde{k}}$ with

$$\begin{aligned} \dfrac{\Vert {\varvec{A}} - {\varvec{U}}_ {{\tilde{k}}} \Sigma _{{\tilde{k}}} {\varvec{Z}}^T_{{\tilde{k}}} \Vert _F}{\Vert {\varvec{A}} \Vert _F} = \sqrt{\frac{\sum ^k_{i={\tilde{k}}+1}\sigma ^2_i}{\sum ^k_{i=1}\sigma ^2_i}} \le \epsilon \end{aligned}$$

(12)

can be found for an error threshold $\epsilon$. In other words, the matrices are truncated by ${\tilde{k}}$ such that an energy cutoff ratio $\mu$ is maintained:

$$\begin{aligned} \mu = \frac{\sum ^k_{i=1}\sigma ^2_i}{\sum ^n_{i=1}\sigma ^2_i}. \end{aligned}$$

(13)

For large-scale matrices the evaluation of the full SVD is cost intensive as its complexity is in the range $O(n^2)$ with n as the number of snapshots. Therefore, multiple approaches (Bach et al. 2019; Phalippou et al. 2020) to efficiently compute the truncated projection matrix, such as randomized or incremental SVD techniques, e.g. (Oxberry et al. 2017), can be applied. Here, an incremental SVD algorithm (Baker et al. 2012)^{Footnote 1} is utilized. To save computational resources, the snapshot matrix ${\varvec{A}}$ is built up incrementally. Therefore, ${\varvec{A}}$ is divided into batches, which are added to the projection matrix ${\varvec{V}}$. The SVD is computed within every iteration via QR decomposition and its truncation rank k is evaluated. The batch is added to the global ${\varvec{V}}$ and $\Sigma$ before the algorithm steps into the next iteration with new snapshots. Readers are referred to Baker et al. (2012) for a detailed algorithm description. In summary, a complexity of O(mnk) with m full order unknowns can be reached.

After the construction of the reduced subspace a regression or interpolation approach is introduced. The meta-model represents the unknown in the reduced space and is therefore restricted to the physical solution spanned in the subspace. In addition, the number of unknowns n is drastically reduced, as $k<< n$. Within this work the k-nearest neighbor (kNN) approach is utilized as an interpolation technique, but any other machine learning approach such as polynomial regression function, Gaussian process regression or neural networks could be used (Swischuk et al. 2019). k-nearest neighbor is mainly known as a classification technique, but can also be applied as a regression model. The function $y = f(x)$ is interpolated by the k-nearest neighbors of x as shown in Fig. 2 for a one-dimensional example. Here, three neighbors of x and their distances are evaluated to estimate f(x).

4.2 Online phase

After the construction of the low-fidelity model is completed, it is evaluated in the online phase. Recalling the truncated singular value decomposition, the columns of the matrix ${\varvec{V}} \in {\mathbb {R}}^{N \times k}$ can be interpreted as the basis vectors ${\varvec{v}}$ of the subspace.

$$\begin{aligned} V = \begin{bmatrix} \varvec{v_1} &{} \varvec{v_2} &{} \cdots &{} \varvec{v_k}\\ \end{bmatrix} \end{aligned}$$

(14)

The full displacement vector is estimated by the linear combination of the basis vectors ${\varvec{v}}$, whereby every basis is weighted by a scalar value $\sigma _i$.

$$\begin{aligned} {\varvec{x}} \approx \sum _{i=1}^{k} \varvec{v_i} \sigma _i \end{aligned}$$

(15)

The kNN approach provides an estimation for each $\sigma _i$, the degrees of freedoms of the subspace.

5 Proposed optimization scheme

In the present work the performance of multi-fidelity optimization schemes in crashworthiness applications is investigated. To that end, a multi-fidelity optimization method that integrates a non-intrusive reduced order model into the hierarchical kriging surrogate is proposed. The schematic process of this approach is shown in Fig. 3 following the baseline multi-fidelity scheme proposed by Zhang et al. (2018a). Initially, DoE is performed as described in Sect. 2 on both, the high- and the low-fidelity level separately, usually generating significantly more low- than high-fidelity samples. OLH and OIVLH are both applied in the present work, to assess the impact on optimization performance. All high-fidelity samples are then calculated. For memory efficiency, a reduced order model is incrementally created during the high-fidelity evaluations as introduced in Sect. 4. Following all initial high-fidelity simulations and as the main adaptation to the originally proposed scheme, the reduced basis is evaluated and the k-nearest neighbor regression model is trained for predictions. The reduced order model is evaluated on the initially created low-fidelity DoE points. From the results, the initial low-fidelity kriging model and subsequently the hierarchical kriging model are fitted. For adaptive improvement, the infill criterion (i.e. VF-EI) is maximized separately on both fidelity levels. Depending on which level yields the better results the next adaptive sample can be either low-fidelity (L=0 in Fig. 3) or high-fidelity (L=1). The objective function is evaluated for the respective new adaptive sample and the kriging model(s) are updated. Then, another infill criterion optimization is started and the iterative improvement continues.

Three different criteria determine the termination of the algorithm. First, a minimum allowable value for the maximized infill criterion is specified (here $IC_{th} = 10^{-5}$). Second, a maximum number of high-fidelity evaluations is specified. Third, a maximum total number of objective evaluations can be specified, which in single-fidelity optimizations is equivalent to the second criterion. Values specified for the criteria are given in Sect. 6 with the respective examples. The first criterion can be interpreted as convergence of the algorithm to an optimal point with little improvement possibilities. The other two criteria are used to represent time restrictions on the optimization runs, i.e. an estimation of the maximum run time.

The proposed scheme is implemented in an in-house Python code. The DoE part of the algorithm is based on Kaps et al. (2021). The implementation of the incremental SVD technique in the present work is adjusted from Baker et al. (2012) and Bach et al. (2019). HK model generation and kernel implementation are based on the scikit-learn library in Python (Pedregosa et al. 2011).

6 Crashworthiness application problems

In the following, the presented optimization scheme is compared to a baseline multi-fidelity scheme proposed by Zhang et al. (2018a) as well as a single-fidelity scheme based on a high-fidelity model and EI. Additionally, each of the techniques is assessed with a standard OLH sampling as well as the OIVLH method that places samples closer to design space boundaries. These overall six approaches are each evaluated for two crashworthiness problems in terms of result quality and computational requirements. Each optimization run is repeated ten times to ensure reliability of the assessment. An overview of the compared methods and nomenclature is given in Table 1. All analyses are performed on the identical hardware using the explicit finite element software LS-Dyna in its MPP version distributed on eight cores. All objective function values referred to below are based on the high-fidelity model unless explicitly stated otherwise.

Table 1 Overview of different optimization schemes applied in the following crashworthiness examples. All techniques are referred to by the abbreviation given in the first column

Full size table

6.1 Side sill impact problem

The initial problem is a crash model representing the side sill of a car under side pole impact as depicted in Fig. 4. Both ends of the side sill are fixed, and the pole represented by a cylindrical rigid body with radius 35mm has a prescribed initial velocity of 36 km/h and a mass of 86 kg. Further modeling information as well as all material parameters are summarized in “Appendix A”. The simulation is terminated when the impactor stops or—as a backup—after 40 ms.

The design variables of the optimization problem are the thicknesses $t_i$ of the five horizontal reinforcement ribs in the interior of the side sill (compare detail in Fig. 4). The objective is to minimize the mass of the side sill, i.e. the mass of the horizontal ribs, while keeping the lateral intrusion below $u_{\text{allow}}=50$ mm. Applying the penalty method with a penalty factor $p = 3.75$, the deformation constraint is included into the objective function. The complete optimization problem can then be formulated as

$$\begin{aligned}&\!\min _{{\varvec{t}}}& &f({\varvec{t}}) = m_{\text{ribs}} + p \cdot \max ( c({\varvec{t}}), 0), \quad \text {with } p = 3.75, \end{aligned}$$

(16a)

$$\begin{aligned}&\text {subj. to}& &c({\varvec{t}}) = \frac{u_{\text{max}}}{u_{\text{allow}}} - 1 \le 0, \quad u_{\text{allow}} = 50\,\text {mm}, \end{aligned}$$

(16b)

$$\begin{aligned}&\text {where}& &0.5\,\text {mm} \le t_i \le 3.0\,\text {mm}, \quad i = 1, ..., 5. \end{aligned}$$

(16c)

The model depicted in Fig. 4 represents the high-fidelity model with an element size of approximately 5mm in each direction. For comparison, the low-fidelity model with the element size doubled to 10mm is depicted in “Appendix A”. Thus, the speed-up factor of the low-fidelity model is about five to six. The number of initial high-fidelity samples for the single-fidelity case is set to 50. For all multi-fidelity approaches, the number of initial samples is 20 and 120 for high- and low-fidelity models, respectively. In this example, the only termination criterion specified is a threshold value for the respective infill criterion, i.e. EI for SF methods and VF-EI for MF methods, of $IC_{th}=10^{-5}$.

The reduced order model is constructed with 20 training simulations sampled by the respective method. With the truncation energy of $\mu = 0.9999$ a subspace containing approximately $k=4$ basis vectors is computed. Moreover, the kNN regressor using 5 neighbors is trained for the unknowns in the reduced subspace. Eventually, a single evaluation of the reduced order model has a speed-up factor of 150 in comparison to the high-fidelity simulation.

In Fig. 5, optimization results for the six different approaches are compared with a parallel coordinates plot. On the x-axes all five design variables are listed while the y-axes represent their normalized ranges. Each curve illustrates an optimized design, whereby the color indicates the respective objective function value. The problem seems to have a clear global optimum with $t_1 = t_3 = t_5 = 3.0$mm and $t_2 = t_4 = 0.5$mm. Distributing three horizontal ribs with the maximum allowable thickness across the whole height of the component and having the two ribs in between vanish results in the best compromise between low weight and sufficiently low impactor intrusion. The respective objective value $f({\varvec{t}}) = 1.898$ is found in the majority of single-fidelity and MF (MOR) repetitions independent of the chosen DoE method. For this set of input variables, the maximum impactor displacement of the high-fidelity model is $u_{\text{max}}=50.6$mm. The specified constraint $u_{\text{allow}}=50$mm is violated by 1.2%, which is considered acceptable in this exemplary problem. All compared results have a similar level of constraint violation, which is not further investigated to focus on the multi-fidelity optimization itself.

Individual results for single-fidelity, i.e. high-fidelity, EGO show a low variation for different DoE methods and the majority of repetitions terminate at the global optimum. For MF (base) and MF + OIVLH, results vary quite significantly and only three of the overall 20 runs converge to the global optimum. Many of the evaluations terminate at points rather close to the global optimum so that the objective function value is only a few percents of the optimal value. Notably, more MF + OIVLH analyses get close to the global optimum than MF (base). The MF (MOR) approaches yield more consistent results than MF (base) and MF + OIVLH, converging to the optimum for a total of 14 out of 20 assessments across both DoE methods, with two more runs being very close to the optimum. Overall, the OIVLH-based approaches yield a higher consistency within the multi-fidelity schemes. OIVLH places samples significantly closer to design space boundaries. Therefore, the reduced order low-fidelity model being created from the high-fidelity samples is based more on interpolation between samples than on extrapolation.

In a further step, the quality of the two varying types of low-fidelity models is assessed. To that end, the low-fidelity models are evaluated at the respective determined optimum. The maximum displacement of the impactor $d_{\text{max}}$ is considered as a metric to compare high- (HF) and low-fidelity (LF) models. This metric is chosen as it directly represents the constraint and—because the mass of the component can analytically be calculated from the given design variables—it also implies the objective function of the optimization problem. As such it estimates the accuracy of the low-fidelity models in the present example. The error metrics are defined as

$$\begin{aligned} e_{\text{abs}} = \vert d_{\text{max},hf} - d_{\text{max},lf} \vert , \end{aligned}$$

(17)

$$\begin{aligned} e_{\text{rel}} = \frac{ \vert d_{\text{max},hf} - d_{\text{max},lf} \vert }{\vert d_{\text{max},hf} \vert }, \end{aligned}$$

(18)

for the absolute difference $e_{\text{abs}}$ and the relative difference $e_{\text{rel}}$ between fidelity levels, respectively. Results for all methods are listed in Table 2. Each of the listed values represents the mean value of ten evaluations. For MF (base) and MF + OIVLH the low-fidelity model is identical for all evaluations, while for the MOR based methods, the low-fidelity model depends on the initial high-fidelity samples (compare Fig. 3). Both types of low-fidelity models approximate the (maximum) impactor displacement sufficiently well with an error of $4{\text{-}}6\%$. The error indicators for the reduced order low-fidelity models are slightly better, i.e. lower, compared to those of the coarse simulation models. For the latter, the use of OIVLH sampling does not significantly impact the low-fidelity result quality. However, for MF (MOR) + OIVLH the low-fidelity model is slightly more accurate than for MF (MOR) which may explain the performance difference between the two methods. It also indicates that the increased interpolation share for OIVLH sampling can in fact increase the ROM quality.

Table 2 Side sill impact problem: Evaluation of low-fidelity model quality. For all optimization runs, low-fidelity models are evaluated for the respective optimum and maximum impactor displacements are compared to those of the high-fidelity model [see Eqs. (17) and (18)]

Full size table

Keeping in mind the quality of the results, the computational requirements for all approaches are evaluated. Average run times along with their respective standard deviations are listed in Fig. 6 for all techniques. The larger standard deviations for SF (base) and SF + OIVLH compared to the other four methods are explained by a single outlier in each of the methods. In both cases, the algorithm finds the global optimum in a reasonable number of iterations but then requires many adaptive samples to reach the specified termination criterion. So both outliers can be explained by the somewhat unrealistic definition of termination criterion here, where no maximum allowed number of iterations was specified. As the standard deviations in all methods are similar apart from that, the following comparisons are focused on the average times. SF (base) is taken as a baseline for all comparisons. Without changing the optimization itself, switching the DoE to OIVLH reduces $14\%$ of computation time in this problem, while maintaining result quality.

Using an MF approach with a coarser low-fidelity model lowers the computational cost about $47\%$ and $35\%$ for OLH and OIVLH, respectively. In fact, this is the only case in the present work, where OIVLH requires more computational time than OLH. The reason here is the same as for the outliers outlined above. Both MF (base) and MF + OIVLH find the respective optimum after a similar number of high-fidelity iterations, but the latter variant requires a few more adaptive iterations than the former to reach the termination criterion. The work load to create the low-fidelity model is not considered here, however for complex models this is an additional time intensive step. Especially, when considering that the low-fidelity model may not be needed otherwise. Contrary to that, both MF (MOR) options do not require the manual creation of a low-fidelity model. They reduce computation times by about $51\%$ and $56\%$ for MF (MOR) and MF (MOR) + OIVLH, respectively. The speed-up of MF (MOR) compared to the other MF techniques can be explained by the significantly faster evaluation times of the reduced order low-fidelity models compared to the coarse simulation model. In comparison the MF (MOR) + OIVLH approach yields the best overall improvement for the side sill impact problem. It saves on average more than $50\%$ computation time compared to SF (base) while maintaining an acceptable level of accuracy.

6.2 Frontal impact problem

A shape optimization problem of the frontal impact of a crash box, as depicted in Fig. 7 is presented as a second application problem for the suggested multi-fidelity scheme. For the high-fidelity model, an element length of about $4-5$mm is specified, while for the low-fidelity model, the element length is in the region of 10mm. This yields a high-fidelity model with 4,928 elements and a low-fidelity model with 1,296 elements. The planar impactor crushing the component from the top has a prescribed mass of 300kg and an initial velocity of 30 km/h. The impactor is modeled as a rigid body, and the crash box is constructed as a tube with a steel-like material and a piecewise linear plasticity model. Exact material parameters are listed in “Appendix B”. For the contact formulation, the LS-Dyna ’*CONTACT_ AUTOMATIC_SINGLE_SURFACE’ is applied. The simulation is terminated when the impactor stops or after 45ms.

The crash box of the optimization study and its design variables are depicted in Fig. 7. The first three design variables are the vertical positions of the triggers in the model. The corner elements of the triggers are deleted to increase numerical stability. Three additional design variables are the depths of the triggers, i.e. the respective element rows are shifted in or against the arrow directions indicated in the figure next to the variables $x_4$, $x_5$ and $x_6$. The configuration depicted in Fig. 7 represents the setup for ${\varvec{x}} = 0$ for all design variables. Model generation features are realized using ANSA preprocessor.

Throughout the optimization analyses, the mass of the crash box remains approximately constant as the effect of design variables is negligibly small. The objective function is chosen as the load uniformity, also called peak amplification, of the force-displacement curve. It is defined as the peak force $F_{\text{max}}$ divided by the mean force $F_{\text{mean}}$ of the complete force-displacement curve measured at the rigid body impactor. The optimization problem can then be formulated as

$$\begin{aligned}&\!\min _{{\varvec{x}}}&f({\varvec{x}}) = \frac{F_{\text{max}}}{F_{\text{mean}}}, \end{aligned}$$

(19a)

$$\begin{aligned}&\text {where}&-10\,\text {mm} \le x_i \le 10\,\text {mm}, \quad i = 1, 2, 3, \end{aligned}$$

(19b)

$$\begin{aligned}&\quad&-4\,\text {mm} \le x_i \le 4\,\text {mm}, \quad i = 4, 5, 6. \end{aligned}$$

(19c)

The processing time of the classical low-fidelity model is one fourth of the high-fidelity computation time. The number of initial samples is accordingly set to 60 in single-fidelity analysis, and 30/90 for high-/low-fidelity samples in the multi-fidelity methods. For this problem, all possible termination criteria presented in Sect. 5 are specified and respective values are listed in Appendix Table 7. To construct the snapshot matrix for the reduced order model, 30 initial high-fidelity samples are utilized. With the truncation energy of $\mu = 0.9999$, a subspace with approximately $k=15$ bases is computed. Moreover, the kNN regressor using 5 neighbors is trained for the unknowns in the reduced subspace. A single evaluation of the reduced order model has a speed-up factor of 50 to 100 in comparison to the high-fidelity simulation.

To investigate the convergence of the optimization schemes, Fig. 8 shows the best current objective values in each iteration for all (high-fidelity) evaluations of SF (base) (grey lines) and MF (MOR) + OIVLH (black lines). These two methods are representatively chosen from all investigated methods for the purpose of clarity. The adaptive phase of the algorithm starts after 60 evaluations for the single-fidelity case and 30 evaluations for the multi-fidelity case. Both approaches reach objective values below 3.4, i.e. close to the final optimum, mostly within ten adaptive high-fidelity evaluations. Afterwards only slight improvements are achieved.

In addition, the termination criteria can give valuable insights to the performed optimization studies. Here, the observations are contrary to those of the previous lateral impact example, where all optimization analyses are terminated by the threshold infill criterion. A majority of analyses for the crash box example terminate after reaching the maximum number of allowed iterations. Of course, there is an argument to be made that these optimization runs may not be fully converged. However, as seen in Fig. 8 and because some runs do actually terminate due to $IC_{th}$, the authors believe that this is not the case. A maximum allowable number of iterations also represents a situation more akin to a practical application case. It matters more to find an acceptable result in a given time as opposed to the best result in as much time as necessary.

A comparison of the optimization results for the presented methods is depicted in Fig. 9. Low objective function values of slightly below 3.3 depend on $x_1$ on its upper limit of 10mm, $x_2$ around its lower limit of $-10$mm and $x_3$ in a mid region albeit slightly above 0mm. Only the first of the trigger depths $x_4$, $x_5$ and $x_6$, which is found close the maximum of 4mm throughout all methods, seems to significantly impact the optimization results. A significant share of the crash energy is absorbed in the first fold, i.e. the one controlled by $x_1$ and $x_4$. It usually also contains the total maximum in the force-displacement curve $F_{\text{max}}$. To that end, it appears reasonable that those two design variables appear to always converge to similar values, while the others, especially $x_5$ and $x_6$, may vary between optimizations.

All compared methods yield rather similar results in terms of objective function values except for a total of three runs in MF (base) and MF (MOR). In these, the best objective value is up to $10\%$ off the best overall result because the algorithm terminates in a local optimum. For the SF approaches, OIVLH appears to not affect result quality or consistency. In the multi-fidelity setups however, OIVLH sampling reduces the number of outliers compared to MF (base) and MF (MOR), respectively. All variants utilizing OIVLH sampling as well as the baseline single-fidelity EGO yield very similar results.

In a further step the quality of low-fidelity models is evaluated as done for the previous example. Here the same error metric is chosen mainly for two reasons. First, as the impactor kinetic energy remains constant, the maximum impactor displacement is directly related to the mean force $F_{\text{mean}}$ which is part of the objective function. Second, having the same comparison metric as in the previous example allows for comparisons of low-fidelity model accuracy between the two examples. Table 3 lists mean values for the differences in maximum impactor displacement between low- and high-fidelity models in the different methods [compare Eqs. (17) and (18)]. In this example, the coarse simulation model of MF (base) and MF + OIVLH has a significantly larger error value compared to the ROMs. Due to the highly nonlinear nature of frontal impact simulation, the mesh size has a high impact on the crushing behaviour of the component. As the ROM-based low-fidelity models utilize the high-fidelity model mesh for learning and predictions, they are not affected and show better accuracy compared to the coarse low-fidelity model. The present example confirms that the application of OIVLH for reduced order models increases the prediction accuracy. The results of MF (base) and MF + OIVLH, as listed in Table 3 differ mainly due to one single outlier in MF + OIVLH, where the low-fidelity model predicts a significantly earlier stop of the impactor, thus inflating the error measure. Overall, the low-fidelity accuracy of the two methods is similar, which is expected as the choice of DoE should not have an impact on the simulation model.

Table 3 Frontal impact problem: Evaluation of low-fidelity model quality. For all optimization runs, low-fidelity models are evaluated for the respective optimum and maximum impactor displacements are compared to the high-fidelity model [see Eqs. (17) and (18)]

Full size table

Computational requirements are again compared for all different methods using the mean run time of each optimization. The averaged processing times for all techniques are plotted in Fig. 10. Regarding the varying EGO approaches, the findings here are highly similar to those of the previous problem. Compared to SF (base), MF (base) and MF + OIVLH yield about $30{\text{-}}45\%$ speed-up while MF (MOR) and MF (MOR) + OIVLH result in $50{\text{-}}55\%$ time reduction. Here, the use of OIVLH over OLH shows slight time benefits across all variants. Notably, the time benefit SF + OIVLH yields over SF (base) is significantly smaller than for the previous example (compare Fig. 6). This is due to the difference in applied termination criterion between the examples.

Overall, results for the present application example closely match those for the side sill impact example presented above. The MF (MOR) + OIVLH scheme performs best with time-savings of more than $50\%$ compared to SF (base). Here, all approaches utilizing OIVLH perform better than respective OLH variants.

7 Conclusions

In the present work, a multi-fidelity efficient global optimization based on recently proposed hierarchical kriging and variable-fidelity expected improvement is applied to crashworthiness examples and its performance is investigated. Additionally, two adaptations to the scheme regarding initial design of experiments and the choice of low-fidelity model are proposed. For the former, a recently developed variant of Latin hypercube sampling is chosen that places samples closer to design space boundaries and thus allows for more interpolation instead of extrapolation in surrogate models. For the latter, a non-intrusive model order reduction scheme is applied as low-fidelity model as it integrates nicely into the existing multi-fidelity optimization scheme. All different optimization schemes are investigated on two crashworthiness application examples: one side sill impact size optimization and one frontal impact shape optimization. Already with the rather small problems presented here, results show that multi-fidelity optimization is capable of reducing computational costs of the optimizations significantly while not compromising result quality. Both proposed adaptations independently and especially combined further reduce computation times and also increase result quality compared to the baseline multi-fidelity optimization. Especially the use of non-intrusive reduced order modeling techniques is promising as it removes the need to (manually) create an additional low-fidelity model. Together, a speed-up factor of two in the optimization with next to no influence on result quality is observed.

The problems shown in the present work represent rather small examples with a low number of design variables. Based on the works introducing OIVLH (Kaps et al. 2021) and previous investigations on the projection of large-scale systems (Bach et al. 2019), it seems reasonable to assume that the advantages of the proposed schemes grow as the model size and number of design variables increase.

Multi-fidelity optimization is a wide topic with a variety of different applications and many imaginable adjustments to be explored. Based on the promising results of the present work we have collected some topics and questions that we believe to be interesting for future work:

In the present work all DoEs are performed separately for different fidelity levels and with no connection between levels. It seems only reasonable to use a multi-fidelity DoE scheme if the initial samples are combined into a multi-fidelity surrogate. Multi-fidelity sampling approaches proposed so far, require the high-fidelity DoE to be a subset of the low-fidelity DoE. It could be investigated, how these approaches perform for multi-fidelity optimizations shown here and if methodological improvements can be achieved.
We believe the potential of the proposed multi-fidelity scheme(s) should be confirmed in further studies on more complex larger crashworthiness application problems and other fields of applications.
A big challenge in practical applications is robustness with regards to both the method as well as the objective function. An optimization method should produce consistent results for given inputs, as in practice, repeating runs is often infeasible. Moreover, an optimum highly sensitive to small perturbations of the inputs is also not desirable. To that end, an effort has to be made to integrate robustness into the optimization framework seamlessly.

Notes

Implemented in IncPACK library.

References

Acar E, Yilmaz B, Güler MA, Altin M (2020) Multi-fidelity crashworthiness optimization of a bus bumper system under frontal impact. J Braz Soc Mech Sci Eng 42(9):1–17. https://doi.org/10.1007/s40430-020-02572-3
Article Google Scholar
Alaimo G, Auricchio F, Bianchini I, Lanzarone E (2018) Applying functional principal components to structural topology optimization. Int J Numer Methods Eng 115(2):189–208. https://doi.org/10.1002/nme.5801
Article MathSciNet Google Scholar
Anselma PG, Niutta CB, Mainini L, Belingardi G (2020) Multidisciplinary design optimization for hybrid electric vehicles: component sizing and multi-fidelity frontal crashworthiness. Struct Multidisc Optim 62(4):2149–2166. https://doi.org/10.1007/s00158-020-02603-6
Article Google Scholar
Assou S, Tourbier Y, Gstalter E, Charrier M, Dessombz O, Jézéquel L (2019) A reduced model using random forest: application on car crash optimization. SeMA J. https://doi.org/10.1007/s40324-019-00208-8
Article MATH Google Scholar
Audze P, Eglais V (1977) New approach to the design of experiments. Probl Dyn Strength 35:104–107
Google Scholar
Bach C, Ceglia D, Song L, Duddeck F (2019) Randomized low-rank approximation methods for projection-based model order reduction of large nonlinear dynamical problems. Int J Numer Methods Eng 118(4):209–241. https://doi.org/10.1002/nme.6009
Article MathSciNet Google Scholar
Baker CG, Gallivan KA, Dooren PV (2012) Low-rank incremental methods for computing dominant singular subspaces. Linear Algebra Appl 436(8):2866–2888. https://doi.org/10.1016/j.laa.2011.07.018
Article MathSciNet MATH Google Scholar
Bonfiglio L, Perdikaris P, Brizzolara S, Karniadakis G (2018) Multi-fidelity optimization of super-cavitating hydrofoils. Comput Methods Appl Mech Eng 332:63–85. https://doi.org/10.1016/j.cma.2017.12.009
Article MathSciNet MATH Google Scholar
Chang KJ, Haftka RT, Giles GL, Kao PJ (1993) Sensitivity-based scaling for approximating structural response. J Aircr 30(2):283–288. https://doi.org/10.2514/3.48278
Article Google Scholar
Choi Y, Oxberry G, White D, Kirchdoerfer T (2019) Accelerating design optimization using reduced order models. pp 1–20. https://doi.org/10.13140/rg.2.2.16056.08965
Duddeck F (2008) Multidisciplinary optimization of car bodies. Struct Multidisc Optimi 35(4):375–389. https://doi.org/10.1007/s00158-007-0130-6
Article Google Scholar
Forrester AI, Keane AJ (2009) Recent advances in surrogate-based optimization. Prog Aerosp Sci 45(1–3):50–79. https://doi.org/10.1016/j.paerosci.2008.11.001
Article Google Scholar
Forrester AI, Sóbester A, Keane AJ (2007) Multi-fidelity optimization via surrogate modelling. Proc R Soc: Mathe Phys Eng Sci 463(2088):3251–3269. https://doi.org/10.1098/rspa.2007.1900
Article MathSciNet MATH Google Scholar
Gano SE, Renaud JE, Martin JD, Simpson TW (2006) Update strategies for kriging models used in variable fidelity optimization. Struct Multidisc Optim 32(4):287–298. https://doi.org/10.1007/s00158-006-0025-y
Article Google Scholar
Garud SS, Karimi IA, Kraft M (2017) Design of computer experiments: a review. Comput Chem Eng 106:71–95. https://doi.org/10.1016/j.compchemeng.2017.05.010
Article Google Scholar
Golub GH, Van Loan CFCF (2013) Matrix computations, 4th edn. The Johns Hopkins University Press, Baltimore
MATH Google Scholar
Gratiet LL, Garnier J (2014) Recursive co-kriging model for design of computer experiments with multiple levels of fidelity. Int J Uncertain Quantif 4(5):365–386. https://doi.org/10.1615/int.j.uncertaintyquantification.2014006914
Article MathSciNet Google Scholar
Gstalter E, Assou S, Tourbier Y, De Vuyst F (2020) Toward new methods for optimization study in automotive industry including recent reduction techniques. Adv Model Simul Eng Sci 7(1):1–16. https://doi.org/10.1186/s40323-020-00151-8
Article Google Scholar
Guo M, Hesthaven JS (2017) Reduced order modeling for nonlinear structural analysis using Gaussian process regression. Comput Methods Appl Mech Eng 341:807–826. https://doi.org/10.1016/j.cma.2018.07.017
Article MathSciNet MATH Google Scholar
Guo M, Hesthaven JS (2019) Data-driven reduced order modeling for time-dependent problems. Comput Methods Appl Mech Eng 345:75–99. https://doi.org/10.1016/j.cma.2018.10.029
Article MathSciNet MATH Google Scholar
Han ZH, Görtz S (2012) Hierarchical kriging model for variable-fidelity surrogate modeling. AIAA J 50(9):1885–1896. https://doi.org/10.2514/1.J051354
Article Google Scholar
Han ZH, Zimmermann R, Görtz S (2012) Alternative cokriging method for variable-fidelity surrogate modeling. AIAA J 50(5):1205–1210. https://doi.org/10.2514/1.j051243
Article Google Scholar
Han ZH, Görtz S, Zimmermann R (2013) Improving variable-fidelity surrogate modeling via gradient-enhanced kriging and a generalized hybrid bridge function. Aerosp Sci Technol 25(1):177–189. https://doi.org/10.1016/j.ast.2012.01.006
Article Google Scholar
Huang D, Allen TT, Notz WI, Miller RA (2006) Sequential kriging optimization using multiple-fidelity evaluations. Struct Multidisc Optim 32(5):369–382. https://doi.org/10.1007/s00158-005-0587-0
Article Google Scholar
Hunkeler S, Duddeck F, Rayamajhi M, Zimmer H (2013) Shape optimisation for crashworthiness followed by a robustness analysis with respect to shape variables. Struct Multidisc Optim 48(2):367–378. https://doi.org/10.1007/s00158-013-0903-z
Article Google Scholar
Jin R, Chen W, Sudjianto A (2003) An efficient algorithm for constructing optimal design of computer experiments. international design engineering technical conferences and computers and information in engineering conference, Chicago, IL, USA, vol 37009. pp 545–554
Jones DR (2001) A taxonomy of global optimization methods based on response surfaces. J Glob Optim 21(4):345–383. https://doi.org/10.1023/A:1012771025575
Article MathSciNet MATH Google Scholar
Jones DR, Schonlau M, Welch WJ (1998) Efficient global optimization of expensive black-box functions. J Glob Optim 13(4):455–492. https://doi.org/10.1023/A:1008306431147
Article MathSciNet MATH Google Scholar
Journel AG, Huijbregts C (1978) Mining geostatistics. Academic Press, New York. https://doi.org/10.1180/minmag.1979.043.328.34
Book Google Scholar
Kaps A, Komeilizadeh K, Duddeck F (2021) An ISO-volumetric weighting approach to increase efficiency of stratified samplings. In: 14th world congress of structural and multidisciplinary optimization, Boulder, CO, USA.
Kast M, Guo M, Hesthaven JS (2020) A non-intrusive multifidelity method for the reduced order modeling of nonlinear problems. Comput Methods Appl Mech Eng 364:112947. https://doi.org/10.1016/j.cma.2020.112947
Article MathSciNet MATH Google Scholar
Kennedy MC, O’Hagan A (2000) Predicting the output from a complex computer code when fast approximations are available. Biometrika 87(1):1–13. https://doi.org/10.1093/biomet/87.1.1
Article MathSciNet MATH Google Scholar
Kneifl J, Grunert D, Fehr J (2021) A nonintrusive nonlinear model reduction method for structural dynamical problems based on machine learning. Int J Numer Methods Eng 122(17):1–13. https://doi.org/10.1002/nme.6712
Article Google Scholar
Krige DG (1951) A statistical approach to some basic mine valuation problems on the Witwatersrand. J South Afr Inst Min Metall 52(6):119–139
Google Scholar
Lange VA, Fender J, Song L, Duddeck F (2018) Early phase modeling of frontal impacts for crashworthiness: from lumped mass-spring models to deformation space models. Proc Inst Mech Eng D 233(12):3000–3015. https://doi.org/10.1177/0954407018814034
Article Google Scholar
Le Guennec Y, Brunet JP, Daim FZ, Chau M, Tourbier Y (2018) A parametric and non-intrusive reduced order model of car crash simulation. Comput Methods Appl Mech Eng 338:186–207. https://doi.org/10.1016/j.cma.2018.03.005
Article MathSciNet MATH Google Scholar
Matheron G (1963) Principles of geostatistics. Econ Geol 58(8):1246–1266. https://doi.org/10.2113/gsecongeo.58.8.1246
Article Google Scholar
Morris MD, Mitchell TJ (1995) Exploratory designs for computational experiments. J Stat Plan Inference 43(3):381–402. https://doi.org/10.1016/0378-3758(94)00035-T
Article MATH Google Scholar
Oxberry GM, Kostova-Vassilevska T, Arrighi W, Chand K (2017) Limited-memory adaptive snapshot selection for proper orthogonal decomposition. Int J Numer Methods Eng 109(2):198–217. https://doi.org/10.1002/nme.5283
Article MathSciNet Google Scholar
Park C, Haftka RT, Kim NH (2016) Remarks on multi-fidelity surrogates. Struct Multidisc Optim 55(3):1029–1050. https://doi.org/10.1007/s00158-016-1550-y
Article MathSciNet Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830
MathSciNet MATH Google Scholar
Perdikaris P, Karniadakis GE (2016) Model inversion via multi-fidelity Bayesian optimization: a new paradigm for parameter estimation in haemodynamics, and beyond. J R Soc Interface 13(118):20151107. https://doi.org/10.1098/rsif.2015.1107
Article Google Scholar
Phalippou P, Bouabdallah S, Breitkopf P, Villon P, Zarroug M (2020) ‘On-the-fly’ snapshots selection for proper orthogonal decomposition with application to nonlinear dynamics. Comput Methods Appl Mech Eng 367:113120. https://doi.org/10.1016/j.cma.2020.113120
Article MathSciNet MATH Google Scholar
Rajabi MM, Ataie-Ashtiani B, Janssen H (2015) Efficiency enhancement of optimized Latin hypercube sampling strategies: application to Monte Carlo uncertainty analysis and meta-modeling. Adv Water Resour 76:127–139. https://doi.org/10.1016/j.advwatres.2014.12.008
Article Google Scholar
Rasmussen CE, Williams CKI (2005) Gaussian processes for machine learning. MIT Press Ltd, Cambridge
Book Google Scholar
Redhe M, Giger M, Nilsson L (2004) An investigation of structural optimization in crashworthiness design using a stochastic approach. Struct Multidisc Optim 27(6):446–459. https://doi.org/10.1007/s00158-004-0400-5
Article Google Scholar
Ren C, Min H, Ma T, Wang F (2020) Efficient structure crash topology optimization strategy using a model order reduction method combined with equivalent static loads. Proc Inst Mech Eng D 234(7):1897–1911. https://doi.org/10.1177/0954407019893841
Article Google Scholar
Sacks J, Welch WJ, Mitchell TJ, Wynn HP (1989) Design and analysis of computer experiments. Stat Sci 4(4):409–423. https://doi.org/10.1214/ss/1177012413
Article MathSciNet MATH Google Scholar
Sirovich L (1987) Turbulence and the dynamics of coherent structures. I. Coherent structures. Q Appl Math 45(10):561–571. https://doi.org/10.1090/qam/910462
Article MathSciNet MATH Google Scholar
Storn R, Price K (1997) Differential evolution—a simple and efficient heuristic for global optimization over continuous spaces. J Glob Optim 11(4):341–359. https://doi.org/10.1023/a:1008202821328
Article MathSciNet MATH Google Scholar
Sun G, Li G, Stone M, Li Q (2010) A two-stage multi-fidelity optimization procedure for honeycomb-type cellular materials. Comput Mater Sci 49(3):500–511. https://doi.org/10.1016/j.commatsci.2010.05.041
Article Google Scholar
Sun G, Li G, Zhou S, Xu W, Yang X, Li Q (2010) Multi-fidelity optimization for sheet metal forming process. Struct Multidisc Optim 44(1):111–124. https://doi.org/10.1007/s00158-010-0596-5
Article Google Scholar
Swischuk R, Mainini L, Peherstorfer B, Willcox K (2019) Projection-based model reduction: formulations for physics-based machine learning. Comput Fluids 179:704–717. https://doi.org/10.1016/j.compfluid.2018.07.021
Article MathSciNet MATH Google Scholar
Toal DJJ, Bressloff NW, Keane AJ (2008) Kriging hyperparameter tuning strategies. AIAA J 46(5):1240–1252. https://doi.org/10.2514/1.34822
Article Google Scholar
Xiao M, Lu D, Breitkopf P, Raghavan B, Dutta S, Zhang W (2020) On-the-fly model reduction for large-scale structural topology optimization using principal components analysis. Struct Multidisc Optim 62:209–230. https://doi.org/10.1007/s00158-019-02485-3
Article MathSciNet Google Scholar
Ye KQ, Li W, Sudjianto A (2000) Algorithmic construction of optimal symmetric Latin hypercube designs. J Stat Plan Inference 90(1):145–159. https://doi.org/10.1016/S0378-3758(00)00105-1
Article MathSciNet MATH Google Scholar
Yu J, Yan C, Guo M (2019) Non-intrusive reduced-order modeling for fluid problems: a brief review. Proc Inst Mech Eng G 233(16):5896–5912. https://doi.org/10.1177/0954410019890721
Article Google Scholar
Zhang Y, Han ZH, Zhang KS (2018a) Variable-fidelity expected improvement method for efficient global optimization of expensive functions. Struct Multidisc Optim 58(4):1431–1451. https://doi.org/10.1007/s00158-018-1971-x
Article MathSciNet Google Scholar
Zhang Y, Kim NH, Park C, Haftka RT (2018b) Multifidelity surrogate based on single linear regression. AIAA J 56(12):4944–4952. https://doi.org/10.2514/1.J057299
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank Koushyar Komeilizadeh for valuable insights and discussions.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

TUM School of Engineering and Design, Technical University of Munich, Arcisstr. 21, 80333, Munich, Germany
Arne Kaps, Catharina Czech & Fabian Duddeck

Authors

Arne Kaps
View author publications
You can also search for this author in PubMed Google Scholar
Catharina Czech
View author publications
You can also search for this author in PubMed Google Scholar
Fabian Duddeck
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

AK: conceptualization, AK, CC: methodology, AK: software, AK: formal analysis and investigation, AK, CC: writing—original draft preparation, AK, CC, FD: writing—review and editing, FD: supervision.

Corresponding author

Correspondence to Arne Kaps.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Replication of results

The detailed information of the proposed methods and the corresponding software employed in this paper can be found in respective sections. This includes algorithms and the used hyper-parameters. A detailed discussion about the implementation can also be found at the end of Sect. 5. The self-implemented parts of the in-house Python code can be obtained from the corresponding author on reasonable request.

Additional information

Responsible Editor: Shikui Chen

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A

See Tables 4 and 5 (Figs. 11, 12).

Table 4 Side sill impact problem: general modeling properties

Full size table

Table 5 Side sill impact problem: material properties used for modeling the aluminum component

Full size table

Appendix B

See Tables 6 and 7.

Table 6 Frontal impact problem: material properties used for modeling the steel component

Full size table

Table 7 Table caption

Full size table

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Kaps, A., Czech, C. & Duddeck, F. A hierarchical kriging approach for multi-fidelity optimization of automotive crashworthiness problems. Struct Multidisc Optim 65, 114 (2022). https://doi.org/10.1007/s00158-022-03211-2

Download citation

Received: 18 November 2021
Revised: 21 January 2022
Accepted: 21 February 2022
Published: 18 March 2022
DOI: https://doi.org/10.1007/s00158-022-03211-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A hierarchical kriging approach for multi-fidelity optimization of automotive crashworthiness problems

Abstract

Similar content being viewed by others

Multi-Objective Optimization for Structure Crashworthiness Based on Kriging Surrogate Model and Simulated Annealing Algorithm

Hybrid Strategy Coupling EGO and CMA-ES for Structural Topology Optimization in Statics and Crashworthiness

Data-driven models for crashworthiness optimisation: intrusive and non-intrusive model order reduction techniques

1 Introduction

2 Isovolumetric design of experiments

3 Multi-fidelity efficient global optimization