1 Introduction

The quest for ever more innovative engineering products has motivated the development of highly-accurate performance analysis tools, often prime-principle based and therefore able to span a vast variety of design solutions and operating conditions. The need for fast, effective, and ingenious design decisions has stimulated the integration of these tools with efficient global optimization approaches into simulation-driven design (SDD) [19] architectures. Naval architecture and ocean engineering are no exceptions, experiencing an outstanding development of computational tools to assess complex physical phenomena at different spatial and temporal scales [5, 32, 44], along with their integration in global SDD optimization approaches [11, 42].

These tools are generally computationally expensive, making the design- and operational-space exploration a technological and algorithmic challenge, thus preventing their widespread use in the industrial community. Recent work in the context of computational fluid dynamics (CFD) optimization showed how a hull-form optimization based on accurate unsteady Reynolds Averaged Navier-Stokes (URANS) solutions under realistic stochastic conditions may require up to 500 k CPU hours on high performance computing (HPC) systems, even if computational cost reduction methods are used [42]. Similarly, an accurate URANS-based statistically significant evaluation of ship maneuvering performance in irregular waves may require up to 1 M CPU hours on HPC systems [37]. Furthermore, multidisciplinary design optimization (MDO) problems, involving several interconnected disciplines, usually require additional computational efforts especially for global optimization problems [21] and stochastic conditions [12]. Achieving global and stochastic MDO still remains a challenge from the computational viewpoint [26].

To reduce the computational cost of SDD procedures, supervised machine learning methods in the form of surrogate models have been developed and successfully applied in several engineering domains [45]. With these methods, an approximate and easy to evaluate model of expensive computations is constructed based on a limited number of simulations. The design-space exploration and optimization are then performed on the surrogate model, increasing the efficiency of the overall SDD process.

The performance of surrogate models is problem-dependent and determined by several concurrent factors, such as the presence of nonlinearities, the problem dimensionality, and the approach used for the training [23]. Therefore, in the last decade, the research has moved from static to function-adaptive approaches, also known as dynamic (adaptive) surrogate models [46], which are able to improve their fitting capability by active learning. Namely, the design of experiments used for the surrogate model training is not defined a priori but dynamically updated, exploiting the information about the problem that becomes available during the analysis process. Training points are dynamically added based on active learning criteria, with the aim of producing globally accurate representations of the desired functions with as few training points as possible [41]. Unfortunately, the active learning process is generally affected by the computational-output noise [23], which is often unavoidable when large systems of nonlinear partial differential equations are numerically solved. Active learning methods may react to noise by adding many training points in noisy regions, rather than selecting new points in unseen regions [51]. This may deteriorate the model quality/efficiency and needs to be carefully considered.

In addition to dynamic/adaptive surrogate models and with the aim of reducing further the computational cost associated with SDD, multi-fidelity (MF) approximation methods have been developed, aiming at combining the accuracy of high-fidelity solvers with the reduced computational cost of low-fidelity solvers [4]. Thus, MF surrogate models use mainly low-fidelity simulations and only a few high-fidelity simulations, used to preserve the model accuracy. Additive and/or multiplicative correction methods, also known as “bridge functions” [18], can be used to build MF surrogate models. Several surrogate models have been used in the literature with MF data, such as non-intrusive polynomial chaos [29], co-kriging [3] and stochastic radial basis functions (SRBF) [41]. In general, different fidelity levels may be obtained by varying the physical model, the grid size, and/or combining experimental data with simulations. The active learning process is extended to MF analysis via the automatic selection of both the desired training point and fidelity level [41]. Most MF methods use two fidelity levels, nevertheless the approach can be generalized to an arbitrary number of fidelity levels, as shown in [40]. The use of MF models with noisy training data and the assessment of the effect of the noise associated with each fidelity level are still little discussed and require a rigorous assessment.

The objective of the present work is to present a generalized adaptive MF surrogate model for global design optimization of complex industrial problems affected by noisy performance evaluations. The proposed MF method advances the authors’ previous work by combining an arbitrary number of fidelity levels [40] with noise reduction and in-the-loop optimization of the MF surrogate model [51]. SRBF [46] are used in combination with an active learning method based on the lower confidence bounding (LCB), combining objective function prediction and associated uncertainty [7, 41]. Furthermore, the present formulation fully exploits the potential of simulation methods that naturally produce results spanning a range of fidelity levels: i.e., URANS simulations with adaptive grid refinement [47] or multi-grid resolution [5].

The performance of the proposed MF method is assessed using four analytical test problems [25] and three SDD problems pertaining to the minimization of: (1) the drag-coefficient of a NACA hydrofoil, (2) the calm-water resistance of the DTMB 5415 ship model, and (3) the calm-water resistance/payload ratio of a roll-on/roll-off passengers (RoPax) ferry, under the assumption of a limited budget for function evaluations. CFD computations are based on two URANS solvers: ISIS-CFD [32], developed at Ecole Centrale de Nantes/CNRS and integrated in the FINE/Marine simulation suite from Cadence Design Systems, for the NACA hydrofoil and the DTMB 5415; and \(\chi\)navis [5, 8, 9], developed at CNR-INM, for the RoPax ferry. In ISIS-CFD, mesh deformation and adaptive grid refinement are adopted to allow the automatic shape deformation. The fidelity levels are defined by the grid refinement ratio. In \(\chi\)navis, the mesh deformation is initially computed on the hull-surface grid and then propagated in the volume mesh. The different fidelities are the multi-grid levels. The problems are solved with a number of fidelity levels between one and four.

The remainder of this paper is organized as follows. Section 2 introduces the generalized multi-fidelity active learning method. Section 3 describes the analytical tests and the SDD problems. CFD solvers and numerical setup for the solutions of the SDD problems are provided in Sects. 4 and 5, respectively. The numerical results are discussed in Sect. 6 and, finally, conclusions and an outlook on future work are presented in Sect. 7.

2 Multi-fidelity surrogate modeling

Consider an objective function \(f({{\mathbf {x}}})\), where \({{\mathbf {x}}}\in \mathbb {R}^D\) is the design vector of dimension D. Let the true function \(f({{\mathbf {x}}})\) be assessed by numerical simulations \(s_l({{\mathbf {x}}})\) with different fidelity levels l, which are considered to be affected by random noise:

$$\begin{aligned} s_l({{\mathbf {x}}}) \equiv f_l({{\mathbf {x}}}) + {{\mathcal {N}}}_l({{\mathbf {x}}}) ~~~ \mathrm {with} ~~~ l=1,\dots ,N, \end{aligned}$$
(1)

where \(s_1({{\mathbf {x}}})\) denotes the highest-fidelity level, \(s_N({{\mathbf {x}}})\) is the lowest-fidelity, and \(\{s_l({{\mathbf {x}}})\}_{l=2}^{N-1}\) are the intermediate-fidelity levels. \(f_l({{\mathbf {x}}})\) is the hypothetical simulation response without noise. The simulation noise for each fidelity level \({{\mathcal {N}}}_l\) is considered as realizations of zero-mean uncorrelated random variables. This noise will be (partially) removed in the surrogate models. A multi-fidelity approximation \(\hat{f}({{\mathbf {x}}})\) of \(f({{\mathbf {x}}})\) can then be built by hierarchical superposition of the lowest-fidelity surrogate model \(\tilde{f}_N({{\mathbf {x}}})\) and the inter-level errors \(\tilde{\varepsilon }_l({{\mathbf {x}}})\) as

$$\begin{aligned} f({{\mathbf {x}}})\approx \hat{f}({{\mathbf {x}}})=\tilde{f}_N({{\mathbf {x}}})+\sum _{l=1}^{N-1}\tilde{\varepsilon }_l({{\mathbf {x}}}), \end{aligned}$$
(2)

For the l-th fidelity level the available simulation data is defined as \(\mathcal {T}_l=\{({{\mathbf {x}}}_j^{{\mathsf {T}}},s_l({{\mathbf {x}}}_j))\}_{j=1}^{\mathcal {J}_l}\), with \(\mathcal {J}_l\) the training set size. The resulting inter-level error training set, used to realize the error surrogate \(\tilde{\varepsilon }_l({{\mathbf {x}}})\), is defined as \({{\mathcal {E}}}_l=\{({{\mathbf {x}}}_j^{{\mathsf {T}}}, \varepsilon _l({{\mathbf {x}}}_j))\}_{j=1}^{\mathcal {J}_l}\), with

$$\begin{aligned} \varepsilon _l({{\mathbf {x}}}_j)=s_l({{\mathbf {x}}}_j)-\hat{f}_{l+1}({{\mathbf {x}}}_j), \end{aligned}$$
(3)

where the generical l-th multi-fidelity surrogate is:

$$\begin{aligned} \hat{f}_l({{\mathbf {x}}})=\tilde{f}_N({{\mathbf {x}}})+\sum _{i=l}^{N-1}\tilde{\varepsilon }_l({{\mathbf {x}}}). \end{aligned}$$
(4)

The choice of \(\hat{f}_{l+1}\) instead of \(s_{l+1}\) is based on the idea that a significant amount of noise can be present in the sampling data and that the surrogate models effectively filter this noise. Thus, when the filtering of noise is successful, the \((l+1)\)-th surrogate model is a better representation of the \((l+1)\)-th response than the actual simulations. However, \(s_{l+1}\) is available: to ensure the most effective training process and considering the nature of the CFD solvers (using adaptive and/or multi grids), lower-fidelity simulations are added in all the points \({{\mathbf {x}}}_j\) where a high-fidelity point is simulated. If there is no noise, the surrogate model \(\hat{f}_{l+1}({{\mathbf {x}}}_j)\) interpolates the data in all the training points, so the error data in the training points \(\varepsilon _l({{\mathbf {x}}}_j)\) is exact and equal to \(s_l({{\mathbf {x}}}_j)-s_{l+1}({{\mathbf {x}}}_j)\).

Fig. 1
figure 1

Example of MF surrogate model with \(N=2\) (without noise)

Fig. 2
figure 2

Example of the active learning method using one fidelity without noise: (left) shows the initial surrogate model with the associated prediction uncertainty and training set; (right) shows the position of the new training point and the new surrogate model prediction and its uncertainty

Finally, given a surrogate modeling that provides both the prediction and the associated uncertainty, and assuming that the uncertainties associated with the lowest-fidelity level \(U_{\tilde{f}_N}\) and the inter-level errors \(U_{\hat{\varepsilon }_l}\) are uncorrelated, the multi-fidelity prediction uncertainty \(U_{\hat{f}}\) reads

$$\begin{aligned} U_{\hat{f}}({{\mathbf {x}}})= \sqrt{U^2_{\tilde{f}_N}({{\mathbf {x}}})+\sum _{l=1}^{N-1}U^2_{{\tilde{\varepsilon }}_l}({{\mathbf {x}}})}. \end{aligned}$$
(5)

An example with two fidelities, without noise, is shown in Fig. 1.

In the present work, the term uncertainty always refers to the surrogate model prediction uncertainty (see Eq. 5), while the noise is associated with the objective function evaluation and intrinsically related to the fidelity level: as often happens, higher fidelities tend to be less noisy.

2.1 Active learning

The multi-fidelity surrogate model is dynamically updated by adding new training points following a two-step procedure:

  1. 1.

    Identify the new training point \({{\mathbf {x}}}^\star\);

  2. 2.

    Defining \(\beta _l=c_{l}/c_1\), where \(c_l\) is the computational cost associated to the l-th level and \(c_1\) the computational cost of the highest-fidelity, \(\varvec{\phi }\equiv \{ U_{{\hat{\varepsilon }}_1}/\beta _1, \ldots ,\) \(U_{{\hat{\varepsilon }}_{N-1}}/\beta _{N-1}, U_{\tilde{f}_N}/\beta _N \}\) as the fidelity selection vector, and \(l^*=\mathrm {maxloc}(\varvec{\phi })\), add the new training point to the \(l^*\)-th training set \(\mathcal {T}_{l^*}\) (as well as to the lower-fidelity sets from \(l^*+1\) up to \(l^*=N\)).

The identification of the new training points is based on the LCB sampling method [7] with equal weight for the function value and the prediction uncertainty (see Fig. 2). It aims to find points with large prediction uncertainty and small objective function value. Accordingly, LCB identifies new training points by minimizing the acquisition function \(\psi (\mathbf{x})\)

$$\begin{aligned} \mathbf{x}^\star = \mathop {argmin}\limits _{\mathbf{x}} \left[ \psi (\mathbf{x}) \right] \end{aligned}$$
(6)

where

$$\begin{aligned} \psi (\mathbf{x})=\hat{f}(\mathbf{x})-U_{\hat{f}}(\mathbf{x})+P_x(\mathbf{x}) \end{aligned}$$
(7)

and

$$\begin{aligned} P_x(\mathbf{x}) = {\left\{ \begin{array}{ll} \frac{1}{\epsilon }\frac{d_0-d(\mathbf{x})}{d_0} &{}\mathrm {if} \,\,\, d(\mathbf{x})<d_0,\\ 0 &{}\mathrm {else} \end{array}\right. } \end{aligned}$$
(8)

is a penalization factor based on the distance from the existing training sets (considering all fidelities) to prevent the sampling of already sampled points and having matrix \(\mathbf {A}\) ill-conditioned (see Eq. 11). \(\epsilon\) is a coefficient here set equal to \(1E-1\), \(d(\mathbf{x})\) is the distance of the point \(\mathbf{x}\) to the closest point identified among all the training sets, and \(d_0=5.0E-3\) is the minimum acceptable distance to an existing training point.

2.2 Least squares regression via in-the-loop optimization

The surrogate model predictions \(\tilde{f}({{\mathbf {x}}})\) are computed as the expected value (EV) over a stochastic ensemble of radial basis function (RBF) surrogate models, defined by a stochastic tuning parameter, \(\tau \sim \text {unif} [1,3]\):

$$\begin{aligned} \tilde{f}({{\mathbf {x}}})=\mathrm {EV}\left[ g({{\mathbf {x}}},\tau )\right] _\tau , \end{aligned}$$
(9)

with

$$\begin{aligned} g({{\mathbf {x}}},\tau )=\sum ^{\mathcal {K}^\star }_{j=1} w_j \Vert {{\mathbf {x}}}-{{\mathbf {c}}}_j\Vert ^\tau , \end{aligned}$$
(10)

where \(w_j\) are unknown coefficients, \(\Vert \cdot \Vert\) is the Euclidean norm, and \({{\mathbf {c}}}_j\) are the RBF centers, whose coordinates are defined via k-means clustering [24] of the training point coordinates in the design space. The uncertainty \(U_{\tilde{f}}({{\mathbf {x}}})\) associated with the SRBF surrogate model prediction is quantified by the 95%-confidence interval of \(g({{\mathbf {x}}},\tau )\), evaluated using a Monte Carlo sampling over \(\tau\) [46]. Noise reduction is achieved by choosing a number of SRBF centers \(\mathcal {K}\) less than the number of training points \(\mathcal {J}\). Hence, \(w_j\) are determined with least squares regression by solving

$$\begin{aligned} {{\mathbf {w}}}=(\mathbf {A}^\mathsf{T}\mathbf {A})^{-1}\mathbf {A}^\mathsf{T}\mathbf {s} , \end{aligned}$$
(11)

where \({{\mathbf {w}}}=\{w_j\}\), \(a_{ij}=\Vert {{\mathbf {x}}}_i-{{\mathbf {c}}}_j\Vert ^\tau\), and \(\{({{\mathbf {x}}}_i,s({{\mathbf {x}}}_i))\}_{i=1}^{\mathcal {J}} \in \mathcal {T}\). The optimal number of SRBF centers (\(\mathcal {K}^\star\)) is defined by minimizing a leave-one-out cross-validation (LOOCV) metric [14, 22]. Let \(\tilde{h}_{i,\mathcal{K}}({{\mathbf {x}}})\) be a surrogate model trained by all points but the i-th point using \(\mathcal {K}\) centers, then \(\mathcal {K}^\star\) is defined as:

$$\begin{aligned} \mathcal {K}^\star = \mathop {argmin}\limits _{{\mathcal {K}}} [\mathrm {RMSE}({\mathcal {K}})], \end{aligned}$$
(12)

where the root mean squared error (RMSE) is defined as

$$\begin{aligned} \mathrm {RMSE}(\mathcal{K}) = \sqrt{\dfrac{1}{\mathcal {J}}\sum _{i=1}^{\mathcal {J}} \left( s({{\mathbf {x}}}_i) - \tilde{h}_{i,\mathcal{K}}(\mathbf{x}_i)\right) ^2 }. \end{aligned}$$
(13)

To avoid abrupt changes in the surrogate model prediction from one iteration to the next one, during the active learning procedure the search for \(\mathcal {K}^\star\) can be constrained. In the present work, \(\mathcal {K}_{k-1}^\star -2< \mathcal {K}_k^\star < \mathcal {K}_{k-1}^\star +2\), with k the active learning iteration. An example with \(\mathcal {J}=6\) and \(\mathcal {K}^\star =3\) is shown in Fig. 3.

Fig. 3
figure 3

Example of least squares regression by SRBF

The function which gives the lowest EV for the LOOCV metric is the exact objective function \(f({{\mathbf {x}}})\). Since f and \(\tilde{h}\) are deterministic functions, the EV of an error-squared term in the RMSE measure can be expanded as

$$\begin{aligned} \begin{aligned}&\mathrm {EV}\left[ \left( s({{\mathbf {x}}}_i) - \tilde{h}({{\mathbf {x}}}_i)\right) ^2 \right] = \\&\mathrm {EV}\left[ \left( s({{\mathbf {x}}}_i) - f({{\mathbf {x}}}_i) + f({{\mathbf {x}}}_i) - \tilde{h}({{\mathbf {x}}}_i)\right) ^2 \right] = \\&= \mathrm {EV}\left[ \left( s({{\mathbf {x}}}_i) - f({{\mathbf {x}}}_i)\right) ^2 \right] + \\&\,\,\,\,\,\,\,\,\, 2 \mathrm{EV}\left[ \left( s({{\mathbf {x}}}_i) - f({{\mathbf {x}}}_i)\right) \right] \left( f({{\mathbf {x}}}_i) - \tilde{h}({{\mathbf {x}}}_i)\right) + \\&\,\,\,\,\,\,\,\,\, \left( f({{\mathbf {x}}}_i) - \tilde{h}({{\mathbf {x}}}_i)\right) ^2 . \end{aligned} \end{aligned}$$
(14)

Since the noise has a zero mean, the EV in the middle term vanishes. Thus, the overall expected value for the error is minimized if \(\tilde{h}({{\mathbf {x}}}_i) = f({{\mathbf {x}}}_i)\). Therefore, the LOOCV criterion in Eq. 12 is a suitable measure of the quality of a surrogate model function. It should be noted that the proposed method does not need any assumption about the noise distribution. The method is expected to run robustly even in the event of noise with no Gaussian distribution nor non-zero mean.

3 Optimization problems

Table 1 Analytical benchmark problems
Fig. 4
figure 4

Analytical test problem \(P_1\)

The assessment of the multi-fidelity surrogate model is based on four analytical tests and three CFD-based design optimization problems, with design space dimensions D ranging from 1 to 10. Problems are solved with a number of fidelity levels N ranging from 1 to 4. These are reasonable numbers in multi-fidelity shape optimization, where several grid resolution levels and/or physical models may be considered.

The initial training set for each problem is based on a face-centered central composite design (CCF) without factorial points [2], i.e., the domain center and the centers of the domain boundaries. Details are provided in the following subsections.

A deterministic single-objective formulation of the particle swarm optimization algorithm [39], is used for the surrogate based optimizations, as well as for the solution of the minimization sampling problem of Eq. 8. The active learning is performed with a fixed budget of function evaluations: considering a normalized computational cost of a highest-fidelity evaluation (equal to 1), the overall computational cost CC is proportional to the training set sizes \(\mathcal {J}_l\) and is defined as:

$$\begin{aligned} CC = \sum _{l=1}^N \beta _{l} \mathcal {J}_l. \end{aligned}$$
(15)
Fig. 5
figure 5

Analytical test problems: from top to bottom \(P_2\), \(P_3\), and \(P_4\) with \(D=2\) (without noise); from left to right \(f_1\), \(f_2\), and \(f_3\) (high-, medium-, and low-fidelity)

Fig. 6
figure 6

NACA 4-digit hydrofoil

3.1 Analytical test problems

The four analytical test problems used to assess the MF method [25] are an MF version of the Forrester (P1), Griewank (P2), Rosenbrock (P3), and shifted-rotated Rastrigin (P4) functions. Table 1 summarizes the test problem equations, domain size, dimensions, reference global optimum position \(\varvec{\check{\mathbf{x}}}\) and value \({f}(\varvec{\check{\mathbf{x}}})\), and finally the bibliographical reference. Most problems have \(N=3\).

Artificial noise is added to the analytical problems. For each fidelity l a normal distributed noise \(\eta _l \in \mathcal {N}(0,\sigma _l)\) is defined, where \(\sigma _l=\{0.025, 0.05, 0.10\}R_1\), with \(R_1\) the function range of the highest-fidelityFootnote 1. Figure 4 shows an example of the artificial noise applied to the analytical function \(P_1\) as semi-transparent scatter. The choice of testing problems with a noise variance that decreases as the fidelity increases is based on what usually happens with the numerical solvers used in this paper (ISIS-CFD and \(\chi\)-navis), when applied to ship hydrodynamics problems. However, in other problems the noise variance could be distributed differently across the different fidelity levels. Nevertheless, since the proposed approach does not make any assumption about how the noise variance is distributed across fidelity levels, it is expected that the method would run robustly even in the case where high-fidelity evaluations are affected by the greatest noise variance. In the latter case, the proper identification of the noise variance would be certainly computationally expensive, as many high-fidelity evaluations would be needed.

The analytical test problems are selected to be representative of complex real-world problems and challenging from the optimization viewpoint. They are characterized by several local minima and adjacent regions of strong gradients and flatness, as shown in Figs. 4 and 5. Furthermore, the different fidelity levels are modeled to never have a coincident global minimum among them. Finally, \(P_3\) and \(P_4\) are arbitrarily scalable in the number of dimensions, and \(P_4\) also in the number of fidelities.

Fig. 7
figure 7

NACA hydrofoil computational grids (G) for ISIS-CFD; from left to right G1 (12.8 k cells), G2 (5.7 k cells), and G3 (3.6 k cells)

3.2 NACA hydrofoil

This problem addresses the drag coefficient minimization of a NACA four-digit airfoil. The following minimization problem is solved

$$\begin{aligned} \begin{array}{rll} \mathrm {minimize} &{} f({{\mathbf {x}}})=C_D (\mathbf {x}) \\ \mathrm {subject \, to} &{} C_L(\mathbf{x}) = 0.6\\ \mathrm {and \, to} &{} \mathbf {l}\le \mathbf {x} \le \mathbf {u}, \end{array} \end{aligned}$$
(16)

where \(\mathbf {x}\) is the design variable vector, \(C_D\) and \(C_L\) are respectively the drag and lift coefficient. The equality constraint on the lift coefficient is necessary to compare different geometries at the same lift force (equal to the weight of the object), since the drag depends strongly on the lift.

The simulation conditions are: velocity \(U=10\) m/s, chord \(c=1\) m, and fluid density \(\rho =1,026\) \(\mathrm kg/m^3\), with a chord based Reynolds number \(\mathrm {Re}=8.41 \cdot 10^6\).

The hydrofoil shape (see Fig. 6) is defined by the general equation for four-digit NACA foils [28]. The upper (\(y_u\)) and lower (\(y_l\)) hydrofoil surfaces are computed as

$$\begin{aligned} \left\{ \begin{array}{l} \displaystyle \xi _u = \xi - y_t \sin \theta \\ \xi _l = \xi + y_t \sin \theta \\ y_u = y_c + y_t \cos \theta \\ y_l = y_c - y_t \cos \theta \end{array} \right. \end{aligned}$$
(17)

with \(\theta =\arctan ( d y_c / d \xi )\) and

$$\begin{aligned} y_c= \left\{ \begin{array}{lc} \displaystyle \frac{m}{p^2}\left[ 2p\frac{\xi }{c}-\left( \frac{\xi }{c} \right) ^2 \right] , &{} 0\le \xi < pc \\ \dfrac{m}{(1-p)^2}\left[ (1-2p)+2p\dfrac{\xi }{c}-\left( \dfrac{\xi }{c} \right) ^2 \right] , &{} pc\le \xi \le c \end{array} \right. \end{aligned}$$
(18)

where \(\xi\) is the position along the chord, c the chord length, \(y_c\) the mean camber line, p the location of the maximum camber, m the maximum camber value, t the maximum thickness, and \(y_t\) the half thickness:

$$\begin{aligned} \begin{aligned} y_t =&5t (0.2969\sqrt{\xi } - 0.1260\xi +\\&-0.3516\xi ^2 + 0.2843\xi ^3 -0.1015\xi ^4 ). \end{aligned} \end{aligned}$$
(19)

In this work, three design spaces are defined. For \(D=1\), \(\mathbf {x}=\{m\}\) with \(m\in [0.025,0.065]\); the thickness and maximum camber position are fixed at \(t=0.030\) and \(p=0.4\). For \(D=2\) the design variables vector is defined as \(\mathbf {x}=\{t,m\}\) with \(t\in [0.030,0.120]\) and m, p like for \(D=1\). Finally, for \(D=3\) the design variables are \(\mathbf {x}=\{p,t,m\}\) with \(p\in [0.25,0.70]\) and m and t as per \(D=2\).

Tests are run with one, two, and three fidelity levels (\(N=1,2,3\)). The optimization budget is fixed at \(CC=45\) for all design spaces.

Fig. 8
figure 8

The different steps of the ISIS-CFD grid creation for the DTMB 5415 optimization

3.3 DTMB 5415 model

The SDD problem concerns the hull-shape optimization of the DTMB 5415 destroyer considering the free water surface. The shape of the DTMB 5415 destroyer is optimized for minimal resistance (\(R_T\)). The optimization problem reads

$$\begin{aligned} \begin{array}{rll} \mathrm {minimize} &{} f({{\mathbf {x}}})=R_T ({{\mathbf {x}}}) \\ \mathrm {subject \, to} &{} L_{pp}({{\mathbf {x}}}) = L_{pp,0}\\ \mathrm {and \, to} &{} \mathbf {l}\le {{\mathbf {x}}}\le \mathbf {u}, \end{array} \end{aligned}$$
(20)

where \(L_{pp,0}=5.72\) m (model scale) is the original length between perpendiculars. The ship is at even keel, with Froude number \(\mathrm {Fr}=0.30\) and \(\mathrm {Re}=1.18 \cdot 10^7\). The \(L_{pp}\) constraint is automatically satisfied by the shape modification method.

The modified geometries (\(\mathbf {g}\)) are produced by the linear superposition of D orthonormal basis functions (\(\varvec{\psi }\)) on the original geometry (\(\mathbf {g}_0\)), as follows

$$\begin{aligned} \mathbf {g}(\varvec{\xi },\mathbf {x}) = \mathbf {g}_0(\varvec{\xi }) + \varvec{\delta }(\varvec{\xi },\mathbf {x}), \end{aligned}$$
(21)

with

$$\begin{aligned} \varvec{\delta }(\varvec{\xi },\mathbf {x})=\sum _{k=1}^D x_k\varvec{\psi }_k(\varvec{\xi }), \end{aligned}$$
(22)

where \(\varvec{\xi }\) are the geometry Cartesian coordinates, whereas \(-1.25\le \{x_k\}_{k=1}^D\le 1.25\) and \(\{\varvec{\psi _k}\}_{k=1}^D\) are the reduced design variables and the eigenfunctions, respectively, provided by the design-space augmented dimensionality reduction procedure described in [36]. Details about the original design space definition can be found in [38]. In this work, two design variables are used.

The optimization is performed with \(N=1\), 2, and 3 fidelity levels. For the initial sample plan (only for this problem), CFD simulations for all fidelities were run in the center of the domain, and with each design variable at either \(-1\) or \(+1\).

3.4 RoPax ferry

The optimization of the RoPax ferry pertains to the minimization of the resistance over the ship displacement (\(\nabla\)):

$$\begin{aligned} \begin{array}{rll} \mathrm {minimize} &{} f({{\mathbf {x}}})=R_T({{\mathbf {x}}})/\nabla ({{\mathbf {x}}}) \\ \mathrm {subject \, to} &{} \mathbf {l}\le {{\mathbf {x}}}\le \mathbf {u}. \end{array} \end{aligned}$$
(23)

The design variable vector is defined as two geometrical parameters of the aftship: \({{\mathbf {x}}}=\{\mathrm {ABL},\mathrm {DF}\}\), with the aft-body length \(\mathrm {ABL}\in [0.3,0.61315]\) and the draught factor \(\mathrm {DF}\in [0.8,1.2]\), respectively. The original ship hull coordinates are in the domain center.

The analysis is performed for a straight-ahead advancement, with the ship at even keel condition. The operational speed is 19 kn (at full scale). Computations are performed at model scale (scale factor \(\lambda =27.14\)), with \(\mathrm {Fr}=0.245\) and \(\mathrm {Re}=1.017 \cdot 10^7\), which corresponds to a water density \(\rho =998.2\) kg/m\(^3\), kinematic viscosity \(\nu =1.105\cdot 10^{-6}\) m\(^2\)/s, and gravitational acceleration \(g=9.81\) m/s. Free water surface effects are considered.

The parametric geometry of the RoPax is realized with the computer-aided design environment integrated into the CAESES software by FRIENDSHIP SYSTEMS AG. The deformation of the hull surface is obtained by imposing the design variable values into the parametric model in CAESES. A surface grid of the RoPax ferry (i.e. the grid discretizing the hull surface) provides the displacement of the nodes on the hull surface.

The next step is the interpolation of the deformation vector from the surface grid to the volume grid. This is done in two steps: (1) the deformation of the hull surface is interpolated from the CAESES surface grid onto the patches on the hull surface of the hydrodynamic volume grid (the interpolation is performed using a system of RBFs); (2) the deformation of the hull surface is propagated in the volume grid (vertices are moved along the coordinate lines normal to the surface, with the displacement of the nodes decaying with the distance).

The optimization is performed with \(N=4\) fidelity levels.

Fig. 9
figure 9

RoPax ferry grids (bow detail): from left to right G1, G2, G3, and G4

4 CFD solvers

CFD simulations for the NACA hydrofoil and the DTMB 5415 are performed with the URANS solver ISIS-CFD developed at ECN – CNRS [32], available in the FINE/Marine computing suite from Cadence Design Systems. The hydrodynamics performance of the RoPax is assessed by the URANS code \(\chi\)navis developed at CNR-INM [5, 8, 9].

4.1 ISIS-CFD

ISIS-CFD is an incompressible unstructured finite-volume solver for multi-fluid flow. The velocity field is obtained from the momentum conservation equations and the pressure field is extracted from the mass conservation constraint transformed into a pressure equation. These equations are similar to the Rhie and Chow SIMPLE method [33] but have been adapted for flows with discontinuous density fields. Free-surface flow is simulated with a conservation equation for the volume fraction of water, discretized with specific compressive discretization schemes. The method features sophisticated turbulence models, such as an anisotropic EASM model and DES models.

The unstructured discretization is face-based. While all unknown state variables are cell-centered, the systems of equations used in the implicit time stepping procedure are constructed face by face. Therefore, cells with an arbitrary number of arbitrarily-shaped constitutive faces are accepted. The code is fully parallel using the message passing interface (MPI) protocol. A detailed description of the solver is given by [32]. Information on the interface-capturing scheme can also be found in [49].

Computational grids are created through adaptive grid refinement [47, 48], to optimize the efficiency of the solver and to simplify the automatic creation of suitable grids. The adaptive grid refinement method adjusts the computational grid locally, during the computation, by dividing the cells of an original coarse grid. The decision where to refine comes from a refinement criterion, a tensor field \(\mathcal {C}(x,y,z)\) computed from the flow. The tensor is based on the water surface position and on second derivatives of pressure and velocity, which gives a crude indication of the local truncation errors. The grid is refined until the dimensions \(\mathbf {d}_{p,j}\) (\(j=1,2,3\)) of each hexahedral cell p satisfy

$$\begin{aligned} \Vert \mathcal {C}_p \mathbf {d}_{p,j} \Vert = T_r. \end{aligned}$$
(24)

The refinement criterion based on the second derivatives of the flow is not very sensitive to grid refinement [48], so the cell sizes everywhere are proportional to the constant threshold \(T_r\).

For the MF optimization, grid adaptation is used to take into account the need for several fidelities. The interest of this procedure is that different fidelity results can be obtained by running the same simulations and simply changing the threshold \(T_r\). Thus, it is straightforward to automate the MF simulations.

4.2 \(\chi\)navis

\(\chi\)navis is a general purpose URANS solver based on a finite-volume discretization, with variables co-located at the cell centers. Turbulent stresses are taken into account by the Boussinesq hypothesis; several turbulence models (both algebraic and differential) are implemented. The free surface is taken into account through a single-phase level set algorithm [5, 8].

To treat complex geometries or bodies in relative motion, the numerical algorithm is discretized on a block-structured grid with partial overlap, possibly in relative motion [10, 53]. This approach makes domain discretization and quality control of the computational grid much easier than with similar discretization techniques implemented on structured grids with adjacent blocks. Unlike standard multi-block approaches, grid connections and overlaps are not trivial and have to be calculated in the pre-processing phase. The coarse/fine grain parallelization of the RANS code is obtained by distributing the structured blocks among available distributed and/or shared memory processors (nodes); shared memory capability (threads) is used mainly for do-loop parallelization. Pre-processing tools, which allow an automatic subdivision of structured blocks and their distribution among the processors, are used for load balancing. The communication between the processors for the coarse grain parallelization is obtained using the standard MPI library, whereas the fine grain parallelization (shared memory) is achieved through the open message passing library. The efficiency of the parallel code has been examined in earlier research, showing satisfactory results in terms of acceleration for different test cases [6].

The solver uses a full multi grid—full approximation scheme (FMG–FAS), with an arbitrary number of grid levels. In the FMG–FAS approximation procedure, the solution is computed on the coarsest grid level first. Then, it is approximated on the next finer grid and the solution is iterated by exploiting all the coarser grid levels available with a V-Cycle. The process is repeated up to the finest grid level. Thus, the procedure provides multi-fidelity data without any additional computational cost. More details on the code implementation and applications can be found in [5, 8, 9, 15].

5 Problem setups

The setups for each CFD-based design optimization problem are described in the following subsections.

5.1 NACA hydrofoil

The computational domain runs from 11c in front of the leading edge to 16c behind the hydrofoil and from \(-10\) to 10c vertically. Dirichlet conditions on the velocity are imposed, except on the outflow side which has an imposed pressure. The hydrofoil surface is treated with a wall law and \(y^+=60\) for the first layer. Turbulence is modeled with the standard \(k-\omega\) SST model [27]. To obtain the same lift for all geometries (see Eq. 16), the angle of incidence \(\alpha\) for the hydrofoil is adjusted dynamically during the simulations.

Up to three fidelity levels are used. The initial computational grid has 2654 cells, the refinement threshold value \(T_r\) is set equal to 0.1, 0.2, and 0.4 from highest- to lowest-fidelity. This results in a cell size ratio of 4 : 1 between the refined fine and coarse grids. The final grids (G) have about 12.8 k, 5.7 k, and 3.7 k cells, respectively (see Fig. 7). Highest- to lowest-fidelity simulations require about 17, 9, and 5 minutes, respectively, of wall-clock time to converge. The resulting computational cost ratios are about \(\beta _2=0.5\) and \(\beta _3=0.3\).

5.2 DTMB 5415 model

Simulations of the DTMB 5415 are performed on half geometries. The domain runs from \(1.5L_{pp}\) in front of the bow to \(3L_{pp}\) behind the stern, up to \(2L_{pp}\) laterally, and from \(-1.5L_{pp}\) to \(0.5L_{pp}\) vertically. Dirichlet conditions on the velocity are imposed on the inflow and side faces, pressure is imposed on the top, bottom, and outflow side. The hull is treated with a wall law and \(y^+=60\) for the first layer. Turbulence is modeled with \(k-\omega\) SST.

The grids for the simulation of different geometries are obtained through grid deformation. Each simulation starts from the same original grid (see Fig. 8a). The grid is divided in layers around the hull. For each geometry \(\mathbf {g}(\varvec{\xi },{{\mathbf {x}}})\), the displacement of the hull faces with respect to \(\mathbf {g}_0(\varvec{\xi })\) is propagated through these layers [13]. The displacements are multiplied with a weighting factor which goes from 1 on the hull to 0 on the outer boundaries, so that the latter are not deformed (see Fig. 8b). The original grid is coarse, since deforming these is easier and safer than for fine grids. The final grid, including all the refinement at the free surface, is created using adaptive refinement (see Fig. 8c).

The initial grid has 130 k cells. The thresholds for the simulations with different fidelities are \(T_r=0.0145\), 0.0072, and 0.0036 from coarse to fine. This implies a 4 : 1 cell size ratio between the coarsest and finest grids and results in approximately 240 k, 860 k, and 3.4 M cells respectively. On a 20-core workstation, the computations take about 1.2 h, 4 h, and 19 h each. The resulting computational cost ratios are about \(\beta _2=0.21\) and \(\beta _3=0.06\).

5.3 RoPax ferry

The computational grid is composed of 54 blocks, for a total of about 5.4 M cells for the finest grid (only half of the domain is discretized); the domain extends to 2\(L_{pp}\) in front of the hull, 3\(L_{pp}\) behind, and 1.5\(L_{pp}\) on the side; a depth of 2\(L_{pp}\) is imposed.

The Spalart–Allmaras turbulence model is used [43]. Wall-functions are not adopted, therefore \(y^+\le 1\) is ensured at the wall. On solid walls, the velocity is set equal to zero and a zero normal gradient is enforced on the pressure field; at the (fictitious) inflow boundary, the velocity is set to the undisturbed flow value and the pressure is extrapolated from the inside; the dynamic pressure is set to zero at the outflow, whereas the velocity is extrapolated from inner points. On the top boundary, which remains always in the air region, fluid dynamic quantities are extrapolated from inside.

Four grid levels are used (see Fig. 9, from coarser to finer: G4, G3, G2, and G1), each obtained from the next finer grid with a refinement ratio equal to 2, resulting in \(\beta _2=0.125\), \(\beta _3=0.0156\), and \(\beta _4=0.002\).

Fig. 10
figure 10

Analytical test problem \(P_1\): from left to right \(E_x\), \(E_f\), and \(E_t\)

6 Numerical results

The optimization results are assessed by three error metrics [39]. Knowing the position of the global optimum \(\check{\mathbf {x}}\), these metrics characterize the normalized error in the design space, the objective function space, and Euclidean distance in the normalized \(\mathbf {x}-f(\mathbf {x})\) hyperspace, respectively:

$$\begin{aligned} E_x&\equiv \frac{\Vert \mathbf {x}^{\star }-\check{\mathbf {x}}\Vert }{\sqrt{N}} , \end{aligned}$$
(25)
$$\begin{aligned} E_f&\equiv \dfrac{f({\mathbf {x}}^{\star }) - f(\check{\mathbf {x}})}{R_1}, \end{aligned}$$
(26)
$$\begin{aligned} E_t&\equiv \sqrt{\frac{E_x^2+E_f^2}{2}}, \end{aligned}$$
(27)

where \(\mathbf {x}\) is the array of the design variables (normalized to a unit hypercube), \(\mathbf {x}^\star\) is the location of the optimum of the approximation to f, and \(R_1\) is the range of the highest-fidelity level computed considering the initial training set. The error metrics \(E_x\) and \(E_f\) evaluate design and goal accuracy, whereas the aggregate metric \(E_t\) evaluates accuracy balancing the performance quantification of the method when optima are in very flat or very peaky portions of the design space. It may be noted that Eq. 26 uses an evaluation of the objective with the highest fidelity level at the point \(\mathbf {x}^\star\) identified by the surrogate as the global optimum.

In the absence of a reference optimum (as often occurs in SDD problems), a different set of design-sensitive metrics are employed. These metrics quantify design point location and objective function, respectively:

$$\begin{aligned} \Delta _x&\equiv \dfrac{\Vert \mathbf {x}^\star -\mathbf {x}_0\Vert }{\sqrt{N}} \end{aligned}$$
(28)
$$\begin{aligned} \Delta _f&\equiv \dfrac{f({\mathbf {x}}^{\star }) - f({\mathbf {x}}_0)}{f({\mathbf {x}}_0)}, \end{aligned}$$
(29)

where \(\mathbf {x}_0\) is the original objective function value, meaning that \(\Delta _x\) evaluate the distance of the global optimum position from the original design in the design variable space, whereas \(\Delta _f\) provides the objective function variation with respect to the parent design. Additionally, the prediction error is used to quantify the error of the surrogate model in predicting the minimum value:

$$\begin{aligned} E_p = \dfrac{\hat{f}({\mathbf {x}}^{\star }) - f({\mathbf {x}}^{\star })}{R_1}, \end{aligned}$$
(30)

Whenever different fidelities are tested, the single fidelity surrogate model is based on the highest-fidelity level available, the two-fidelity surrogate model is based on the highest- and the lowest-fidelities, whereas intermediate-fidelity levels are added for three- and higher-fidelity surrogate models.

Fig. 11
figure 11

Analytical test problem \(P_2\), from left to right \(E_x\), \(E_f\), and \(E_t\)

Fig. 12
figure 12

Analytical test problem \(P_3\), from left to right \(E_x\), \(E_f\), and \(E_t\), from top to bottom \(D=2, 5, 10\)

Fig. 13
figure 13

Analytical test problem \(P_4\), from left to right \(E_f\), \(E_x\), and \(E_t\), from top to bottom \(D=2, 5, 10\)

Table 2 Analytical test problems, summary of the results

6.1 Analytical test problems

The computational cost of the analytical test problems is negligible, therefore an artificial computational cost is defined as \(\beta _1=1\), \(\beta _2=0.2\), and \(\beta _3=0.1\). The performance of the method is assessed using \(N=1, 2, 3\) fidelity levels. A computational budget equal to \(40+5D\) is used. Since the noise is synthetically added to the analytical functions by a numerical generator of random numbers, a statistical analysis [16] is performed varying the seed of the random number generator. For each problem, problem size, and number of fidelities 50 repetitions are performed. The statistics of the three metrics (for the maximum computational cost) are reported in Table 2 and discussed using box plots. The box plot shows the \(q_1\), \(q_2\) (median), and \(q_3\) quartiles, while the lower and upper whiskers extend to the most extreme data points not considered outliers. Defining the inter-quartile range (IQR) as \(\mathrm {IQR}= q_3 - q_1\), the outliers are those values greater than \(q_3 + 1.5\mathrm{IQR}\) or less than \(q_1 - 1.5\mathrm{IQR}\).

Figure 10 shows the results for the problem \(P_1\). For all the metrics the lowest median is achieved with \(N=3\). Furthermore, as the number of fidelities increases the IQR decreases. Finally, with \(N=3\) the median is always closer to \(q_1\) than to \(q_3\).

For the problem \(P_2\) (see Fig. 11), overall the MF (\(N=2, 3\)) surrogate models achieve better results than the HF (\(N=1\)) surrogate. Specifically, the median of the MF surrogates is lower than the HF surrogate and also the IQR is smaller. Finally, the results for \(N=2\) and \(N=3\) are very close, with \(N=3\) achieving slightly better results than \(N=2\), see Table 2.

Fig. 14
figure 14

NACA hydrofoil, surrogate models and associated prediction uncertainty for \(D=1\); from left to right \(N=1,2,3\)

Figure 12 shows the metrics for the \(P_3\) problem with \(D=2, 5, 10\) (from top to bottom). For \(D=2\) the \(N=1, 2\) surrogates have negligible IQR, this is due to the fact that they use many high-fidelity function evaluations (namely 50 and 45, respectively) and that the noise for the high-fidelity function is small. Therefore, the active learning method is only slightly disturbed by the training set noise and performs an almost equal sampling for each repetition. Differently, when \(N=3\) a non-negligible IQR is shown since the lowest-fidelity function has a smaller range and the active learning method is, therefore, more affected by the training set noise. For \(D=2\) the HF (\(N=1\)) surrogate achieves the best performance. For \(D=5\) the IQR is significant for all the surrogates, in particular, \(N=3\) shows the largest IQR but also the minimum median value for all the metrics. Finally, for \(D=10\) the MF surrogate with \(N=3\) shows the smaller median value for \(E_x\) and \(E_t\) and the smaller IQR for all the metrics. The values of \(E_f\) do not reflect the magnitude of the position error \(E_x\) because of the peculiar shape of the Rosenbrock function, which is characterized by a long and flat valley extending from the neighborhood of the global minimum.

Finally, Fig. 13 shows the error metrics for the \(P_4\) problem with \(D=2, 5, 10\) (from top to bottom, respectively). For \(D=2\) and \(D=5\) the MF surrogate with \(N=2\) achieves the lowest \(E_t\) error, whereas the smallest IQR is achieved by \(N=1,3\) surrogates for \(D=2,5\), respectively. For \(D=10\) the MF (\(N=2,3\)) surrogates achieve the smallest \(E_t\) values, where \(N=3\) shows a larger IQR than \(N=2\).

Table 2 summarizes the median of the surrogate models performance for the analytical problems. Overall, the MF (\(N=2,3\)) surrogate models outperform the HF (\(N=1\)) surrogate model especially when \(D>2\). Furthermore, the MF surrogate model with \(N=3\) outperforms the MF surrogate model with \(N=2\) in most of the cases.

6.2 NACA hydrofoil

Fig. 15
figure 15

NACA hydrofoil, surrogate models (top) and associated prediction uncertainty (bottom) for \(D=2\): from left to right \(N=1,2,3\)

For this problem a reference global minimum from a high-fidelity optimization is available [31], which is \(\check{\mathbf{x}}=[0.3776, 0.3333, 0.0000]\) with \(f(\check{\mathbf{x}})=7.2116E-3\).

Figures 14 and 15 show the global surrogate model prediction with \(D=1,2\), respectively, at the final iteration of the active learning procedure. The active learning method is able to identify the global minimum region for all cases (\(N=1,2,3\)). For \(D=1\), using a single-fidelity surrogate model (see Fig. 14, left) the noise in the CFD outputs is negligible. Differently, the use of two fidelities (see Fig. 14, center) introduces a significant amount of noise (due to the low-fidelity CFD outputs), negatively affecting the MF prediction. Finally, with an intermediate-fidelity level (see Fig. 14, right) the noise is still present, but it is filtered out more effectively by the addition of a less noisy intermediate fidelity model. The improvement of the MF surrogate model prediction when adding fidelities can be associated with the different number of optimal RBF centers (see Eq. 12) used for the lowest fidelity: \(\mathcal {K}^*=20\) and 13, for \(N=2\) and 3, respectively. A similar behavior is found for \(D=2\) (see Fig. 15).

Table 3 NACA hydrofoil optimization problem, summary of the results

Table 3 summarizes the results for the NACA hydrofoil optimization problem. The error metrics for \(D=1,2,3\) are comparable. Although the lowest errors are generally achieved by the HF (\(N=1\)) surrogate model, the MF (\(N=2,3\)) surrogate models achieve very similar values although the numerical noise in the MF training sets is significantly higher than in the HF training set. This shows that the proposed approach is effective in filtering out the numerical noise. Furthermore, \(N=3\) always outperforms \(N=2\).

Fig. 16
figure 16

NACA hydrofoil, drag coefficient as a function of t for \(c=0.0475\) and \(p=0.475\). The drawn lines are adapted grids (ADR, G3=3.6 k, G2=5.4 k, G1=11.8 k, and G0=36.3 k cells), the dashed lines are Hexpress-only grids (HEX, G3=14.4 k, G2=24.5 k, G1=37.4 k, and G0=52.8 k cells)

Fig. 17
figure 17

DTMB 5415, surrogate models prediction (top) and associated uncertainty (bottom); from left to right \(N=1, 2, 3\)

Table 4 DTMB 5415 and RoPax SDD problems, summary of the results

An analysis of the noise behavior in the simulations is given in Fig. 16, which shows the evolution of the drag for a systematic variation of the foil thickness over a small range. The drag on adaptively refined grids is compared with a systematic series of grids created directly by the Hexpress grid generator. These results confirm that for this case, the noise is mostly due to the adaptive refinement procedure, since the grids without adaptation produce a much smoother behavior. These oscillations in the forces are related to small changes in the topology of the adapted grids (i.e. cells, especially in the boundary layers and at the leading edge, which are either refined or not depending on small changes in the hydrofoil geometry). Also, the dynamic adjustment of the foil angle of attack depends, within the tolerances of the algorithm, on the history of the forces. Thus, if the history of the grid refinement is different, the converged angle of attack may vary, even if the final grid topology is the same. This makes it difficult to identify a single cause for the noise.

However, the noise is proportional to the numerical errors of the simulations: it is most pronounced on the coarsest grid and disappears rapidly as the adapted grids become finer. In addition, based on a rough estimation of the grid convergence for the two series, the adapted grids produce similar accuracy as non-adapted grids with four to five times more cells, thanks to the excellent capturing of the flow around the leading edge on the adapted grids (see Fig. 8), that is essential to obtain the right forces for a lifting hydrofoil. Thus, the main benefit of the adaptive refinement here is a gain in efficiency for the CFD simulation.

Finally, even in the Hexpress-only grids, jumps in the drag can be observed when the topology of the grids changes. However, these jumps are less frequent than for the adaptively refined grids and thus, much harder to predict. It may actually be preferable to have noise everywhere, which can be filtered with a procedure like the one described in this paper, rather than apparently smooth CFD results but with some unpredictable local jumps.

Fig. 18
figure 18

DTMB 5415, comparison of hull shapes

Fig. 19
figure 19

DTMB 5415, mass-fraction on the hull-surface; original (top) and optimized (bottom), along with a stern detail (right)

6.3 DTMB 5415 model

Fig. 20
figure 20

DTMB 5415, comparison of the original (top) and optimized (bottom) hull shapes; hull-surface pressure (left) and wave elevation pattern (right)

For this problem, a reference solution is not available, therefore only the Eqs. 28 and 29 are used as metrics.

Figure 17 shows the surrogate models with \(N=1, 2, 3\) and the associated prediction uncertainty for the DTMB 5415 optimization problem. The MF method with \(N=3\) identifies two minimum regions in the neighborhood of (0.15, 0.75) and (0.65, 0.2). However, an inspection of the actual highest-fidelity data reveals that the first optimum may be a numerical artifact of the surrogate model. Nevertheless, the active learning procedure correctly ignores the upper-right and lower-left corners, where the prediction uncertainty is high, but the objective function value is high too. The optimization results are summarized in Table 4. It may be noted that for \(N=2\) not all the available budget is used since the active learning process was prematurely terminated. The numerical noise of the low-fidelity evaluations was forcing the sampling to cluster samples in previously sampled areas without improving the surrogate model, which made further simulations useless. For \(N=1\) the lowest prediction error is achieved, this is expected since the high-fidelity evaluations are less affected by numerical noise. However, for the MF with \(N=3\), the introduction of an intermediate fidelity level has significantly reduced the prediction error of the surrogate model. Finally, the largest reduction of the total resistance is achieved with \(N=3\). The optimal hull shape is compared to the original in Fig. 18, whereas Fig. 19 shows the original and optimized wetted area. It is worth noting that the optimized hull has a completely dry stern.

Figure 20 (left) compares the pressure distribution along the optimal and the original hull surfaces. The optimized hull has a stronger pressure gradient along the hull, but the low pressure zone is mostly perpendicular to the flow direction, so it has little influence on the drag. Figure 20 (right) shows the wave elevation of the original and optimized hull, which indicates the main reason for the resistance reduction. The optimal geometry has a bulge behind the stern which creates a second bow wave, out of phase with the first one. The two waves cancel, which produces a flattened free-surface in comparison with the original hull. This indicates that the single-speed optimum shape is dependent on the Froude number Fr: since the wavelengths change with Fr, they do not cancel in off-design conditions.

Fig. 21
figure 21

DTMB 5415, resistance sensitivity to \(x_2\) variation (\(x_1=0\))

Fig. 22
figure 22

DTMB 5415, bow wave: from top to bottom \(x_2=-0.85\), \(-0.825\), \(-0.8\), from left to right G3, G2, G1

Fig. 23
figure 23

DTMB 5145, stern wave pattern: from top to bottom \(x_2=-0.85\), \(-0.825\), \(-0.8\), \(-0.8\), from left to right G3, G2, G1

Like for the NACA hydrofoil, a sensitivity analysis is performed to assess the noise in the CFD data for the DTMB 5415. Figure 21 shows the resistance variation for a small change in \(x_2\) at fixed \(x_1=0\) (i.e. close to the optimum), for the three fidelities used, confirming the noise presence in the CFD data. As for the hydrofoil problem, there is less noise for the medium grid than for the coarse grid. In the fine-grid solutions, the noise appears to be even less. However, the curve has a discontinuous slope at \(x_2=-0.825\).

Moreover, for this more complex case, while the noise is still related to topology changes in the adapted grids, it is possible to identify distinct sources for the noise. For the lowest-fidelity data, the noise is mostly associated with modifications in the position and the smearing of the wave system. For example, Fig. 22 (left) shows random fluctuations in the thickness and the position of the volume fraction discontinuity in the first trough and the shoulder wave. Since the amount of wetted surface has a large influence on the drag, these changes in the waves explain the oscillations in the forces. Figure 22 (center and right) shows that this perturbation behaves like the noise for the hydrofoil: the finer the grid, the smaller the oscillations. The stern wave crest in Fig. 23 shows similar oscillations. The gradient change in the fine grid solutions is explained by the stern wave too (see Fig. 23, right): a topology change occurs at \(x_2=-0.825\) close to the stern, where a double wave ridge with a small wetted patch on the transom is replaced by a single, shallower ridge with less wetting. On the coarse grid, this effect is absent (see Fig. 23, left).

The transom is oriented normal to the flow, so even a small change in wetted surface here has a large influence on the resistance. This may imply that the optimum for this case is located around the transition from partially wetted to dry transom flow—just like the optimum for the NACA hydrofoil occurs when the incoming flow is aligned with the camber line. However, the effect likely depends on the choice of the optimization problem: for a higher Froude number, the transom would remain dry for all geometries.

This topology change in the highest-fidelity simulations, which probably occurs in a more or less chaotic way depending on small changes in the mesh, is an explanation of why the LCB sampling requests a large number of highest-fidelity data points close to the optimum. While this may seem wasteful, it is required to filter the noise in the highest-fidelity data, so this behavior indicates that the algorithm adapts itself to the requirements of the data.

6.4 RoPax ferry

Fig. 24
figure 24

RoPax ferry, MF surrogate prediction (top) and associated uncertainty (bottom)

Fig. 25
figure 25

RoPax ferry, original (black lines) and optimized (blue-dashed lines) shapes with the free-surface line (FS)

Fig. 26
figure 26

RoPax ferry, non-dimensional hull-surface pressure and wave patterns. Colors represent non-dimensional pressure levels

As for the DTMB 5415 SDD problem, no reference solution is available for the RoPax SDD one, therefore the surrogate performance is assessed by Eqs. 28 and 29.

Figure 24 shows the MF surrogate model (with \(N=4\)) and the associated prediction uncertainty. The method identifies two minimum regions in the neighborhood of (1.0, 1.0) and (0.9, 0.75). The active learning is strictly focused on the global minimum region and the overall surrogate model prediction uncertainty is low. Table 4 summarizes the results of the SDD procedure. The MF surrogate model provides a prediction error at the minimum close to 10% and an objective function improvement equal to 12.7%.

The original and the optimized hull stations are compared in Fig. 25: the optimized hull is characterized by a reduction of the submergence of the stern region and a less pronounced bulbous bow. As a consequence, surface pressure fields and wave patterns of the ship advancing straight ahead are significantly different in comparison to the original. In particular, the optimized hull shows a dry stern vault, with a reduction of the wetted area as a consequence (see Fig. 26). Furthermore, the wave pattern of the optimized shape highlights a less pronounced wave throat and a stronger rooster tail, as shown in Fig. 27.

Fig. 27
figure 27

RoPax ferry, non-dimensional wave patterns. Colors represent non-dimensional wave elevation

The bow wave of the optimized hull is similar to the original, with the crest for the optimized shape slightly higher and retarded due to the reduced length and the increased height of the bulbous bow (see Fig. 25).

7 Conclusions and future work

A method was presented and discussed based on a generalized multi-fidelity surrogate model and active learning, for design-space exploration and global optimization with a limited budget of function evaluations. The method is able to leverage an arbitrary number of fidelity levels and it is intended to keep a high effectiveness also in presence of training data affected by numerical noise, as often occurs with CFD-based evaluations.

The proposed method is based on stochastic radial basis functions with least squares regression and in-the-loop optimization of the surrogate hyperparameters. The MF approximation is obtained with an additive correction of a low-fidelity trained surrogate with the surrogate of the errors between consecutive fidelities. The surrogate model is dynamically updated by an active learning procedure that automatically queries new samples at a specified fidelity. The sampling method is based on the LCB approach and considers the benefit-cost ratio associated with using different fidelity sources.

The method was tested on a set of four analytical test problems, a NACA hydrofoil optimization, the DTMB 5415 hull-form optimization, and the hull-form optimization of a RoPax ferry. The assessment of the MF surrogate model performance was performed based on its ability to identify the optimum position and the error between the verified predicted minimum and the real global optimum (when available).

Based on the current test cases, the following conclusions can be drawn:

  1. 1.

    The use of an MF surrogate model is more convenient than a surrogate model based on HF only. In the analytical tests the MF surrogate model overall achieves lower errors than the HF surrogate. In most cases, the use of three fidelities (\(N=3\)) provides lower errors than the use of two fidelities. The advantage of using MF surrogate models as opposed to HF surrogates is more significant when the problem dimensionality (D) increases. Indeed, as the design space dimension increases the prediction uncertainty increases at the corners of the design space. This is because the distance of the corners from the initial training points increases with the dimension. With a larger uncertainty at the corners, it is more likely that the active learning method places new samples at the corners. This is expected to favor the multi-fidelity approach over the single-fidelity since the former performs most of the exploration with low-fidelity data. Based on the aggregate metric \(E_t\), the MF surrogate models always outperformed the HF surrogate model for \(D=5\) and \(D=10\). Considering the NACA hydrofoil problem the MF surrogate model with three fidelities achieves lower error values than with two fidelities. The reason for this is that the extra fidelity levels add robustness, providing more reliable noise filtering. Increasing the number of fidelity levels could potentially provide even better performance. Nevertheless, this should be carefully assessed on a case-by-case basis considering the trade-off between accuracy and computational cost associated with additional fidelity levels. Indeed, cheaper models may lead to significant computational cost reduction but may be too inaccurate, thus misleading the MF method. Differently, fidelity levels with higher accuracy may have a quite high-computational cost, thus not increasing significantly the overall efficiency.

  2. 2.

    The active learning process of the MF surrogate allows for a wider exploration of the design space, compared to using the HF only, as SDD problems show. This is because, in general, the lower the fidelity, the higher the associated noise (see e.g. Figs. 16 and 21). Therefore, the surrogate model prediction uncertainty associated with the lowest fidelity is higher and distributed over a wider region of the design space. For this reason, the combination of LCB with the MF method provides a wide exploration of the whole design space (see Figs. 14 and 15).

  3. 3.

    The proposed active learning method tends to cluster training points in the most noisy regions of the design space (see e.g. Figs. 14, 15, and 17). This is due to the use of LCB (see Eq. 8) with a regressive model for noisy data. The prediction uncertainty associated with this model does not vanish at the training points. As a consequence, training point neighborhoods may be identified as promising regions to add new points through Eq. 8. This improves the noise identification in noisy regions, but at the same time it prevents a wider exploration of the design space with more evenly distributed training points. A proper trade-off between noise identification and design space exploration should therefore be carefully addressed.

  4. 4.

    Even if the adaptive grid refinement and the multi-grid resolution methods cannot be directly compared, they perfectly fit the MF method philosophy and formulation. They represent a good example of simulation methods that naturally produce results spanning a range of fidelity levels and therefore represent a natural fit for MF methods.

Future work includes the assessment of alternative active learning approaches through: (a) different definitions of the prediction uncertainty in the presence of noise, to address the question of training points clustering [50]; (b) the definition of dynamic weights for the objective function and prediction uncertainty, whose values are based on the overall and remaining budget of function evaluations, to achieve a better balance of the exploration and exploitation of the region of the minimum during the entire optimization process. Finally, the possibility of selecting the number of RBF centers via clustering metrics, such as the within-cluster sums of squares [20] or the silhouette [34], will be addressed as an alternative to the leave-one-out cross-validation metrics.