1 Introduction

Recent advances in high-performance computing have enabled aerodynamic engineers to use high-fidelity analyses, offering a wide range of options in the aerodynamic design process. Accordingly, numerous novel design methodologies have been developed, most of which are based on two conventional aerodynamic design methods: inverse and direct design approaches [1,2,3]. In particular, inverse design is computationally efficient in that the desired target performance distribution is already defined and the corresponding design shape can be calculated with a few iterations coupled with a flow solver [4,5,6].

However, the inverse method also has a critical disadvantage: whenever the target distribution changes, the iterative process of finding the design shape that matches the target distribution must be repeated. Considering that most design stages require significant trial and error, this undermines the efficiency of inverse design. Therefore, several researchers have used surrogate models in inverse design to avoid this iterative process. In particular, artificial neural network (ANN) surrogate models have been widely used owing to their universal approximation property [7]. Kharal and Saleem [8] and Sun et al. [9] used aerodynamic quantities of interest (QoI) as the inputs of an ANN model to obtain airfoil shape parameters as the output in an inverse design procedure. Wang et al. [10] also applied an ANN for the same purpose, but additionally performed dimensionality reduction of the input data to reduce the database size required for model training.

Though these studies do not require iterative procedures coupled with the flow solver, they still require a predefined performance distribution. For an efficient inverse design, an appropriate target performance distribution must be specified, which depends heavily on the designer’s engineering knowledge and experience. This ambiguity in specifying the target distribution has inspired researchers to optimize it. Obayashi and Takanashi [11] and Kim and Rho [12] used control-point-based techniques to parameterize the pressure distributions, and then optimized the distribution using a genetic algorithm (GA). In these inverse design optimization processes, the aerodynamic QoI of the distribution were obtained through theoretical/empirical predictions, and a number of constraints were imposed to keep the distribution physically realistic. For instance, Obayashi and Takanashi [11] estimated the viscous drag using the Squire–Young empirical formula and imposed six constraints for realistic pressure coefficient (Cp) distributions. Though these studies attempted to address the fundamental limitation of the inverse design method, the following problems remain: (1) loss of the diverse representation capacity of the Cp distribution due to parameterization; (2) excessive constraints to ensure a realistic Cp distribution; (3) discrepancies between the QoI predicted theoretically/empirically and those calculated using a flow solver; and (4) the impossibility of explicitly imposing geometric constraints on the design shape.

To address the limitations of representation capacity, QoI discrepancies, and excessive constraints, Zhu et al. [13] reduced the dimension of the Cp distribution data via proper orthogonal decomposition (POD) and used a support vector regression (SVR) model to predict the aerodynamic performance of the airfoil from the reduced Cp data. A GA was then implemented to optimize the Cp distribution coupled with POD and SVR. Finally, the airfoil shape corresponding to the optimized pressure distribution was obtained in an iterative manner coupled with the flow solver. However, the inability to impose geometric constraints was not addressed, since the prediction of the design shape is separated from the pressure optimization process. Therefore, when the shape predicted in the inverse design process violates geometric constraints, the design process must be traced back to the optimization process. Additionally, at the optimum solution, the discrepancy between the predicted and calculated pressure distributions is noticeable, which indicates the low accuracy of that framework.

The drawbacks of the inverse design studies reviewed so far prevent computational efficiency, the essential advantage of inverse design, from being fully exploited; novel techniques are therefore required to overcome them. Accordingly, this study proposes an inverse design optimization framework with a two-step deep learning approach, which refers to the sequential coupling of two deep learning models: a variational autoencoder (VAE) [14] and a multi-layer perceptron (MLP) [15]. The VAE and MLP were used to generate a realistic target distribution and to predict the QoI and shape parameters from the generated distribution, respectively. Target distribution optimization was then performed as the inverse design optimization based on this approach. Active learning and transfer learning strategies were applied to improve the accuracy of the two-step approach-based optimization at a reasonable computational cost. The proposed framework was validated through aerodynamic shape optimization problems for the airfoil of a wind turbine blade, an application in which inverse design is actively used.

This paper is organized as follows. Section 2 describes the mathematical background of two deep learning models used in the inverse design optimization framework. Section 3 presents the scheme of the proposed framework. In Sect. 4, validation of the framework with application to a wind turbine airfoil is performed and the results are discussed. Finally, Sect. 5 concludes the study, emphasizing the flexibility of the presented framework.

2 Methodology

2.1 Multi-layer perceptron (MLP)

ANNs have been widely used in recent engineering design studies because they can successfully learn complex nonlinear correlations between inputs and outputs [16, 17]. Furthermore, they are known for their efficiency in training high-dimensional problems with massive amounts of data. Among the various ANN models, the MLP is the most common structure as it can learn various nonlinear physical phenomena using hidden layers; according to the universal approximation theorem, a neural network with a single hidden layer and finitely many nodes can approximate any continuous function [7]. In the inverse design optimization framework investigated in this study, an MLP model with these advantages was used as a regression surrogate model. This section presents a brief background of this model.

The MLP model passes information received from the input layer to the output layer through the feed-forward process. Information transfer between each pair of layers is performed through an affine transformation using weights and biases, as given in Eq. (1), where \(x\) is the vector of nodes at the current layer, \(W\) and \(b\) are the weight matrix and bias vector between the current layer and the next layer, respectively, and \(y\) is the vector of nodes at the next layer. Because this transformation is linear, nonlinearity cannot be modeled no matter how many hidden layers are stacked.

$$y=Wx+b.$$
(1)

Modeling of nonlinearity is enabled by imposing activation functions on each layer. Typical examples are the sigmoid, tanh, ReLU, and Leaky ReLU functions. The ReLU function has been used in numerous studies [18,19,20] because it suffers less from saturation than the sigmoid and tanh functions and does not require expensive exponential operations. However, the ReLU function has the disadvantage of being non-zero-centered; therefore, the Leaky ReLU function is implemented as follows [21], where \(a\) allows a small, non-zero gradient and is set to 0.01 in this study:

$$f\left(x\right)=\begin{cases}x & \text{if } x>0\\ ax & \text{otherwise}\end{cases}.$$
(2)

The MLP model applies an activation function to \(y\) in Eq. (1) and uses the result as an input for the next layer to learn nonlinear behavior. This process is repeated until the data of the input layer reach the output layer. However, the final output will differ considerably from the training data, because feed-forward alone involves no training of the weights and biases. Accordingly, the weights and biases are trained based on the gradient of the loss function using backpropagation; they are updated so that the loss function of the model decreases. Feed-forward (which transfers information from input to output) and backpropagation (which updates the model parameters) are repeated iteratively using gradient descent optimization techniques. Several optimization methods can be used, such as Adagrad [22], RMSprop [23], and Adam [24]. Adam has been widely used as it combines the strengths of Adagrad and RMSprop; its advantages are the ability to deal with sparse gradients and non-stationary objectives [24]. When the loss function decreases to the desired level, the optimization process ends and the final MLP model with fixed weights and biases serves as a surrogate model (prediction can be performed almost in real time). Many papers adopting the MLP model have been published in various engineering fields; therefore, only the fundamental contents are presented in this paper. For more details on the MLP, refer to Ref. [15].
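To make these steps concrete, a minimal sketch of an MLP regressor in PyTorch is given below; the layer sizes, data, and training settings are illustrative placeholders, not the values used in this study.

```python
import torch
import torch.nn as nn

# Minimal MLP regressor: affine layers (Eq. 1) interleaved with Leaky ReLU (Eq. 2).
model = nn.Sequential(
    nn.Linear(8, 64),       # y = Wx + b
    nn.LeakyReLU(0.01),     # f(x) = x if x > 0, else 0.01x
    nn.Linear(64, 64),
    nn.LeakyReLU(0.01),
    nn.Linear(64, 2),       # output layer (no activation for regression)
)

loss_fn = nn.MSELoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

x = torch.randn(100, 8)     # illustrative inputs (100 samples, 8 features)
y = torch.randn(100, 2)     # illustrative targets

for epoch in range(1000):
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)   # feed-forward pass and loss evaluation
    loss.backward()               # backpropagation of the loss gradient
    optimizer.step()              # Adam update of the weights and biases
```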

2.2 Variational autoencoder (VAE)

A generative model learns how real-world data are generated [25]. In particular, a deep generative model uses neural networks with hidden layers to learn the underlying features of arbitrary data. The generative adversarial network (GAN) [26] and the VAE [14] are typical examples of widely used deep generative models. Both models can be regarded as dimensionality reduction techniques that learn a compact low-dimensional latent space from high-dimensional original data. Thus, the trained model can be used as a data generator in that it generates high-dimensional output data when it receives a low-dimensional latent variable. However, a GAN has difficulty generating continuous physical data such as pressure distributions and airfoil shapes, so auxiliary filters or layers must be used. For example, Chen et al. [27, 28] added a Bezier layer or a free-form-deformation layer at the end of the generator to enforce the continuity of the airfoil generated by the GAN. In addition, Achour et al. [29] applied the Savitzky–Golay filter to the generated airfoil for the same reason. However, when user-defined layers or filters are added to the neural network structure, additional hyperparameters are introduced, increasing the engineer’s effort to train the model. Furthermore, GANs are known to be quite difficult to train owing to non-convergence, mode collapse, and vanishing gradients [30]. Therefore, in this study, a VAE was used as the data generator for the inverse design optimization: it is easy to train and does not require an extra post-processing technique to ensure continuity of the generated data. It will be shown later that the VAE used for this purpose was trained to generate smooth target distribution data for the inverse design.

The mathematical details of the VAE model are presented below. Note that this section is mainly based on Ref. [14].

Consider a dataset \(X ={\{{x}_{i}\}}_{i=1}^{N}\), which consists of \(N\) independent and identically distributed samples of a variable \(x\), and let \(z\) be the unknown latent variable that generates \(x\). When generating \(x\) from \(z\), variational inference is applied because the posterior \({p}_{\theta }(z|x)\) is intractable, owing to the intractability of the marginal likelihood \({p}_{\theta }\left(x\right)\), where \(\theta \) is a generative model parameter. Here, the approximation function \({q}_{\phi }(z|x)\) is used for the sampling instead of the intractable posterior \({p}_{\theta }\left(z|x\right)\), where \(\phi \) is a variational parameter. Using this approximation, the marginal likelihood \({p}_{\theta }(x)\) can be expressed as

$$\begin{aligned}\text{log}\left({p}_{\theta }\left(x\right)\right)& = \int \text{log}\left({p}_{\theta }\left(x\right)\right){q}_{\phi }\left(z|x\right)dz \left(\because \int {q}_{\phi }\left(z|x\right)dz=1\right) \\ & = \int \text{log}\left(\frac{{p}_{\theta }\left(x,z\right)}{{p}_{\theta }\left(z|x\right)}\right){q}_{\phi }\left(z|x\right)dz \left(\because {p}_{\theta }\left(x\right)=\frac{{p}_{\theta }\left(x,z\right)}{{p}_{\theta }\left(z|x\right)}\right)\\ &= \int \text{log}\left(\frac{{p}_{\theta }\left(x,z\right)}{{q}_{\phi }\left(z|x\right)}\cdot \frac{{q}_{\phi }\left(z|x\right)}{{p}_{\theta }\left(z|x\right)}\right){q}_{\phi }\left(z|x\right)dz \\ &= \int \text{log}\left(\frac{{p}_{\theta }(x,z)}{{q}_{\phi }\left(z|x\right)}\right){q}_{\phi }\left(z|x\right)dz+\int \text{log}\left(\frac{{q}_{\phi }\left(z|x\right)}{{p}_{\theta }\left(z|x\right)}\right){q}_{\phi }\left(z|x\right)dz.\end{aligned}$$
(3)

The second term in the last line is the Kullback–Leibler (KL) divergence of \({q}_{\phi }\left(z|x\right)\) from \({p}_{\theta }\left(z|x\right)\), \(\text{KL}({q}_{\phi }\left(z|x\right)||{p}_{\theta }\left(z|x\right))\), which cannot be computed directly. However, because this term is always non-negative by definition, the first term in the last line is a lower bound on the log marginal likelihood, known as the evidence lower bound (ELBO). Therefore, maximization of the marginal likelihood can be replaced by maximization of this lower bound. The lower bound term can be decomposed as follows:

$$\begin{aligned}\int \text{log}\left(\frac{{p}_{\theta }(x,z)}{{q}_{\phi }\left(z|x\right)}\right){q}_{\phi }\left(z|x\right)dz & = \int \text{log}\left(\frac{{p}_{\theta }(x|z){p}_{\theta }(z)}{{q}_{\phi }\left(z|x\right)}\right){q}_{\phi }\left(z|x\right)dz \left(\because {p}_{\theta }\left(x,z\right)={p}_{\theta }\left(x|z\right){p}_{\theta }\left(z\right)\right)\\ &= \int \text{log}\left({p}_{\theta }\left(x|z\right)\right){q}_{\phi }\left(z|x\right)dz-\int \text{log}\left(\frac{{q}_{\phi }\left(z|x\right)}{{p}_{\theta }\left(z\right)}\right){q}_{\phi }\left(z|x\right)dz\\&= {\mathbb{E}}_{{q}_{\phi }(z|x)}\left[\text{log}\left({p}_{\theta }\left(x|z\right)\right)\right]-\int \text{log}\left(\frac{{q}_{\phi }\left(z|x\right)}{{p}_{\theta }\left(z\right)}\right){q}_{\phi }\left(z|x\right)dz.\end{aligned}$$
(4)

Therefore, maximizing the marginal likelihood \({p}_{\theta }(x)\) amounts to maximizing the last line of Eq. (4), where the first term is the reconstruction error of \(x\) through the approximate posterior \({q}_{\phi }\left(z|x\right)\) and the likelihood \({p}_{\theta }(x|z)\), and the second term is the KL divergence of \({q}_{\phi }\left(z|x\right)\) from the prior \({p}_{\theta }\left(z\right)\), \(\text{KL}({q}_{\phi }\left(z|x\right)||{p}_{\theta }(z))\). In summary, the VAE model can be trained using a loss function consisting of the reconstruction error of the data \(x\) and the KL divergence of the approximation function from the prior.

In the loss function derived above, the approximation function \({q}_{\phi }\left(z|x\right)\) represents the dimensionality reduction procedure performed by an encoder (from original data to latent variable), and the likelihood \({p}_{\theta }(x|z)\) represents the reconstruction procedure performed by a decoder (from latent variable to original data). Because these mappings are analytically intractable, neural networks with hidden layers (MLPs) are used to realize the encoder and decoder of the VAE. This explains why a VAE has a similar structure to a standard autoencoder (AE) [31]. The main difference, however, lies in the latent variable \(z\). In an AE, the output of the encoder is used deterministically as the latent variable. In contrast, the encoder of the VAE outputs a set of distribution parameters consisting of the mean (\({\mu }_{i}\)) and standard deviation (\({\sigma }_{i}\)) of a Gaussian distribution, and the latent variable is sampled randomly; to sample a \(J\)-dimensional latent variable \(z\), \(J\) sets \(\{{\mu }_{i}, {\sigma }_{i}\}\) are required. However, because gradients cannot be propagated through this random sampling process, model training through backpropagation becomes impossible; therefore, a reparameterization trick is applied (refer to Ref. [32] for more information about this trick). The latent variable \(z\), which is assumed to follow a Gaussian distribution, can be expressed as follows:

$${z}_{i} = {\mu }_{i}+ {\sigma }_{i} \odot \epsilon, \quad \epsilon \sim N\left(0,1\right).$$
(5)

As presented earlier, the loss function for training the VAE model consists of a reconstruction error and the KL divergence of the approximation function from the prior. The reconstruction error can be calculated using the binary cross-entropy between the original data and the reconstructed data from the VAE (reconstructed data refers to data that have passed through the encoder and decoder of the VAE). To calculate the \(\text{KL}({q}_{\phi }\left(z|x\right)||{p}_{\theta }(z))\) term, \({q}_{\phi }\left(z|{x}_{i}\right) \sim N({\mu }_{i},{\sigma }_{i}^{2})\) and \({p}_{\theta }(z) \sim N(0,1)\) are assumed. The KL divergence contribution for a datapoint \({x}_{i}\) then has the closed form below (written as the negative KL divergence, which enters the lower bound to be maximized), where \(J\) is the dimensionality of the latent variable \(z\):

$$-\text{KL}\left({q}_{\phi }\left(z|{x}_{i}\right)||{p}_{\theta }(z)\right)= \frac{1}{2}\sum_{j=1}^{J}\left(1+\text{log}\left({({\sigma }_{i}^{j})}^{2}\right)-{\left({\mu }_{i}^{j}\right)}^{2}-{\left({\sigma }_{i}^{j}\right)}^{2}\right).$$
(6)

Backpropagation with respect to the loss function (composed of reconstruction error and KL divergence) enables the decoder of the VAE to be trained to generate realistic data. The flowchart of the VAE is shown in Fig. 1.
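As a concrete illustration, a minimal VAE in PyTorch is sketched below. The layer sizes are placeholders, and the loss combines a binary cross-entropy reconstruction term with the KL term of Eq. (6) (negated, since the lower bound is maximized by minimizing the loss); the data are assumed scaled to [0, 1] for the cross-entropy.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    """Minimal VAE sketch; the layer sizes are illustrative placeholders."""
    def __init__(self, d_in=64, d_hidden=32, d_latent=2):
        super().__init__()
        self.enc = nn.Linear(d_in, d_hidden)
        self.mu = nn.Linear(d_hidden, d_latent)       # means of q(z|x)
        self.log_var = nn.Linear(d_hidden, d_latent)  # log variances of q(z|x)
        self.dec = nn.Sequential(
            nn.Linear(d_latent, d_hidden), nn.LeakyReLU(0.01),
            nn.Linear(d_hidden, d_in), nn.Sigmoid(),  # data assumed scaled to [0, 1]
        )

    def forward(self, x):
        h = F.leaky_relu(self.enc(x), 0.01)
        mu, log_var = self.mu(h), self.log_var(h)
        eps = torch.randn_like(mu)                    # eps ~ N(0, 1)
        z = mu + torch.exp(0.5 * log_var) * eps       # reparameterization trick, Eq. (5)
        return self.dec(z), mu, log_var

def vae_loss(x, x_rec, mu, log_var):
    rec = F.binary_cross_entropy(x_rec, x, reduction="sum")  # reconstruction error
    # KL(q(z|x) || N(0, I)) = -1/2 * sum(1 + log sigma^2 - mu^2 - sigma^2), cf. Eq. (6)
    kl = -0.5 * torch.sum(1 + log_var - mu.pow(2) - log_var.exp())
    return rec + kl
```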

Fig. 1
figure 1

Flowchart of the variational autoencoder (VAE)

3 Inverse design optimization framework

3.1 Two-step deep learning approach

This section describes the two-step deep learning approach, which connects the VAE and MLP models. First, the VAE was trained on the target performance distributions of the training data. After training, only the decoder part of the VAE model was used; it operates as a data generator that receives a low-dimensional latent variable and outputs a realistic high-dimensional target distribution. Then, the MLP was trained to predict the QoI and shape parameters from the target distribution. This regression is more efficient and accurate than previous inverse design approaches in that it requires neither iteration nor theoretical/empirical assumptions (prediction is performed almost in real time). In this study, these two deep learning models, the decoder of the VAE and the MLP, were utilized sequentially: the decoder generates the distribution once it receives the latent variable, and the MLP outputs the QoI and shape parameters from this generated distribution. This sequential structure is referred to as the two-step deep learning approach, and its flowchart is shown in Fig. 2.
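As a minimal illustration of this coupling, the sketch below chains a decoder and an MLP; both networks are untrained stand-ins, and the 2/199/10 dimensions anticipate the airfoil application of Sect. 4.2.

```python
import torch
import torch.nn as nn

# Illustrative stand-ins for the trained VAE decoder and MLP (dimensions per Sect. 4.2).
decoder = nn.Sequential(nn.Linear(2, 30), nn.LeakyReLU(0.01), nn.Linear(30, 199))
mlp = nn.Sequential(nn.Linear(199, 100), nn.LeakyReLU(0.01), nn.Linear(100, 10))

def two_step(z: torch.Tensor) -> torch.Tensor:
    cp = decoder(z)    # step 1: latent variable -> generated target (Cp) distribution
    return mlp(cp)     # step 2: generated distribution -> QoI and shape parameters

out = two_step(torch.zeros(1, 2))   # 10 outputs: e.g., 4 QoI + 6 shape parameters
```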

Fig. 2
figure 2

Flowchart of the two-step deep learning approach

3.2 Target distribution optimization

The two-step deep learning approach provides a mapping from the latent space of the target performance distribution to the QoI and shape parameters. Therefore, the optimization of the target distribution can be performed based on this approach. The inputs of the approach (latent variables) were used as the optimization variables, and the outputs (QoI and shape parameters) as the objective functions and constraints. In this study, because the shape parameters were incorporated into the target distribution optimization procedure, geometric constraints can be imposed directly on the shape parameters. At the end of the optimization, numerical validation was performed at the optimum solutions by comparing the QoI predicted using the two-step approach with the QoI calculated using the numerical solver. The inverse design optimization framework terminates when the differences in these values satisfy the error criterion. If not, the process described in Sect. 3.3 is repeated until the criterion is satisfied.

3.3 Active learning and transfer learning

Since the VAE and MLP models are trained with initial training data, the optimization based on them is unlikely to meet the desired accuracy at once. Therefore, surrogate-based optimization studies usually add training data repeatedly to increase the accuracy of the surrogate model. This technique is called the active learning strategy, and it was adopted in this study for an accurate inverse design optimization framework [33, 34]. When the error criterion at the optimum solutions is not satisfied (as presented at the end of Sect. 3.2), these solutions are added to the previous dataset and the deep learning models are trained again. When the full dataset is split into training and test data, the designs newly added at each active learning iteration should be placed in the training set, not the test set, because only the training data directly influence model training; this maximizes the efficiency of the active learning process. Optimization and numerical validation are then performed based on the retrained models. This active learning strategy continues until the error between the QoI from the two-step deep learning approach and those from the numerical solver satisfies the predefined error criterion. A minimal sketch of this loop is given below.
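In the sketch, `train_models`, `run_optimization`, `run_solver`, and `rel_error` are hypothetical helpers standing in for the components described in Sects. 3.1 and 3.2, not functions from this study.

```python
# Active learning sketch with an 8:2 train/test split of 500 initial designs.
def active_learning(initial_designs, criterion=0.01):
    train_data = list(initial_designs[:400])    # newly infilled designs go here
    test_data = list(initial_designs[400:])     # held out, never used for training
    models = train_models(train_data, warm_start=None)
    while True:
        optimum = run_optimization(models)      # target distribution optimization (Sect. 3.2)
        qoi_pred = models.predict(optimum)      # QoI from the two-step approach
        qoi_true = run_solver(optimum)          # QoI from the numerical solver
        if rel_error(qoi_pred, qoi_true) < criterion:
            return optimum                      # error criterion satisfied
        train_data.append(optimum)              # infill into the TRAINING set only
        models = train_models(train_data, warm_start=models)  # transfer learning (Sect. 3.3)
```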

Although the active learning process effectively improves the accuracy of the framework, restarting model training from randomly initialized weights and biases at every iteration can severely degrade computational efficiency. Moreover, when new data are added via active learning, the model generally does not change significantly from the previous iteration, as only a small amount of data is added to the existing dataset. Therefore, this study used a parameter-based transfer learning strategy, in which the weights and biases of the previously trained models are transferred to the models to be newly trained [35, 36]. Combining these two strategies, iterative active learning for model accuracy can be performed efficiently through transfer learning. The flowchart of the inverse design optimization framework, which summarizes the contents of Sects. 3.1–3.3, is shown in Fig. 3.

Fig. 3
figure 3

Flowchart of the inverse design optimization framework

4 Framework validation: optimization of the airfoil for wind turbine blade

The proposed inverse design optimization framework can, in principle, be applied to inverse design problems in any engineering field. In aerospace engineering specifically, inverse design is actively applied to wind turbine design [37,38,39]; therefore, to verify the accuracy, effectiveness, and robustness of the framework, airfoil optimization of a megawatt-class wind turbine was chosen as the application. For the airfoil design of a wind turbine blade, structural and aerodynamic performances are the main considerations. In particular, the airfoil at the blade tip is known to be critical for the aerodynamic performance of the blade. This study aims to optimize the airfoil at the tip of the blade, primarily with respect to its aerodynamic properties (structural performance is considered indirectly through the airfoil area). Single-objective and multi-objective optimizations were performed to demonstrate the versatility of the proposed framework across optimization problems. The following sections describe the optimization problems (Sect. 4.1), the architectures of the two-step deep learning models used in the inverse design optimization (Sect. 4.2), and the results and discussion of the single-objective and multi-objective optimizations (Sects. 4.3 and 4.4).

4.1 Optimization of the airfoil of a wind turbine blade

This section presents the optimization problems for the airfoil in the tip region of a wind turbine blade. Section 4.1.1 presents the validation of the numerical flow solver used in this study, and Sect. 4.1.2 presents the problem definitions for the optimization.

4.1.1 Flow solver

Xfoil is a two-dimensional panel code capable of viscous/inviscid analysis, and it produces accurate results in a very short time when used appropriately [40]. Because a wind turbine operates at relatively low Reynolds numbers, numerous wind turbine airfoil design studies have used this solver [41,42,43]. In this study, Xfoil was adopted to calculate the aerodynamic QoI at reasonable computational cost. It should be noted that the proposed inverse design optimization framework can be coupled with any numerical solver and arbitrary QoI.

Although Xfoil is a well-known and widely used solver, we performed solver validation against experimental results [44]. The experimental data are for the GA(W)-1 airfoil at a Reynolds number of \(6.3 \times 10^{6}\), a Mach number of 0.15, and an angle of attack of 8.02° (the flow conditions of the validation were chosen to be similar to those of the subsequent airfoil optimizations). The solver predicts a pressure distribution almost identical to the experimental one (Fig. 4), confirming that it is appropriate for the wind turbine airfoil optimization performed in this study.
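For reference, a batch run of this kind can be scripted by feeding commands to Xfoil's interactive text interface, as sketched below. The sketch assumes an `xfoil` executable on the PATH and a coordinate file `gaw1.dat` (a placeholder name); the command names follow Xfoil's standard menus but may need adjustment for a particular build.

```python
import subprocess

# Drive Xfoil in batch mode through its text interface (a hedged sketch).
commands = "\n".join([
    "LOAD gaw1.dat",     # load airfoil coordinates (placeholder file name)
    "OPER",              # enter the operating-point menu
    "VISC 6.3e6",        # viscous analysis at Re = 6.3e6
    "MACH 0.15",         # freestream Mach number
    "ALFA 8.02",         # angle of attack in degrees
    "CPWR cp_out.dat",   # write the computed Cp distribution to a file
    "",                  # blank line: leave the OPER menu
    "QUIT",
])
subprocess.run(["xfoil"], input=commands, text=True, check=True)
```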

Fig. 4
figure 4

Comparison of pressure distribution from Xfoil and experimental results in Ref. [44] (used airfoil is also visualized)

4.1.2 Optimization problem definitions

Before performing the optimization, the optimization problems should be defined. There are numerous parameterization methods for representing the airfoil shape, such as PARSEC, B-spline, and class-shape transformation (CST) [45,46,47]. In this study, PARSEC parameters were adopted because they were originally introduced for their close relationship with the aerodynamic characteristics [48]. There are 11 PARSEC variables; six of them were used, as shown in Fig. 5, namely \(R_{\mathrm{L.E.}}\) (leading-edge radius), \(X_{\mathrm{up}}\) (x-coordinate of the upper crest), \(Z_{\mathrm{up}}\) (z-coordinate of the upper crest), \(X_{\mathrm{low}}\) (x-coordinate of the lower crest), \(Z_{\mathrm{low}}\) (z-coordinate of the lower crest), and \(Z_{\mathrm{T.E.}}\) (z-coordinate of the trailing edge). These variables were selected from a prior sensitivity test, which demonstrated that they have a significant impact on the flow characteristics, whereas the other five PARSEC variables have little impact. The corresponding multi-dimensional design space explored in the optimization process is summarized in Table 1 (the baseline airfoil shape is defined by the median value of each variable’s range).

Fig. 5
figure 5

Shape parameters for representing the airfoil: six PARSEC parameters

Table 1 Design space of the six airfoil shape parameters: the baseline airfoil is selected as the median value of each range
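For reference, a hedged sketch of the standard PARSEC construction that these parameters feed into is given below: each surface is the polynomial \(z(x) = \sum_{n=1}^{6} a_n x^{n-1/2}\), with the coefficients obtained from a linear system of geometric conditions. The crest curvature and trailing-edge slope are not among the six parameters used in this study, so illustrative fixed values are assumed for them.

```python
import numpy as np

# PARSEC upper surface: z(x) = sum_{n=1..6} a_n x^(n - 1/2); zxx_up and te_slope
# are illustrative assumptions, not parameters used in this study.
def parsec_upper(r_le, x_up, z_up, z_te, zxx_up=-0.8, te_slope=-0.15):
    e = np.arange(1, 7) - 0.5                              # exponents n - 1/2
    A = np.zeros((6, 6))
    b = np.zeros(6)
    A[0, 0] = 1.0;                            b[0] = np.sqrt(2.0 * r_le)  # leading-edge radius
    A[1] = x_up ** e;                         b[1] = z_up                 # crest height
    A[2] = e * x_up ** (e - 1.0);             b[2] = 0.0                  # zero slope at crest
    A[3] = e * (e - 1.0) * x_up ** (e - 2.0); b[3] = zxx_up               # crest curvature
    A[4] = np.ones(6);                        b[4] = z_te                 # trailing-edge height
    A[5] = e;                                 b[5] = te_slope             # trailing-edge slope
    return np.linalg.solve(A, b)                           # coefficients a_1 ... a_6

a = parsec_upper(r_le=0.01, x_up=0.35, z_up=0.08, z_te=0.0)
x = np.linspace(0.0, 1.0, 199)
z_upper = (a * x[:, None] ** (np.arange(1, 7) - 0.5)).sum(axis=1)
```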

When the airfoil shape has been determined from the PARSEC parameters, the flow analysis proceeds. Xfoil is executed under predefined settings (including the flight conditions), and the aerodynamic QoI used as objective functions or constraints in the optimization are obtained. The flight conditions, objective functions, and constraints for the single-objective and multi-objective airfoil optimizations are summarized in Table 2. In the single-objective optimization, the objective is to maximize the lift-to-drag ratio (L/D), the most crucial factor for aerodynamic efficiency. Furthermore, constraints are imposed to exclude undesirable performance [49]: the drag should be less than the baseline value, the pitching moment coefficient at c/4 (c is the chord length) should be greater than a specified value to limit blade torsion, and the area should be at least 90% of the baseline area to prevent serious degradation of the structural performance. Note that the geometric constraint (airfoil area in this case) can be imposed directly in this framework as a QoI in the target distribution optimization process (PARSEC parameters can also be set as geometric constraints, such as \(Z_{\mathrm{up}} < 0.15\), but these were realized by limiting the design space in this study). In the multi-objective optimization, the two objectives are to maximize the L/D ratio and the airfoil area, considering that the area represents the structural performance. The other constraints are the same as those in the single-objective optimization.

Table 2 Flight conditions, objective functions, and constraints for single-objective and multi-objective optimizations

Then, a design of experiments was performed to train the MLP and VAE models; the sampled initial designs were used as the initial training data. Latin hypercube sampling was selected owing to its uniformity in the design space [50]. A total of 500 initial designs were sampled, and the two deep learning models were trained on them. Finally, the optimization proceeds using the latent variables of the VAE as the optimization variables, and the QoI and shape parameters as the objective functions and constraints, as shown in Fig. 3. For single-objective optimization, a GA was adopted owing to its efficient global exploration in discontinuous and multimodal problems [51]. For multi-objective optimization, the non-dominated sorting genetic algorithm-II (NSGA-II) was adopted to obtain diversified Pareto solutions for both objective functions [52]. Both optimization algorithms were implemented using the Python package pymoo [53] (a minimal problem setup is sketched below). After the first optimization with the two-step deep learning models trained on the 500 DoE designs, active learning with the transfer learning strategy was repeated iteratively. Because there is only one optimal solution in the single-objective optimization, that single optimum is selected as the design to be infilled. On the other hand, the multi-objective optimization yields several optimal solutions (Pareto solutions); in this case, the leftmost, middle, and rightmost designs of the Pareto front are selected for infilling to increase the overall accuracy of the Pareto solutions. These infilling criteria can be set by the engineer (the number of points added per iteration and their placement along the Pareto frontier can be chosen as appropriate). Through these iterative procedures, the framework consisting of the two deep learning models eventually satisfies the given error criterion. Again, note that the proposed framework can be applied to any inverse design problem (the design shape, corresponding shape parameters, QoI, numerical solver, and optimizer can be selected arbitrarily).
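The sketch below shows how such a problem can be posed in pymoo; the `two_step` predictor and the baseline values are placeholders, and the `n_ieq_constr` argument follows recent pymoo versions (older releases use `n_constr`).

```python
import numpy as np
from pymoo.algorithms.moo.nsga2 import NSGA2
from pymoo.core.problem import ElementwiseProblem
from pymoo.optimize import minimize

cd_base, cm_min, area_base = 0.012, -0.15, 0.08   # illustrative baseline values

def two_step(z):
    """Placeholder for decoder(z) -> Cp -> MLP -> (L/D, Cd, Cm, area)."""
    return 60.0 + z[0], 0.011, -0.1, 0.08 + 0.01 * z[1]

class InverseDesign(ElementwiseProblem):
    def __init__(self):
        # two latent variables as design variables; the bounds are placeholders
        super().__init__(n_var=2, n_obj=2, n_ieq_constr=3,
                         xl=np.array([-3.0, -3.0]), xu=np.array([3.0, 3.0]))

    def _evaluate(self, z, out, *args, **kwargs):
        ld, cd, cm, area = two_step(z)
        out["F"] = [-ld, -area]               # maximize L/D and area (minimize negatives)
        out["G"] = [cd - cd_base,             # Cd <= baseline drag
                    cm_min - cm,              # Cm >= specified value
                    0.9 * area_base - area]   # area >= 90% of baseline area

res = minimize(InverseDesign(), NSGA2(pop_size=100), ("n_gen", 200), seed=1)
```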

4.2 Architectures of the two-step deep learning models

In the two-step approach, the MLP model was trained to take the Cp distribution as input and to output the QoI and shape parameters. First, all airfoils were discretized to share 199 identical x-coordinates, extracted from the NACA 0012 airfoil using a two-sided hyperbolic tangent distribution function (with the spacing at the leading edge and trailing edge constrained to 0.001c and 0.005c, respectively) [54]. The pressure coefficients at these points were used as the input of the MLP. For the output, six shape parameters (\(R_{\mathrm{L.E.}}\), \(X_{\mathrm{up}}\), \(Z_{\mathrm{up}}\), \(X_{\mathrm{low}}\), \(Z_{\mathrm{low}}\), and \(Z_{\mathrm{T.E.}}\)) and four QoI (L/D, Cd, Cm, and area) were concatenated to form 10 output nodes. The MLP thus has 199 input nodes, 10 output nodes (all inputs and outputs are normalized), and two hidden layers with 100 nodes each; this architecture was found to provide sufficient accuracy for the regression in this problem. Leaky ReLU activation functions were applied to all layers for nonlinearity. Adam was used as the optimizer with the mean square error loss function, and the initial learning rate was 0.001. For the first iteration of active learning, the 500 initial samples were split into training and test data in the ratio 8:2 (the same ratio was used to train the VAE). A total of 30,000 epochs with a mini-batch size of 100 were performed, using a scheduler that multiplies the learning rate by 0.8 every 3000 epochs. Because there were no weight or bias values to use as a reference in the first iteration, the parameters were initialized using He initialization [55]. Then, active and transfer learning were performed. Interestingly, during the MLP training, it was observed that if the parameters of all layers from the previous model were transferred, training terminated without any meaningful change from the previous model. This is because the number of newly added designs is small compared with the existing training data, so their effect on the loss function is negligible. In that case, active learning, which increases accuracy by appending previous optimum solutions to the current training data, becomes meaningless, as their properties are not fully reflected in the current model. Therefore, in this study, the parameters of the last layer of the MLP were initialized using He initialization, whereas those of the other layers were transferred from the previous model (in other words, transfer learning was applied to all layers but the last; see the sketch below). With this transfer learning technique, the required number of epochs for subsequent iterations decreased to 10,000, with the same initial learning rate and scheduler as the first iteration. As a result, training the MLP in the first iteration on a personal computer (Intel Core i7-8700 3.2 GHz with 16 GB 2400 MHz DDR4 RAM) took 508 s, and subsequent iterations took an average of 186 s per iteration using the Python package PyTorch [56]. The much shorter training time of the subsequent iterations emphasizes the effect of transfer learning in this study: with transfer learning, active learning can be performed efficiently.
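A minimal PyTorch sketch of this transfer learning step is given below (the `build_mlp` helper and `prev_model` are illustrative stand-ins for the architecture and the previously trained model).

```python
import torch
import torch.nn as nn

def build_mlp():   # the architecture described above: 199 -> 100 -> 100 -> 10
    return nn.Sequential(nn.Linear(199, 100), nn.LeakyReLU(0.01),
                         nn.Linear(100, 100), nn.LeakyReLU(0.01),
                         nn.Linear(100, 10))

prev_model = build_mlp()   # stands in for the model from the previous iteration

# Transfer all parameters, then re-initialize only the last layer.
new_model = build_mlp()
new_model.load_state_dict(prev_model.state_dict())
last = new_model[-1]                      # the final nn.Linear layer
nn.init.kaiming_normal_(last.weight)      # He initialization of the weights
nn.init.zeros_(last.bias)

optimizer = torch.optim.Adam(new_model.parameters(), lr=1e-3)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=3000, gamma=0.8)
# Calling scheduler.step() once per epoch multiplies the learning rate by 0.8
# every 3000 epochs, matching the schedule described above.
```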

For the VAE model, the 199 Cp values previously used as MLP inputs were used as both inputs and outputs (a VAE's input and output are identical). The 199-dimensional input was reduced to four dimensions by an encoder with hidden layers of 120, 60, and 30 nodes. These four values are the distribution parameters for random sampling in the latent space: two means and two standard deviations of the Gaussian distribution. From these parameters, random sampling was performed to obtain a two-dimensional latent variable. In this random sampling process, to sample a \(J\)-dimensional latent variable \(z\), \(J\) sets \(\{\mu ,\sigma \}\) are required, as described in Sect. 2.2, and \(J\) equals 2 in this study. The two-dimensional latent variable was then reconstructed to 199 dimensions by a decoder with hidden layers of 30, 60, and 120 nodes. The corresponding VAE architecture is shown in Fig. 6 (and sketched in code below). As with the MLP, the Leaky ReLU activation function and the Adam optimizer with the mean square error loss function were used. The initial learning rate was 0.001, and a total of 30,000 epochs were performed with a mini-batch size of 100 and a scheduler multiplying the learning rate by 0.5 every 5000 epochs. In contrast to the MLP, the situation in which training terminates without reflecting the information of newly added designs was hardly observed in the VAE. Therefore, the parameters of all layers from the previous iteration were transferred to the next iteration (in other words, transfer learning was applied to all layers). For subsequent iterations, 10,000 epochs were performed with the same initial learning rate and scheduler as the first iteration. As a result, training the VAE in the first iteration took 627 s, and subsequent iterations took an average of 226 s per iteration (again confirming the efficiency of transfer learning). All the hyperparameters of the MLP and VAE mentioned in this section were applied identically in the single-objective and multi-objective optimizations.
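The encoder/decoder dimensions described above are sketched below; how the four encoder outputs are split into two means and two (log-)variances, and the exact activation placement, are implementation details assumed here.

```python
import torch.nn as nn

# Encoder: 199 -> 120 -> 60 -> 30 -> 4 (2 means + 2 (log-)variances).
encoder = nn.Sequential(
    nn.Linear(199, 120), nn.LeakyReLU(0.01),
    nn.Linear(120, 60),  nn.LeakyReLU(0.01),
    nn.Linear(60, 30),   nn.LeakyReLU(0.01),
    nn.Linear(30, 4),    # 4 distribution parameters for the 2-D latent variable
)

# Decoder: 2 -> 30 -> 60 -> 120 -> 199 (reconstructed Cp distribution).
decoder = nn.Sequential(
    nn.Linear(2, 30),    nn.LeakyReLU(0.01),
    nn.Linear(30, 60),   nn.LeakyReLU(0.01),
    nn.Linear(60, 120),  nn.LeakyReLU(0.01),
    nn.Linear(120, 199), # 199-point Cp distribution
)
```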

Fig. 6
figure 6

Architecture of the VAE

4.3 Single-objective optimization results and discussion

Active learning for the single-objective optimization satisfied the error criterion (the error of the objective function at the optimum solutions should be lower than 1%) after 24 infilling iterations. The convergence history of the optimization with active learning is shown in Fig. 7. The objective function starts at approximately 65 and increases gradually, reaching approximately 72 after 24 iterations; after that, no better optimal point was found. The subsequent analysis of the single-objective optimization results is based on the VAE and MLP trained at iteration 24 (the total learning time for 24 iterations is (508 + 627) + (186 + 226) × 24 = 11,023 s).

Fig. 7
figure 7

Convergence history of single-objective optimization with active learning

The final optimal airfoil shape is shown in Fig. 8, and its QoI (objective function and constraint) values are summarized in Table 3. Its objective function (L/D) has a value of 72.22, a 39% increase over the baseline. However, this value is only a prediction from the MLP model, and it is not certain that the L/D calculated by Xfoil will match it. Therefore, the QoI from Xfoil were compared with the values predicted by the MLP model. It was confirmed that the four QoI values have errors (between prediction and calculation) of less than 1%, and all the imposed constraints were satisfied. Note that the airfoil area satisfies its constraint with little margin (0.6%), whereas the other constraints (drag and pitching moment) were satisfied with some margin. Additionally, numerical validation was performed to verify whether the optimal airfoil has the Cp distribution generated by the VAE model. Figure 9 shows that the Cp distribution generated by the VAE and that calculated using Xfoil are almost indistinguishable. These results validate the accuracy of the MLP model in the two-step approach for single-objective optimization.

Fig. 8
figure 8

Comparison of the baseline and optimum airfoil shape of single-objective optimization

Table 3 Summary of the QoI of the optimum solution
Fig. 9
figure 9

Comparison of the generated and calculated pressure distribution of the optimum airfoil (baseline pressure distribution is also included)

Then, validation of the VAE model was performed. The VAE model reduces the 199-dimensional Cp distribution data to a two-dimensional latent space and reconstructs them as 199-dimensional data. The trained decoder was used as a data generator that receives a two-dimensional latent variable and creates a 199-dimensional Cp distribution. However, if the generated distributions cannot represent the overall training data or have completely different characteristics, the optimization results from this generator become inaccurate and inefficient. Therefore, the Cp distributions generated by the trained VAE decoder were analyzed. Figure 10 shows 50 Cp distributions randomly selected from the 500 initial training data (black lines, Fig. 10a) and 50 Cp distributions randomly generated by the decoder (red lines, Fig. 10b). Two points are confirmed here. First, the data generated by the decoder cover the range of the training data well. Moreover, although no additional technique was applied to smooth the Cp distributions during VAE training, the decoder successfully generated smooth distributions indistinguishable from the training data. This supports the choice of a VAE over a GAN in this framework: the VAE generates sufficiently realistic data (continuous Cp distributions in this case) without auxiliary layers or filters to enforce the realism (here, continuity) of the data. Second, the generated Cp distributions also reveal their dominant features. As the black dashed box indicates, significant shape differences are observed near the suction peak (near the leading edge of the airfoil's upper surface), whereas most differences in the other regions are only slight shifts in the Cp values. Because these dominant features near the suction peak are also observed in the training data (Fig. 10a), it can be concluded that the VAE model successfully learned the characteristics of the training data. In summary, since the data generated by the VAE are well representative of the training data, the trained VAE model performs successfully as a data generator in this framework.

Fig. 10
figure 10

Comparison of 50 randomly selected Cp training data (black lines, a) and 50 generated Cp distributions by the VAE (red lines, b). The black dashed box near the leading-edge of the lower curve indicates clear distinctions between generated distributions

4.4 Multi-objective optimization results and discussion

In Sect. 4.3, it was confirmed that the proposed inverse design optimization framework yields outstanding results for single-objective optimization. In this section, the results of multi-objective optimization are presented to demonstrate the universality of the framework across optimization problems. In the single-objective problem, which uses L/D as the objective function and the airfoil area as a constraint, the area of the optimum solution barely satisfied the imposed constraint (0.6% margin). This indicates that a potential increase in the objective function L/D is suppressed by the area constraint. Therefore, in the multi-objective optimization, these two QoI were set as objective functions to capture their trade-off (aerodynamic versus structural performance). Refer to Table 2 for the problem definition of the multi-objective optimization.

The active learning process of the multi-objective optimization converged after 59 iterations. Accordingly, 677 data points, consisting of the initial 500 points and 177 infilled points, were used (the total learning time for 59 iterations is (508 + 627) + (186 + 226) × 59 = 25,443 s). The resulting Pareto frontier of the two objective functions is shown in Fig. 11. A discontinuity is observed in the middle of the calculated Pareto solutions, which is concluded to be due to the Cd constraint (the Pareto frontier from the optimization without the Cd constraint is smoothly connected; this result is shown in the “Appendix”). Among the Pareto solutions, six designs (A1–A3 and B1–B3) were selected for further analysis, and their airfoil shapes are shown in Fig. 12. Along the Pareto frontier, the airfoil shapes of the six selected designs change sequentially: as the performance emphasis shifts from area to L/D (from A1 to B3), the airfoil thickness gradually decreases. Xfoil was run on these six designs to estimate the error between the QoI from the framework prediction and the solver calculation, as in the single-objective optimization. The results are summarized in Table 4: the errors between the QoI predicted using the framework and those obtained using Xfoil are within a reasonable range. A2 and B1 show relatively large errors because few points were added nearby in the active learning process (the percentage errors of Cm are also large, but this is due to its small magnitude). The accuracy near such points can be increased by adding points close to them (where to infill points along the Pareto front is up to the engineer). The accuracy of the trained MLP model was evaluated by verifying that the six selected designs actually have the Cp distributions generated by the VAE. Figure 13 shows that the MLP is sufficiently accurate in that the Cp distributions calculated using Xfoil and those generated using the VAE are almost indistinguishable, as in the single-objective optimization.

Fig. 11
figure 11

Pareto solutions of multi-objective optimization. The discontinuity in the Pareto solutions is due to Cd constraint violation

Fig. 12
figure 12

Airfoil shape comparison of six selected Pareto solutions

Table 4 Summary of the QoI of six selected Pareto solutions
Fig. 13
figure 13

Comparison of generated and calculated Cp distributions of six selected Pareto solutions

In this framework, the latent variable passes through the VAE decoder and the MLP sequentially to predict the QoI values, and an optimization algorithm is applied to find a latent variable that maximizes/minimizes the QoI. Therefore, for efficient optimization through the two-step deep learning approach, the two models must learn a precise correlation between the two spaces (the latent space and the QoI space). In real engineering problems, however, this can be difficult owing to abrupt changes in the physics, such as shock waves; when the mapping is inaccurate, the efficiency of any optimization based on it is significantly undermined. In this context, this study investigated the mapping between the latent space and the QoI through heatmaps, as shown in Fig. 14: the heatmaps of L/D and area are shown in Fig. 14a and b, respectively. The figure shows that the latent space is mapped continuously to both objective functions, indicating why optimization in this framework could be performed successfully. On the other hand, sharp changes in both objectives are detected where \(z_2 \approx -0.55\) (as indicated by the yellow squares). To investigate the changes occurring in the flowfield across this boundary, a total of 12 points were extracted nearby, as shown at the top of Fig. 14. The nomenclature of these 12 points is summarized in Table 5.
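Such heatmaps can be produced by evaluating the two-step predictor on a grid over the 2-D latent space, as sketched below; here `two_step` is an illustrative stand-in (with an artificial sharp change near \(z_2 = -0.55\)), whereas in practice the trained decoder and MLP would be evaluated at each grid point.

```python
import numpy as np
import matplotlib.pyplot as plt

def two_step(z):
    """Illustrative stand-in for the trained decoder + MLP prediction of L/D."""
    return 60.0 + z[0] - 5.0 * np.tanh(10.0 * (z[1] + 0.55))

# Evaluate the predictor on a grid over the 2-D latent space.
z1, z2 = np.meshgrid(np.linspace(-2, 2, 200), np.linspace(-2, 2, 200))
ld = two_step([z1, z2])                     # vectorized evaluation over the grid

plt.pcolormesh(z1, z2, ld, shading="auto")  # heatmap of the predicted L/D
plt.colorbar(label="L/D")
plt.xlabel("$z_1$"); plt.ylabel("$z_2$")
plt.show()
```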

Fig. 14
figure 14

Heatmaps of two objective functions within the latent space: a L/D and b area. Twelve points were selected to investigate the rapid change at \(z_2 \approx -0.55\) (top), and the latent-space locations of the six selected Pareto solutions are shown in the heatmap of the area (b)

Table 5 Nomenclatures of 12 points extracted to investigate the sharp changes in the QoI heatmaps

To find the reason for the sharp change in the QoI at \(z_2 \approx -0.55\), the Cp distributions generated by the VAE from the latent variables of the 12 selected points are shown in Fig. 15 (the horizontal and vertical axes of the plots represent x/c and Cp, respectively, and are scaled to the same range for comparison). In this figure, local leading-edge suction peaks on the lower curve are absent in the Cp plots for \(z_2 > -0.55\), whereas they appear in the plots for \(z_2 < -0.55\). Therefore, it can be inferred that \(z_2 \approx -0.55\) is the boundary between the presence and absence of the leading-edge suction peak. In this regard, it is known that the sharper the leading edge of an airfoil, the larger the suction peak behind it (as the leading-edge radius decreases, the angle through which the flow must turn increases, thereby increasing the suction required to keep the flow attached to the airfoil surface [57]). Therefore, the trends in the leading-edge radius (\(R_{\mathrm{L.E.}}\)) are depicted in Fig. 16 to verify whether the rapid changes in Cp are accompanied by changes in \(R_{\mathrm{L.E.}}\). Indeed, as \(z_2\) decreases (as the number in the second character of the nomenclature decreases), the leading-edge radius decreases, consistent with the prior knowledge. Moreover, in all cases of L, M, and R, there are noticeable gaps between points 2 and 3. Considering that points 1, 2, 3, and 4 are equally spaced in the \(z_2\)-direction, it can be concluded that a rapid change in the leading-edge radius at the boundary between points 2 and 3 (\(z_2 \approx -0.55\)) leads to a sudden change in the leading-edge suction. Additionally, the six points selected from the Pareto solutions in Fig. 11 were scrutinized in a similar way. In Fig. 13, leading-edge suction peaks are not found in the Cp distributions of A1–A3 but are found in those of B1–B3. Indeed, when the latent variables of these designs are visualized as in Fig. 14b, the two groups (A1–A3 and B1–B3) are separated by the boundary at \(z_2 \approx -0.55\). In summary, the heatmap analysis verifies that the mapping between the latent space and the QoI through the two-step approach is generally continuous, and that this mapping accurately reflects the rapid changes in the QoI that occur frequently in real-world engineering applications. This flexibility of the two-step deep learning approach enables the optimization to be performed efficiently owing to continuous and accurate mapping between the optimization inputs and outputs, highlighting the capability of the proposed inverse design optimization framework.

Fig. 15
figure 15

Cp distributions of 12 selected points in Fig. 14

Fig. 16
figure 16

Trends in the leading-edge radius (\(R_{\mathrm{L.E.}}\)) of the 12 selected points in Fig. 14

5 Conclusions

This study proposed a novel inverse design optimization framework with a two-step deep learning approach, which refers to the consecutive coupling of a VAE and an MLP. The VAE generates a realistic target distribution, and the MLP predicts the QoI and shape parameters from the generated distribution. The target distribution was then optimized based on this two-step approach. To increase the accuracy, active learning was used to retrain the models with newly added designs, and transfer learning was coupled with it to reduce the computational cost of retraining. Together, these techniques increase the accuracy of the framework at a reasonable computational cost. Finally, the limitations of previous inverse design studies are eliminated by the proposed framework as follows:

  1. (1)

    The conventional inverse design process is substituted with the MLP surrogate model so that the iterations coupled with the flow solver are not required.

  2. (2)

    From the target distribution, the MLP surrogate model not only predicts the design shape, but also the QoI. Therefore, theoretical/empirical assumptions are not required for predicting the QoI, and the geometric constraints can be handled directly in the target distribution optimization process.

  3. (3)

    For the inverse design optimization, realistic target performance distributions are generated using a VAE deep generative model so that the loss of the diverse representation capacity due to the parameterization of the distribution is mitigated, and excessive constraints for the realistic distribution are not required.

The proposed framework was validated using two constrained optimization problems: single-objective and multi-objective airfoil optimizations of the tip region of a megawatt-class wind turbine blade. In the single-objective optimization, the prediction accuracy of the trained MLP model and the validity of the trained VAE model for generating realistic data were verified. In the multi-objective results, continuous mapping between the inputs and outputs of the framework was verified, which enables successful optimization through the two-step approach. Furthermore, this mapping was confirmed to accurately reflect the rapid changes in the QoI, which occur frequently in real-world engineering applications. In summary, the results of the optimizations show that the proposed inverse design optimization framework via a two-step deep learning approach is accurate, efficient, and flexible enough to be applied to any other inverse design problem.

Considering that this framework can be coupled with any numerical solver, design shape, and QoI, it can be easily applied and extended to various engineering fields. Moreover, the deep learning models in the two-step approach can be replaced by suitable alternatives: the VAE by any data generator model and the MLP by any other regression model. For instance, the MLP, a widely used but relatively simple model, was used in this study because the investigated problem is not complex enough to require more advanced deep learning models; for more complicated problems, models such as convolutional or recurrent neural networks can be used instead. This flexibility contributes to the versatility of the framework by allowing it to be used with any model suitable for the engineering problem to which it is applied.