Abstract
Surrogate models provide an affordable alternative to the evaluation of expensive deterministic functions. However, the construction of accurate surrogate models with many independent variables is currently prohibitive because they require a large number of function evaluations for the desired accuracy. Gradient-enhanced kriging has the potential to reduce the number of evaluations when efficient gradient computation, such as an adjoint method, is available. However, current gradient-enhanced kriging methods do not scale well with the number of sampling points because of the rapid growth in the size of the correlation matrix, where new information is added for each sampling point in each direction of the design space. Furthermore, they do not scale well with the number of independent variables because of the increase in the number of hyperparameters that must be estimated. To address this issue, we develop a new gradient-enhanced surrogate model approach that drastically reduces the number of hyperparameters through the use of the partial least squares method to maintain accuracy. In addition, this method is able to control the size of the correlation matrix by adding only relevant points defined by the information provided by the partial least squares method. To validate our method, we compare the global accuracy of the proposed method with conventional kriging surrogate models on two analytic functions with up to 100 dimensions, as well as engineering problems of varied complexity with up to 15 dimensions. We show that the proposed method requires fewer sampling points than conventional methods to obtain the desired accuracy, or it provides more accuracy for a fixed budget of sampling points. In some cases, we get models that are over three times more accurate than previously developed surrogate models for the same computational time, and over 3200 times faster than standard gradient-enhanced kriging models for the same accuracy.
Similar content being viewed by others
Abbreviations
- d :
-
Number of dimensions
- B :
-
Hypercube expressed by the product between intervals of each direction space
- n :
-
Number of sampling points
- h :
-
Number of principal components
- \(\mathbf{x}\), \(\mathbf{x}'\) :
-
\(1\times d\) vector
- \(x_j\) :
-
\(j{\text {th}}\) element of \(\mathbf{x}\) for \(j=1,\dots ,d\)
- \(\mathbf{X}\) :
-
\(n\times d\) matrix containing sampling points
- \(\mathbf{y}\) :
-
\(n\times 1\) vector containing simulation of \(\mathbf{X}\)
- \(\mathbf{x}^{(i)}\) :
-
\(i{\text {th}}\) sampling point for \(i=1,\dots ,n\) (\(1\times d\) vector)
- \(y^{(i)}\) :
-
\(i{\text {th}}\) evaluated output point for \(i=1,\dots ,n\)
- \(\mathbf{X}^{(0)}\) :
-
\(\mathbf{X}\)
- \(\mathbf{X}^{(l-1)}\) :
-
Matrix containing residual of the \((l-1){\text {th}}\) inner regression
- \(k(\cdot ,\cdot )\) :
-
Covariance function
- \(\mathbf{r}_{\mathbf{x}\mathbf{x}'}\) :
-
Spatial correlation between \(\mathbf{x}\) and \(\mathbf{x}'\)
- \(\mathbf{R}\) :
-
Covariance matrix
- \(s^2(\mathbf{x})\) :
-
Prediction of the kriging variance
- \(\sigma ^2\) :
-
Process variance
- \(\theta _i\) :
-
ith parameter of the covariance function for \(i=1,\dots ,d\)
- \(Y(\mathbf{x})\) :
-
Gaussian process
- \(\mathbf{1}\) :
-
n-vector of ones
- \(\mathbf{t}_l\) :
-
lth principal component for \(l=1,\dots ,h\)
- \(\mathbf{w}\) :
-
Weight vector for partial least squares
- \(\Delta x_j\) :
-
First-order Taylor approximation step in the \(j \text{th}\) direction
References
Abraham L (2009) pydoe: the experimental design package for python. https://pythonhosted.org/pyDOE/index.html. https://pythonhosted.org/pyDOE/index.html
Alberto PR, González FG (2012) Partial least squares regression on symmetric positive-definite matrices. Revista Colombiana de Estadística 36(1):177–192
An J, Owen A (2001) Quasi-Regression. J Complex 17(4):588–607
Barber D (2012) Bayesian reasoning and machine learning. Cambridge University Press, New York
Bartoli N, Bouhlel MA, Kurek I, Lafage R, Lefebvre T, Morlier J, Priem R, Stilz V, Regis R (2016) Improvement of efficient global optimization with application to aircraft wing design. In: 17th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference, Washington, DC. AIAA-2016-4001
Bouhlel MA, Bartoli N, Morlier J, Otsmane A (2016a) An improved approach for estimating the hyperparameters of the Kriging model for high-dimensional problems through the partial least squares method. Math Probl En 2016: 6723410. https://doi.org/10.1155/2016/6723410
Bouhlel MA, Bartoli N, Otsmane A, Morlier J (2016b) Improving kriging surrogates of high-dimensional design models by pLeast squares dimension reduction. Struct Multidisc Optim 53(5):935–952. ISSN 1615-1488
Bouhlel MA, Bartoli N, Regis RG, Otsmane A, Morlier J (2018) Efficient global optimization for high-dimensional constrained problems by using the kriging models combined with the partial least squares method. Eng Optim. https://doi.org/10.1080/0305215X.2017.1419344
Box G, Hunter J, Hunter W (2005) Statistics for experimenters: design, innovation, and discovery. Wiley-Interscience, Wiley Series in Probability and Statistics. https://books.google.ca/books?id=oYUpAQAAMAAJ
Choi S, Chung H, Alonso J (2004) Design of Low-Boom Supersonic Business Jet With Evolutionary Algorithms Using Adaptive Unstructured Mesh. In 45th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics and Materials Conference. Palm Springs, California. AIAA-2004-1758
Chung HS, Alonso J (2002) Design of a Low-Boom Supersonic Business Jet Using Cokriging Approximation Models. 9th AIAA/ISSMO Symposium on Multidisciplinary Analysis and Optimization, Multidisciplinary Analysis Optimization Conferences. AIAA-2002-5598
Cressie N (1988) Spatial prediction and ordinary kriging. Math Geol 20(4):405–421
Deb K (2000) An efficient constraint handling method for genetic algorithms. Comput Methods Appl Mech Eng 186:311–338
Forrester AIJ, Sóbester A, Keane AJ (2008) Engineering design via surrogate modeling: a practical guide. Wiley, New York
Frank IE, Friedman JH (1993) A statistical view of some chemometrics regression tools. Technometrics 35:109–148
Haftka R, Villanueva D, Chaudhuri A (2016) Parallel surrogate-assisted global optimization with expensive functions—a survey. Struct Multidiscipl Optim 54:3–13
Helland IS (1988) On the structure of partial least squares regression. Commun Stat Simul Comput 17:581–607
Jeong S, Murayama M, Yamamoto K (2005) Efficient optimization design method using Kriging model. J Aircr 42(2):413–420
Jones DR (2001) A taxonomy of global optimization methods based on response surfaces. J Global Optim 21(4):345–383
Jones DR, Schonlau M, Welch WJ (1998) Efficient global optimization of expensive black-box functions. J Global Optim 13(4):455–492
Kenway GKW, Kennedy GJ, Martins JRRA (2014) Scalable parallel approach for high-fidelity steady-state aeroelastic analysis and derivative computations. AIAA J 52(5):935–951. https://doi.org/10.2514/1.J052255
Kleijnen J, Van Beers W, Van Nieuwenhuyse I (2010) Constrained optimization in expensive simulation: novel approach. Eur J Oper Res 202(1):164–174
Kleijnen J, Beers W, Nieuwenhuyse I (2012) Expected improvement in efficient global optimization through bootstrapped kriging. J Global Optim 54(1):59–73
Kleijnen JPC (2015) Design and analysis of simulation experiments, vol 230. Springer, New York
Kleijnen JPC (2017) Regression and kriging metamodels with their experimental designs in simulation: a review. Eur J Oper Res 256(1):1–16. https://doi.org/10.1016/j.ejor.2016.06.04
Krige DG (1951) A statistical approach to some basic mine valuation problems on the witwatersrand. J Chem Metall Mining Soc 52:119–139
Laurenceau J, Sagaut P (2008) Building efficient response surfaces of aerodynamic functions with kriging and cokriging. AIAA J 46(2):498–507
Lewis RM (1998) Using sensitivity information in the construction of Kriging models for design optimization. AIAA-98-4799. 7th AIAA/USAF/NASA/ISSMO Symposium on Multidisciplinary Analysis and Optimization, Multidisciplinary Analysis Optimization Conferences, pp 730–737
Liem RP, Kenway GK, Martins JRRA (2012) Multi-point, multi-mission, high-fidelity aerostructural optimization of a long-range aircraft configuration. In: Proceedings of the 14th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference. Indianapolis. https://doi.org/10.2514/6.2012-5706
Liem RP, Kenway GKW, Martins JRRA (2015a) Multimission aircraft fuel burn minimization via multipoint aerostructural optimization. AIAA J 53(1):104–122. https://doi.org/10.2514/1.J052940.
Liem RP, Mader CA, Martins JRRA (2015b) Surrogate models and mixtures of experts in aerodynamic performance prediction for aircraft mission analysis. Aerosp Sci Technol 43:126–151. https://doi.org/10.1016/j.ast.2015.02.019
Liping W, Don B, Gene W, Mahidhar R (2006) A comparison of metamodeling methods using practical industry requirements. In: Proceedings of the 47th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics, and Materials Conference. Newport, RI
Liu W (2003) Development of gradient-enhanced Kriging approximations for multidisciplinary design optimization. PhD thesis, University of Notre Dame
Loeppky JL, Sacks S, Welch WJ (2009) Choosing the sample size of a computer experiment: a practical guide. Technometrics 51(4):366–376. https://doi.org/10.1198/TECH.2009.08040
Mader CA, Martins JRRA, Alonso JJ, van der Weide E (2008) ADjoint: an approach for the rapid development of discrete adjoint solvers. AIAA J 46(4):863–873. https://doi.org/10.2514/1.29123
Mardia KV, Watkins AJ (1989) On multimodality of the likelihood in the spatial linear model. Biometrika 76(2):289. https://doi.org/10.1093/biomet/76.2.289
Martins JRRA, Hwang JT (2013) Review and unification of methods for computing derivatives of multidisciplinary computational models. AIAA J 51(11):2582–2599. https://doi.org/10.2514/1.J052184
Matheron G (1963) Principles of geostatistics. Econ Geol 58(8):1246–1266
Morris MD, Mitchell TJ, Ylvisaker D (1993) Bayesian design and analysis of computer experiments: use of derivatives in surface prediction. Technometrics 35(3):243–255
Ollar J, Mortished C, Jones R, Sienz J, Toropov V (2017) Gradient based hyper-parameter optimisation for well conditioned Kriging metamodels. Struct Multidiscipl Optim 55:2029–2044
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Rettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830
Pironneau O (1974) On optimum design in fluid mechanics. J Fluid Mech 64(1):97–110
Rasmussen CE, Williams CKI (2006) Gaussian processes for machine learning. Adaptive computation and machine learning. MIT Press, Cambridge
Sacks J, Schiller SB, Welch WJ (1989a) Designs for computer experiments. Technometrics 31(1):41–47
Sacks J, Welch WJ, Mitchell WJ, Wynn HP (1989b) Design and analysis of computer experiments. Stat Sci 4(4):409–435
Sakata S, Ashida F, Zako M (2003) Structural optimization using Kriging approximation. Comput Methods Appl Mech Eng 192(417):923–939
Simpson TW, Mauery TM, Korte JJ, Mistree F (2001a) Kriging models for global approximation in simulation-based multidisciplinary design optimization. AIAA J 39(12):2233–2241
Simpson TW, Poplinski JD, Koch PN, Allen JK (2001b) Metamodels for computer-based engineering design: survey and recommendations. Eng Comput 17(2):129–150
Tenenhaus M (1998) La Régression PLS: Théorie et Pratique. Éd, Technip
Toal DJJ, Bressloff NW, Keane AJ (2008) Geometric filtration using POD for aerodynamic design optimization. In: 26th AIAA Applied Aerodynamics Conference. http://uos-app00353-si.soton.ac.uk/59225/
Ulaganathan S, Couckuyt I, Dhaene T, Laermans E, Degroote J (2014) On the use of gradients in Kriging surrogate models. In: Proceedings of the 2014 Winter Simulation Conference, Savannah, GA, USA, December 7–10, pp 2692–2701. https://doi.org/10.1109/WSC.2014.7020113
Viana FAC, Simpson TW, Balabanov V, Toropov V (2014) Metamodeling in multidisciplinary design optimization: how far have we really come? AIAA J 52:670–690. https://doi.org/10.2514/1.J052375
Welch WJ, Buck RJ, Sacks J, Wynn HP, Mitchell TJ, Morris MD (1992) Screening, predicting, and computer experiments. Technometrics 34(1):15–25
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Definition of the engineering cases
The analytical expressions for the engineering cases are as follows.
1.1 \(\text{A.1}\;\text {P}_1,\text { P}_2, \text { and P}_3\)
The responses are the deflection \(\delta\), bending stress \(\sigma\), and shear stress \(\tau\), respectively, of a welded beam problem [13].
where
The table below gives the ranges of the input variables.
Input variable | Range |
---|---|
h | [0.125, 1] |
b | [0.1, 1] |
l, t | [5, 10] |
1.2 \(\text{A.2}\;\text {P}_4\)
This problem characterizes the flow of water through a borehole that is drilled from the ground surface through two aquifers [39]. The water flow rate (m\(^3\)/yr) is given by
The table below gives the ranges of the input variables.
Input variable | Range | Input variable | Range |
---|---|---|---|
\(r_w\) | [0.05, 0.15] | r | [100, 50000] |
\(T_u\) | [63070, 115600] | \(H_u\) | [990, 1110] |
\(T_l\) | [63.1, 116] | \(H_l\) | [700, 820] |
L | [1120, 1680] | \(K_w\) | [9855, 12045] |
1.3 \(\text{A.3}\;\text {P}_5\)
This function represents the position of a robot arm [3]:
The table below gives the ranges of the input variables.
Input variable | Range |
---|---|
\(L_i\) | [0, 1] |
\(\theta _j\) | \([0,2\pi ]\) |
1.4 \(\text{A.4}\;\text {P}_6\)
This is an estimate of the weight of a light aircraft wing [14]:
The table below gives the ranges of the input variables.
Input variable | Range | Input variable | Range |
---|---|---|---|
\(S_w\) | [150, 200] | \(W_{fw}\) | [220, 300] |
A | [6, 10] | \(\Lambda\) | \([-10,10]\) |
q | [16, 45] | \(\lambda\) | [0.5, 1] |
tc | [0.08, 0.18] | \(N_z\) | [2.5, 6] |
\(W_{dg}\) | [1700, 2500] | \(W_p\) | [0.025, 0.08] |
1.5 \(\text{A.5}\;\text {P}_7\text { and P}_8\)
These are the weight and lowest natural frequency of a torsion vibration problem [32]:
where
and
The table below gives the ranges of the input variables.
Input variable | Range | Input variable | Range |
---|---|---|---|
\(d_1\) | [1.8, 2.2] | \(L_1\) | [9, 11] |
\(G_1\) | [105300000, 128700000] | \(\lambda _1\) | [0.252, 0.308] |
\(d_2\) | [1.638, 2.002] | \(L_2\) | [10.8, 13.2] |
\(G_2\) | [5580000, 6820000] | \(\lambda _2\) | [0.144, 0.176] |
\(d_3\) | [2.025, 2.475] | \(L_3\) | [7.2, 8.8] |
\(G_3\) | [3510000, 4290000] | \(\lambda _3\) | [0.09, 0.11] |
\(D_1\) | [10.8, 13.2] | \(t_1\) | [2.7, 3.3] |
\(\rho _1\) | [0.252, 0.308] | \(D_2\) | [12.6, 15.4] |
\(t_2\) | [3.6, 4.4] | \(\rho _1\) | [0.09, 0.11] |
Appendix B: Results for the analytic cases
Appendix C: Results for the engineering cases
Rights and permissions
About this article
Cite this article
Bouhlel, M.A., Martins, J.R.R.A. Gradient-enhanced kriging for high-dimensional problems. Engineering with Computers 35, 157–173 (2019). https://doi.org/10.1007/s00366-018-0590-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00366-018-0590-x