Gradient-enhanced kriging for high-dimensional problems

Bouhlel, Mohamed A.; Martins, Joaquim R. R. A.

doi:10.1007/s00366-018-0590-x

Gradient-enhanced kriging for high-dimensional problems

Original Article
Published: 26 February 2018

Volume 35, pages 157–173, (2019)
Cite this article

Engineering with Computers Aims and scope Submit manuscript

2541 Accesses
86 Citations
3 Altmetric
Explore all metrics

Abstract

Surrogate models provide an affordable alternative to the evaluation of expensive deterministic functions. However, the construction of accurate surrogate models with many independent variables is currently prohibitive because they require a large number of function evaluations for the desired accuracy. Gradient-enhanced kriging has the potential to reduce the number of evaluations when efficient gradient computation, such as an adjoint method, is available. However, current gradient-enhanced kriging methods do not scale well with the number of sampling points because of the rapid growth in the size of the correlation matrix, where new information is added for each sampling point in each direction of the design space. Furthermore, they do not scale well with the number of independent variables because of the increase in the number of hyperparameters that must be estimated. To address this issue, we develop a new gradient-enhanced surrogate model approach that drastically reduces the number of hyperparameters through the use of the partial least squares method to maintain accuracy. In addition, this method is able to control the size of the correlation matrix by adding only relevant points defined by the information provided by the partial least squares method. To validate our method, we compare the global accuracy of the proposed method with conventional kriging surrogate models on two analytic functions with up to 100 dimensions, as well as engineering problems of varied complexity with up to 15 dimensions. We show that the proposed method requires fewer sampling points than conventional methods to obtain the desired accuracy, or it provides more accuracy for a fixed budget of sampling points. In some cases, we get models that are over three times more accurate than previously developed surrogate models for the same computational time, and over 3200 times faster than standard gradient-enhanced kriging models for the same accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A gradient-enhanced partition-of-unity surrogate model and adaptive sampling strategy for non-analytic functions

Article 29 June 2023

Linewalker: line search for black box derivative-free optimization and surrogate model construction

Article 21 February 2024

An effective gradient and geometry enhanced sequential sampling approach for Kriging modeling

Article 09 September 2021

Notes

https://github.com/SMTorg/SMT.

Abbreviations

d :: Number of dimensions
B :: Hypercube expressed by the product between intervals of each direction space
n :: Number of sampling points
h :: Number of principal components
$\mathbf{x}$, $\mathbf{x}'$ :: $1\times d$ vector
$x_j$ :: $j{\text {th}}$ element of $\mathbf{x}$ for $j=1,\dots ,d$
$\mathbf{X}$ :: $n\times d$ matrix containing sampling points
$\mathbf{y}$ :: $n\times 1$ vector containing simulation of $\mathbf{X}$
$\mathbf{x}^{(i)}$ :: $i{\text {th}}$ sampling point for $i=1,\dots ,n$ ($1\times d$ vector)
$y^{(i)}$ :: $i{\text {th}}$ evaluated output point for $i=1,\dots ,n$
$\mathbf{X}^{(0)}$ :: $\mathbf{X}$
$\mathbf{X}^{(l-1)}$ :: Matrix containing residual of the $(l-1){\text {th}}$ inner regression
$k(\cdot ,\cdot )$ :: Covariance function
$\mathbf{r}_{\mathbf{x}\mathbf{x}'}$ :: Spatial correlation between $\mathbf{x}$ and $\mathbf{x}'$
$\mathbf{R}$ :: Covariance matrix
$s^2(\mathbf{x})$ :: Prediction of the kriging variance
$\sigma ^2$ :: Process variance
$\theta _i$ :: ith parameter of the covariance function for $i=1,\dots ,d$
$Y(\mathbf{x})$ :: Gaussian process
$\mathbf{1}$ :: n-vector of ones
$\mathbf{t}_l$ :: lth principal component for $l=1,\dots ,h$
$\mathbf{w}$ :: Weight vector for partial least squares
$\Delta x_j$ :: First-order Taylor approximation step in the $j \text{th}$ direction

References

Abraham L (2009) pydoe: the experimental design package for python. https://pythonhosted.org/pyDOE/index.html. https://pythonhosted.org/pyDOE/index.html
Alberto PR, González FG (2012) Partial least squares regression on symmetric positive-definite matrices. Revista Colombiana de Estadística 36(1):177–192
MathSciNet MATH Google Scholar
An J, Owen A (2001) Quasi-Regression. J Complex 17(4):588–607
Article MathSciNet MATH Google Scholar
Barber D (2012) Bayesian reasoning and machine learning. Cambridge University Press, New York
MATH Google Scholar
Bartoli N, Bouhlel MA, Kurek I, Lafage R, Lefebvre T, Morlier J, Priem R, Stilz V, Regis R (2016) Improvement of efficient global optimization with application to aircraft wing design. In: 17th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference, Washington, DC. AIAA-2016-4001
Bouhlel MA, Bartoli N, Morlier J, Otsmane A (2016a) An improved approach for estimating the hyperparameters of the Kriging model for high-dimensional problems through the partial least squares method. Math Probl En 2016: 6723410. https://doi.org/10.1155/2016/6723410
Bouhlel MA, Bartoli N, Otsmane A, Morlier J (2016b) Improving kriging surrogates of high-dimensional design models by pLeast squares dimension reduction. Struct Multidisc Optim 53(5):935–952. ISSN 1615-1488
Bouhlel MA, Bartoli N, Regis RG, Otsmane A, Morlier J (2018) Efficient global optimization for high-dimensional constrained problems by using the kriging models combined with the partial least squares method. Eng Optim. https://doi.org/10.1080/0305215X.2017.1419344
Box G, Hunter J, Hunter W (2005) Statistics for experimenters: design, innovation, and discovery. Wiley-Interscience, Wiley Series in Probability and Statistics. https://books.google.ca/books?id=oYUpAQAAMAAJ
Choi S, Chung H, Alonso J (2004) Design of Low-Boom Supersonic Business Jet With Evolutionary Algorithms Using Adaptive Unstructured Mesh. In 45th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics and Materials Conference. Palm Springs, California. AIAA-2004-1758
Chung HS, Alonso J (2002) Design of a Low-Boom Supersonic Business Jet Using Cokriging Approximation Models. 9th AIAA/ISSMO Symposium on Multidisciplinary Analysis and Optimization, Multidisciplinary Analysis Optimization Conferences. AIAA-2002-5598
Cressie N (1988) Spatial prediction and ordinary kriging. Math Geol 20(4):405–421
Article MathSciNet MATH Google Scholar
Deb K (2000) An efficient constraint handling method for genetic algorithms. Comput Methods Appl Mech Eng 186:311–338
Article MATH Google Scholar
Forrester AIJ, Sóbester A, Keane AJ (2008) Engineering design via surrogate modeling: a practical guide. Wiley, New York
Book Google Scholar
Frank IE, Friedman JH (1993) A statistical view of some chemometrics regression tools. Technometrics 35:109–148
Article MATH Google Scholar
Haftka R, Villanueva D, Chaudhuri A (2016) Parallel surrogate-assisted global optimization with expensive functions—a survey. Struct Multidiscipl Optim 54:3–13
Article MathSciNet Google Scholar
Helland IS (1988) On the structure of partial least squares regression. Commun Stat Simul Comput 17:581–607
Article MathSciNet MATH Google Scholar
Jeong S, Murayama M, Yamamoto K (2005) Efficient optimization design method using Kriging model. J Aircr 42(2):413–420
Article Google Scholar
Jones DR (2001) A taxonomy of global optimization methods based on response surfaces. J Global Optim 21(4):345–383
Article MathSciNet MATH Google Scholar
Jones DR, Schonlau M, Welch WJ (1998) Efficient global optimization of expensive black-box functions. J Global Optim 13(4):455–492
Article MathSciNet MATH Google Scholar
Kenway GKW, Kennedy GJ, Martins JRRA (2014) Scalable parallel approach for high-fidelity steady-state aeroelastic analysis and derivative computations. AIAA J 52(5):935–951. https://doi.org/10.2514/1.J052255
Article Google Scholar
Kleijnen J, Van Beers W, Van Nieuwenhuyse I (2010) Constrained optimization in expensive simulation: novel approach. Eur J Oper Res 202(1):164–174
Article MATH Google Scholar
Kleijnen J, Beers W, Nieuwenhuyse I (2012) Expected improvement in efficient global optimization through bootstrapped kriging. J Global Optim 54(1):59–73
Article MathSciNet MATH Google Scholar
Kleijnen JPC (2015) Design and analysis of simulation experiments, vol 230. Springer, New York
MATH Google Scholar
Kleijnen JPC (2017) Regression and kriging metamodels with their experimental designs in simulation: a review. Eur J Oper Res 256(1):1–16. https://doi.org/10.1016/j.ejor.2016.06.04
Article MathSciNet MATH Google Scholar
Krige DG (1951) A statistical approach to some basic mine valuation problems on the witwatersrand. J Chem Metall Mining Soc 52:119–139
Google Scholar
Laurenceau J, Sagaut P (2008) Building efficient response surfaces of aerodynamic functions with kriging and cokriging. AIAA J 46(2):498–507
Article Google Scholar
Lewis RM (1998) Using sensitivity information in the construction of Kriging models for design optimization. AIAA-98-4799. 7th AIAA/USAF/NASA/ISSMO Symposium on Multidisciplinary Analysis and Optimization, Multidisciplinary Analysis Optimization Conferences, pp 730–737
Liem RP, Kenway GK, Martins JRRA (2012) Multi-point, multi-mission, high-fidelity aerostructural optimization of a long-range aircraft configuration. In: Proceedings of the 14th AIAA/ISSMO Multidisciplinary Analysis and Optimization Conference. Indianapolis. https://doi.org/10.2514/6.2012-5706
Liem RP, Kenway GKW, Martins JRRA (2015a) Multimission aircraft fuel burn minimization via multipoint aerostructural optimization. AIAA J 53(1):104–122. https://doi.org/10.2514/1.J052940.
Liem RP, Mader CA, Martins JRRA (2015b) Surrogate models and mixtures of experts in aerodynamic performance prediction for aircraft mission analysis. Aerosp Sci Technol 43:126–151. https://doi.org/10.1016/j.ast.2015.02.019
Article Google Scholar
Liping W, Don B, Gene W, Mahidhar R (2006) A comparison of metamodeling methods using practical industry requirements. In: Proceedings of the 47th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics, and Materials Conference. Newport, RI
Liu W (2003) Development of gradient-enhanced Kriging approximations for multidisciplinary design optimization. PhD thesis, University of Notre Dame
Loeppky JL, Sacks S, Welch WJ (2009) Choosing the sample size of a computer experiment: a practical guide. Technometrics 51(4):366–376. https://doi.org/10.1198/TECH.2009.08040
Article MathSciNet Google Scholar
Mader CA, Martins JRRA, Alonso JJ, van der Weide E (2008) ADjoint: an approach for the rapid development of discrete adjoint solvers. AIAA J 46(4):863–873. https://doi.org/10.2514/1.29123
Article Google Scholar
Mardia KV, Watkins AJ (1989) On multimodality of the likelihood in the spatial linear model. Biometrika 76(2):289. https://doi.org/10.1093/biomet/76.2.289
Article MathSciNet MATH Google Scholar
Martins JRRA, Hwang JT (2013) Review and unification of methods for computing derivatives of multidisciplinary computational models. AIAA J 51(11):2582–2599. https://doi.org/10.2514/1.J052184
Article Google Scholar
Matheron G (1963) Principles of geostatistics. Econ Geol 58(8):1246–1266
Article Google Scholar
Morris MD, Mitchell TJ, Ylvisaker D (1993) Bayesian design and analysis of computer experiments: use of derivatives in surface prediction. Technometrics 35(3):243–255
Article MathSciNet MATH Google Scholar
Ollar J, Mortished C, Jones R, Sienz J, Toropov V (2017) Gradient based hyper-parameter optimisation for well conditioned Kriging metamodels. Struct Multidiscipl Optim 55:2029–2044
Article MathSciNet Google Scholar
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Rettenhofer P, Weiss R, Dubourg V, Vanderplas J, Passos A, Cournapeau D, Brucher M, Perrot M, Duchesnay E (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830
MathSciNet MATH Google Scholar
Pironneau O (1974) On optimum design in fluid mechanics. J Fluid Mech 64(1):97–110
Article MathSciNet MATH Google Scholar
Rasmussen CE, Williams CKI (2006) Gaussian processes for machine learning. Adaptive computation and machine learning. MIT Press, Cambridge
MATH Google Scholar
Sacks J, Schiller SB, Welch WJ (1989a) Designs for computer experiments. Technometrics 31(1):41–47
Article MathSciNet Google Scholar
Sacks J, Welch WJ, Mitchell WJ, Wynn HP (1989b) Design and analysis of computer experiments. Stat Sci 4(4):409–435
Article MathSciNet MATH Google Scholar
Sakata S, Ashida F, Zako M (2003) Structural optimization using Kriging approximation. Comput Methods Appl Mech Eng 192(417):923–939
Article MATH Google Scholar
Simpson TW, Mauery TM, Korte JJ, Mistree F (2001a) Kriging models for global approximation in simulation-based multidisciplinary design optimization. AIAA J 39(12):2233–2241
Article Google Scholar
Simpson TW, Poplinski JD, Koch PN, Allen JK (2001b) Metamodels for computer-based engineering design: survey and recommendations. Eng Comput 17(2):129–150
Article MATH Google Scholar
Tenenhaus M (1998) La Régression PLS: Théorie et Pratique. Éd, Technip
MATH Google Scholar
Toal DJJ, Bressloff NW, Keane AJ (2008) Geometric filtration using POD for aerodynamic design optimization. In: 26th AIAA Applied Aerodynamics Conference. http://uos-app00353-si.soton.ac.uk/59225/
Ulaganathan S, Couckuyt I, Dhaene T, Laermans E, Degroote J (2014) On the use of gradients in Kriging surrogate models. In: Proceedings of the 2014 Winter Simulation Conference, Savannah, GA, USA, December 7–10, pp 2692–2701. https://doi.org/10.1109/WSC.2014.7020113
Viana FAC, Simpson TW, Balabanov V, Toropov V (2014) Metamodeling in multidisciplinary design optimization: how far have we really come? AIAA J 52:670–690. https://doi.org/10.2514/1.J052375
Article Google Scholar
Welch WJ, Buck RJ, Sacks J, Wynn HP, Mitchell TJ, Morris MD (1992) Screening, predicting, and computer experiments. Technometrics 34(1):15–25
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Michigan, Ann Arbor, USA
Mohamed A. Bouhlel & Joaquim R. R. A. Martins

Authors

Mohamed A. Bouhlel
View author publications
You can also search for this author in PubMed Google Scholar
Joaquim R. R. A. Martins
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohamed A. Bouhlel.

Appendices

Appendix A: Definition of the engineering cases

The analytical expressions for the engineering cases are as follows.

1.1 $\text{A.1}\;\text {P}_1,\text { P}_2, \text { and P}_3$

The responses are the deflection $\delta$, bending stress $\sigma$, and shear stress $\tau$, respectively, of a welded beam problem [13].

$$\begin{aligned} \text {P}_1:\,\delta =\frac{2.1952}{t^3b}, \end{aligned}$$

$$\begin{aligned} \text {P}_2:\,\sigma =\frac{504000}{t^2b}, \end{aligned}$$

$$\begin{aligned} \text {P}_3:\,\tau =\sqrt{\frac{\tau '^2+\tau ''^2+l\tau '\tau ''}{\sqrt{0.25\left( l^2+(h+t)^2\right) }}}, \end{aligned}$$

where

$$\begin{aligned} \tau '=\frac{6000}{\sqrt{2}hl} \text { and} \quad \tau ''=\frac{6000(14+0.5l)\sqrt{0.25\left( l^2+(h+t)^2\right) }}{2\left[ 0.707hl\left( \frac{l^2}{12}+0.25(h+t)^2\right) \right] }. \end{aligned}$$

The table below gives the ranges of the input variables.

Input variable	Range
h	[0.125, 1]
b	[0.1, 1]
l, t	[5, 10]

1.2 $\text{A.2}\;\text {P}_4$

This problem characterizes the flow of water through a borehole that is drilled from the ground surface through two aquifers [39]. The water flow rate (m$^3$/yr) is given by

$$\begin{aligned} \text {P}_4:\,y=\frac{2\pi T_u\left( H_u-H_l\right) }{\ln \left( \frac{r}{r_w}\right) \left[ 1+\frac{2LT_u}{\ln \left( \frac{r}{r_w}\right) r_w^2K_w}+\frac{T_u}{T_l}\right] }. \end{aligned}$$

The table below gives the ranges of the input variables.

Input variable	Range	Input variable	Range
$r_w$	[0.05, 0.15]	r	[100, 50000]
$T_u$	[63070, 115600]	$H_u$	[990, 1110]
$T_l$	[63.1, 116]	$H_l$	[700, 820]
L	[1120, 1680]	$K_w$	[9855, 12045]

1.3 $\text{A.3}\;\text {P}_5$

This function represents the position of a robot arm [3]:

$$\begin{aligned} \text {P}_5:\,y=\sqrt{\left( \sum \limits _{i=1}^4L_i\cos \left( \sum \limits _{j=1}^i\theta _j\right) \right) ^2+\left( \sum \limits _{i=1}^4L_i\sin \left( \sum \limits _{j=1}^i\theta _j\right) \right) ^2}. \end{aligned}$$

The table below gives the ranges of the input variables.

Input variable	Range
$L_i$	[0, 1]
$\theta _j$	$[0,2\pi ]$

1.4 $\text{A.4}\;\text {P}_6$

This is an estimate of the weight of a light aircraft wing [14]:

$$\begin{aligned} \text {P}_6:\,y=0.036S_w^{0.758}W_{fw}^{0.0035}\left( \frac{A}{\cos ^2\Lambda }\right) q^{0.006}\lambda ^{0.04}\left( \frac{100tc}{\cos \Lambda }\right) ^{-0.3}\left( N_zW_{dg}\right) ^{0.49}+S_wW_p. \end{aligned}$$

The table below gives the ranges of the input variables.

Input variable	Range	Input variable	Range
$S_w$	[150, 200]	$W_{fw}$	[220, 300]
A	[6, 10]	$\Lambda$	$[-10,10]$
q	[16, 45]	$\lambda$	[0.5, 1]
tc	[0.08, 0.18]	$N_z$	[2.5, 6]
$W_{dg}$	[1700, 2500]	$W_p$	[0.025, 0.08]

1.5 $\text{A.5}\;\text {P}_7\text { and P}_8$

These are the weight and lowest natural frequency of a torsion vibration problem [32]:

$$\begin{aligned} \text {P}_7:\,y=\sum \limits _{i=1}^3\lambda _i\pi L_i \left( \frac{d_i}{2}\right) ^2+\sum \limits _{j=1}^2\rho _j\pi T_j \left( \frac{D_j}{2}\right) ^2, \end{aligned}$$

$$\begin{aligned} \text {P}_8:\,y=\frac{\sqrt{\frac{-b-\sqrt{b^2-4c}}{2}}}{2\pi }, \end{aligned}$$

where

$$\begin{aligned} K_i=\frac{\pi G_id_i}{32L_i},\quad M_j=\frac{\rho _j \pi t_jD_j}{4g}, \end{aligned}$$

$$\begin{aligned} J_j=0.5M_j\frac{D_j}{2}, \end{aligned}$$

$$\begin{aligned} b=-\left( \frac{K_1+K2}{J_1}+\frac{K_2+K3}{J_2}\right) , \end{aligned}$$

and

$$\begin{aligned} c=\frac{K_1K_2+K_2K_3+K_3K_1}{J_1J_2}. \end{aligned}$$

The table below gives the ranges of the input variables.

Input variable	Range	Input variable	Range
$d_1$	[1.8, 2.2]	$L_1$	[9, 11]
$G_1$	[105300000, 128700000]	$\lambda _1$	[0.252, 0.308]
$d_2$	[1.638, 2.002]	$L_2$	[10.8, 13.2]
$G_2$	[5580000, 6820000]	$\lambda _2$	[0.144, 0.176]
$d_3$	[2.025, 2.475]	$L_3$	[7.2, 8.8]
$G_3$	[3510000, 4290000]	$\lambda _3$	[0.09, 0.11]
$D_1$	[10.8, 13.2]	$t_1$	[2.7, 3.3]
$\rho _1$	[0.252, 0.308]	$D_2$	[12.6, 15.4]
$t_2$	[3.6, 4.4]	$\rho _1$	[0.09, 0.11]

Appendix B: Results for the analytic cases

Table 2 Mean of the error values (upper table) and computational times (lower table) for kriging, KPLS, KPLSK, and GE-KPLSm for $m=1,\cdots ,5$ with 10 trials

Full size table

Appendix C: Results for the engineering cases

Table 3 Mean of the error values (upper table) and computational times (lower table) for kriging, KPLS, KPLSK, and GE-KPLSm for $m=1,\cdots ,5$ with 10 trials

Full size table

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bouhlel, M.A., Martins, J.R.R.A. Gradient-enhanced kriging for high-dimensional problems. Engineering with Computers 35, 157–173 (2019). https://doi.org/10.1007/s00366-018-0590-x

Download citation

Received: 10 June 2017
Accepted: 31 January 2018
Published: 26 February 2018
Issue Date: 09 January 2019
DOI: https://doi.org/10.1007/s00366-018-0590-x

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Gradient-enhanced kriging for high-dimensional problems

Abstract

Access this article

Similar content being viewed by others

A gradient-enhanced partition-of-unity surrogate model and adaptive sampling strategy for non-analytic functions

Linewalker: line search for black box derivative-free optimization and surrogate model construction

An effective gradient and geometry enhanced sequential sampling approach for Kriging modeling

Notes

Abbreviations

References

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix A: Definition of the engineering cases

1.1 \(\text{A.1}\;\text {P}_1,\text { P}_2, \text { and P}_3\)

1.2 \(\text{A.2}\;\text {P}_4\)

1.3 \(\text{A.3}\;\text {P}_5\)

1.4 \(\text{A.4}\;\text {P}_6\)

1.5 \(\text{A.5}\;\text {P}_7\text { and P}_8\)

Appendix B: Results for the analytic cases

Appendix C: Results for the engineering cases

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Input variable	Range
\(L_i\)	[0, 1]
\(\theta _j\)	\([0,2\pi ]\)

Gradient-enhanced kriging for high-dimensional problems

Abstract

Access this article

Similar content being viewed by others

A gradient-enhanced partition-of-unity surrogate model and adaptive sampling strategy for non-analytic functions

Linewalker: line search for black box derivative-free optimization and surrogate model construction

An effective gradient and geometry enhanced sequential sampling approach for Kriging modeling

Notes

Abbreviations

References

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix A: Definition of the engineering cases

1.1 \(\text{A.1}\;\text {P}_1,\text { P}_2, \text { and P}_3\)

1.2 \(\text{A.2}\;\text {P}_4\)

1.3 \(\text{A.3}\;\text {P}_5\)

1.4 \(\text{A.4}\;\text {P}_6\)

1.5 \(\text{A.5}\;\text {P}_7\text { and P}_8\)

Appendix B: Results for the analytic cases

Appendix C: Results for the engineering cases

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation