
Ensemble of metamodels: extensions of the least squares approach to efficient global optimization

RESEARCH PAPER · Published in Structural and Multidisciplinary Optimization

Abstract

In this work we present LSEGO, an approach to drive efficient global optimization (EGO) based on least squares (LS) ensembles of metamodels. The LS ensemble makes it possible to estimate the prediction uncertainty of any kind of metamodel (not only kriging) and thereby to compute an estimate of the expected improvement function. For the problems studied, the proposed LSEGO algorithm found the global optimum in fewer optimization cycles than the classical EGO approach. As more infill points are added per cycle, convergence to the global optimum accelerates (exploitation) and the quality of the metamodel over the design space improves (exploration); this is especially valuable as the number of variables increases, where the standard single-point EGO can be quite slow to reach the optimum. LSEGO has proven to be a feasible way to drive EGO with ensembles of metamodels, handles constrained problems, and is restricted neither to kriging nor to a single infill point per optimization cycle.


Notes

  1. For instance, even with the high-end computer clusters used nowadays in the automotive industry, a single full-vehicle analysis of a high-fidelity safety crash FEM model takes up to 15 processing hours with 48 CPUs in parallel. With respect to CFD analysis, a single complete car aerodynamics model for drag calculation can take up to 30 hours using 96 CPUs. An interesting essay regarding this “never-ending” need for computational resources in structural optimization can be found in Venkataraman and Haftka (2004).

  2. For notational convenience only, and without loss of generality, we assume that all equality constraints h(x) can be properly transformed into inequality constraints g(x).

  3. Matlab is a well-known and widely used numerical programming platform, developed and distributed by The MathWorks Inc.; see www.mathworks.com.

  4. A boxplot is a common statistical graph used for visual comparison of the distributions of different variables on the same plot. The box is delimited by lines at the lower quartile (25%), median (50%) and upper quartile (75%) of the data. The lines extending above and below each box (the whiskers) indicate the spread of the remaining data outside the quartiles. Outliers, if present, are represented by plus signs “+” above or below the whiskers. We used the Matlab function boxplot (with default parameters) to create the plots.

  5. For further details and recent updates on SURROGATES Toolbox refer to the website: https://sites.google.com/site/srgtstoolbox/.

  6. As a common practice for metamodel-based optimization, the number of points in the initial DOE is often in the range 5 n_v to 10 n_v (see the sketch after these notes).
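As an illustration of this rule of thumb, the following minimal sketch draws a Latin hypercube DOE of size 10 n_v. It is written in Python with the scipy qmc module (an assumption of this example; the work itself used Matlab-based tools):

```python
# Minimal sketch: initial space-filling DOE sized by the 5*nv..10*nv rule.
# Assumes scipy >= 1.7 for scipy.stats.qmc; the paper itself used Matlab tools.
import numpy as np
from scipy.stats import qmc

nv = 3                       # number of design variables
N = 10 * nv                  # upper end of the 5*nv..10*nv rule of thumb
lower = np.array([-2.0, -2.0, -2.0])
upper = np.array([4.0, 4.0, 4.0])

sampler = qmc.LatinHypercube(d=nv, seed=0)
X_unit = sampler.random(N)              # N points in the unit hypercube [0, 1]^nv
X = qmc.scale(X_unit, lower, upper)     # map to the actual design space
print(X.shape)                          # (30, 3)
```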

References

  • Chaudhuri A, Haftka RT (2012) Efficient global optimization with adaptive target setting. AIAA J 52(7):1573–1578

  • Desautels T, Krause A, Burdick J (2014) Parallelizing exploration-exploitation tradeoffs in Gaussian process bandit optimization. J Mach Learn Res 15(1):3873–3923

  • Morales-Enciso S, Branke J (2015) Tracking global optima in dynamic environments with efficient global optimization. Eur J Oper Res 242:744–755

  • Fang KT, Li R, Sudjianto A (2006) Design and modeling for computer experiments. Computer Science and Data Analysis Series. Chapman & Hall/CRC, Boca Raton

  • Ferreira WG (2016) Efficient global optimization driven by ensemble of metamodels: new directions opened by least squares approximation. PhD thesis, Faculty of Mechanical Engineering, University of Campinas (UNICAMP), Campinas, Brazil

  • Ferreira WG, Serpa AL (2016) Ensemble of metamodels: the augmented least squares approach. Struct Multidiscip Optim 53(5):1019–1046

  • Forrester A, Keane A (2009) Recent advances in surrogate-based optimization. Prog Aerosp Sci 45:50–79

  • Forrester A, Sóbester A, Keane A (2008) Engineering design via surrogate modelling: a practical guide. Wiley, Chichester, UK

  • Ginsbourger D, Le Riche R, Carraro L (2010) Kriging is well-suited to parallelize optimization. In: Computational intelligence in expensive optimization problems. Adaptation, Learning, and Optimization, vol 2. Springer, pp 131–162

  • Giunta AA, Watson LT (1998) Comparison of approximation modeling techniques: polynomial versus interpolating models. In: 7th AIAA/USAF/NASA/ISSMO symposium on multidisciplinary analysis and optimization, AIAA-98-4758, pp 392–404

  • Gunn SR (1997) Support vector machines for classification and regression. Technical report, Image, Speech and Intelligent Systems Research Group, University of Southampton, UK

  • Haftka RT, Villanueva D, Chaudhuri A (2016) Parallel surrogate-assisted global optimization with expensive functions: a survey. Struct Multidiscip Optim 54(1):3–13

  • Han ZH, Zhang KS (2012) Surrogate-based optimization. In: Roeva O (ed) Real-world applications of genetic algorithms. InTech, Shanghai, China (ISBN 978-953-51-0146-8)

  • Henkenjohann N, Kunert J (2007) An efficient sequential optimization approach based on the multivariate expected improvement criterion. Qual Eng 19(4):267–280

  • Janusevskis J, Le Riche R, Ginsbourger D, Girdziusas R (2012) Expected improvements for the asynchronous parallel global optimization of expensive functions: potentials and challenges. In: Learning and intelligent optimization. Lecture Notes in Computer Science, vol 7219. Springer, pp 413–418

  • Jekabsons G (2009) RBF: radial basis function interpolation for Matlab/Octave, version 1.1. Riga Technical University, Latvia

  • Jin R, Chen W, Sudjianto A (2002) On sequential sampling for global metamodeling in engineering design. In: ASME 2002 design engineering technical conferences and computers and information in engineering conference, DETC2002/DAC-34092, Montreal, Canada

  • Jones DR (2001) A taxonomy of global optimization methods based on response surfaces. J Glob Optim 21:345–383

  • Jones DR, Schonlau M, Welch WJ (1998) Efficient global optimization of expensive black-box functions. J Glob Optim 13:455–492

  • Jurecka F (2007) Optimization based on metamodeling techniques. PhD thesis, Technische Universität München, Munich, Germany

  • Koziel S, Leifsson L (2013) Surrogate-based modeling and optimization: applications in engineering. Springer, New York

  • Krige DG (1951) A statistical approach to some mine valuations and allied problems at the Witwatersrand. Master’s thesis, University of the Witwatersrand, Johannesburg

  • Lophaven SN, Nielsen HB, Sondergaard J (2002) DACE: a Matlab kriging toolbox. Tech. Rep. IMM-TR-2002-12, Technical University of Denmark

  • Mehari MT, Poorter E, Couckuyt I, Deschrijver D, Gerwen JV, Pareit D, Dhaene T, Moerman I (2015) Efficient global optimization of multi-parameter network problems on wireless testbeds. Ad Hoc Netw 29:15–31

  • Mockus J (1994) Application of Bayesian approach to numerical methods of global and stochastic optimization. J Glob Optim 4:347–365

  • Ponweiser W, Wagner T, Vincze M (2008) Clustered multiple generalized expected improvement: a novel infill sampling criterion for surrogate models. In: 2008 IEEE world congress on computational intelligence. IEEE Press, Hong Kong, pp 3514–3521

  • Queipo NV et al (2005) Surrogate-based analysis and optimization. Prog Aerosp Sci 41:1–28

  • Rasmussen CE, Williams CK (2006) Gaussian processes for machine learning. The MIT Press, Cambridge, MA

  • Rehman SU, Langelaar M, van Keulen F (2014) Efficient kriging-based robust optimization of unconstrained problems. J Comput Sci 5:872–881

  • Schonlau M (1997) Computer experiments and global optimization. PhD thesis, University of Waterloo, Waterloo, Ontario, Canada

  • Simpson TW, Toropov V, Balabanov V, Viana FAC (2008) Design and analysis of computer experiments in multidisciplinary design optimization: a review of how far we have come - or not. In: 12th AIAA/ISSMO multidisciplinary analysis and optimization conference, Victoria, British Columbia

  • Sóbester A, Leary SJ, Keane A (2004) A parallel updating scheme for approximating and optimizing high fidelity computer simulations. Struct Multidiscip Optim 27:371–383

  • Thacker WI, Zhang J, Watson LT, Birch JB, Iyer MA, Berry MW (2010) Algorithm 905: SHEPPACK: modified Shepard algorithm for interpolation of scattered multivariate data. ACM Trans Math Softw 37(3):1–20

  • Venkataraman S, Haftka RT (2004) Structural optimization complexity: what has Moore’s law done for us? Struct Multidiscip Optim 28:375–387

  • Viana FAC (2009) SURROGATES Toolbox user’s guide, version 2.0 (release 3). Available at: http://fchegury.googlepages.com

  • Viana FAC (2011) Multiple surrogates for prediction and optimization. PhD thesis, University of Florida, Gainesville, FL, USA

  • Viana FAC, Haftka RT (2010) Surrogate-based optimization with parallel simulations using probability of improvement. In: Proceedings of the 13th AIAA/ISSMO multidisciplinary analysis optimization conference, Fort Worth, Texas, USA

  • Viana FAC, Haftka RT, Steffen V (2009) Multiple surrogates: how cross-validation error can help us to obtain the best predictor. Struct Multidiscip Optim 39(4):439–457

  • Viana FAC, Gogu C, Haftka RT (2010) Making the most out of surrogate models: tricks of the trade. In: Proceedings of the ASME 2010 international design engineering technical conferences & computers and information in engineering conference (IDETC/CIE 2010), Montreal, Quebec, Canada

  • Viana FAC, Haftka RT, Watson LT (2013) Efficient global optimization algorithm assisted by multiple surrogate techniques. J Glob Optim 56:669–689


Acknowledgements

The authors would like to thank Dr. F.A.C. Viana for the prompt help with the SURROGATES Toolbox and also for the useful comments and discussions about the preliminary results of this work.

W.G. Ferreira would like to thank Ford Motor Company and his colleagues in the MDO group and the Product Development department, whose support helped in the development of this work, which is part of his doctoral research concluded at UNICAMP at the end of 2016.

Finally, the authors are grateful for the questions and comments from the journal editors and reviewers. Undoubtedly their valuable suggestions helped to improve the clarity and consistency of the present text.

Author information


Corresponding author

Correspondence to Wallace G. Ferreira.

Appendices

Appendix A: The kriging metamodel

The kriging model, originally proposed by Krige (1951), is an interpolating metamodel in which the basis functions, as stated in (1), are of the form

$$ \psi^{(i)} = \psi \left( \left\|\mathbf{x}^{(i)} - \mathbf{x}\right\|\right) = \exp\left( -\sum\limits_{j=1}^{k} {\theta_{j} \left| x_{j}^{(i)} - x_{j} \right|^{p_{j}}} \right), $$
(12)

with tuning parameters \(\theta_{j}\) and \(p_{j}\), typically determined by maximum likelihood estimation.

With the parameters estimated, the final kriging predictor is of the form

$$ \hat{f}(\mathbf{x}) = \hat{\mu} + \boldsymbol{\psi}^{T}\boldsymbol{\Psi}^{-1}\left( \mathbf{y} -\mathbf{1}\hat{\mu}\right), $$
(13)

where \(\mathbf{y} = \left[y^{(1)} {\ldots} y^{(N)}\right]^{T}\), 1 is a vector of ones, and \(\boldsymbol{\Psi}\) is the so-called N × N matrix of correlations between the sample data, whose entries are calculated by means of (12) as

$$ \Psi_{rs} = \psi \left( \left\|\mathbf{x}^{(r)} - \mathbf{x}^{(s)} \right\|\right), \qquad r, s = 1, \ldots, N, $$
(14)

and \(\hat {\mu }\) is given by

$$ \hat{\mu} = \frac{\mathbf{1}^{T}\boldsymbol{\Psi}^{-1}\mathbf{y}}{\mathbf{1}^{T}\boldsymbol{\Psi}^{-1}\mathbf{1}}. $$
(15)

One of the key benefits of kriging models is that they provide an uncertainty estimate for the prediction (the mean squared error, MSE) at each point x, given by

$$ \hat{s}^{2}(\mathbf{x}) = \hat{\sigma}^{2}\left[1 - \boldsymbol{\psi}^{T}\boldsymbol{\Psi}^{-1}\boldsymbol{\psi} + \frac{\left( 1-\mathbf{1}^{T}\boldsymbol{\Psi}^{-1}\boldsymbol{\psi}\right)^{2}}{\mathbf{1}^{T}\boldsymbol{\Psi}^{-1}\mathbf{1}}\right], $$
(16)

with variance estimated by

$$ \hat{\sigma}^{2} = \frac{\left( \mathbf{y} -\mathbf{1}\hat{\mu}\right)^{T}\boldsymbol{\Psi}^{-1}\left( \mathbf{y} -\mathbf{1}\hat{\mu}\right)}{N}. $$
(17)

Refer to Forrester et al. (2008) or Fang et al. (2006) for further details on the metamodel formulation.
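For concreteness, the following is a minimal Python sketch of (12)–(17), assuming the hyperparameters \(\theta_{j}\) and \(p_{j}\) are already fixed (the maximum likelihood search is omitted). The dictionary-based model container and the small nugget term are implementation choices of this sketch; it is an illustrative reading of the equations above, not the DACE/SURROGATES implementation used in this work.

```python
# Minimal kriging sketch following (12)-(17); theta and p assumed given
# (in practice they come from maximum likelihood estimation).
import numpy as np

def corr(A, B, theta, p):
    """Correlation psi of (12) between the rows of A and the rows of B."""
    d = np.abs(A[:, None, :] - B[None, :, :])        # pairwise |x_j^(r) - x_j^(s)|
    return np.exp(-np.sum(theta * d**p, axis=2))

def kriging_fit(X, y, theta, p, nugget=1e-10):
    N = len(y)
    Psi = corr(X, X, theta, p) + nugget * np.eye(N)  # correlation matrix, eq. (14)
    Psi_inv = np.linalg.inv(Psi)
    one = np.ones(N)
    mu = (one @ Psi_inv @ y) / (one @ Psi_inv @ one)       # eq. (15)
    sigma2 = ((y - mu) @ Psi_inv @ (y - mu)) / N           # eq. (17)
    return dict(X=X, y=y, theta=theta, p=p, Psi_inv=Psi_inv, mu=mu, sigma2=sigma2)

def kriging_predict(model, x):
    """Prediction (13) and MSE (16) at a single point x."""
    psi = corr(model['X'], x[None, :], model['theta'], model['p']).ravel()
    Pi, mu, y = model['Psi_inv'], model['mu'], model['y']
    one = np.ones(len(y))
    f_hat = mu + psi @ Pi @ (y - mu)                                   # eq. (13)
    s2 = model['sigma2'] * (1.0 - psi @ Pi @ psi
         + (1.0 - one @ Pi @ psi)**2 / (one @ Pi @ one))               # eq. (16)
    return f_hat, max(s2, 0.0)
```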

Appendix B: Analytical benchmark functions

These functions were chosen because they are widely used to validate both metamodeling and optimization methods, as for example in Jones et al. (1998) and Viana et al. (2013).

Branin-Hoo

$$\begin{array}{@{}rcl@{}} y\left( \mathbf{x}\right) & = &\left( x_{2} - \frac{5.1 x_{1}^{2}}{4\pi^{2}} + \frac{5x_{1}}{\pi} -6\right)^{2}\\ & &+ 10\left( 1-\frac{1}{8\pi}\right)\cos\left( x_{1}\right) + 10, \end{array} $$
(18)

for the region \(-5 \le x_{1} \le 10\) and \(0 \le x_{2} \le 15\). There are three global minima in this region, namely \(\mathbf{x}^{\ast} \approx \left(-\pi, 12.275 \right), \left(\pi, 2.275 \right), \left(3\pi, 2.475 \right)\), all with \(f\left(\mathbf{x}^{\ast}\right) = \frac{5}{4\pi}\).
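As a quick sanity check, the sketch below transcribes (18) directly and evaluates the three optima quoted above:

```python
import numpy as np

def branin(x1, x2):
    # Branin-Hoo function, eq. (18)
    return ((x2 - 5.1 * x1**2 / (4 * np.pi**2) + 5 * x1 / np.pi - 6)**2
            + 10 * (1 - 1 / (8 * np.pi)) * np.cos(x1) + 10)

for x1, x2 in [(-np.pi, 12.275), (np.pi, 2.275), (3 * np.pi, 2.475)]:
    print(f"f({x1:+.4f}, {x2:.3f}) = {branin(x1, x2):.6f}")  # all ~ 5/(4*pi) ~ 0.397887
```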

Hartman

$$ y(\mathbf{x})= -\sum\limits_{i=1}^{4}{c_{i}\exp\left[ -\sum\limits_{j=1}^{n_{v}}{a_{ij}\left( x_{j} - p_{ij} \right)^{2}} \right]} , $$
(19)

where \(\mathbf{x} \in \left[0, 1\right]^{n_{v}}\), with constants \(c_{i}\), \(a_{ij}\) and \(p_{ij}\) given in Table 3 for the case \(n_{v} = 3\) (Hartman-3), and in Tables 4 and 5 for the case \(n_{v} = 6\) (Hartman-6).

Table 3 Data for Hartman-3 function
Table 4 Data for Hartman-6 function, c i and a i j
Table 5 Data for Hartman-6 function, p i j

In the case of Hartman-3, there are four local minima,

$$\mathbf{x}_{local} \approx \left( p_{i1},p_{i2},p_{i3}\right), \qquad i = 1, \ldots, 4,$$

with \(f_{local} \approx -c_{i}\), and the global minimum is located at

$$\mathbf{x}^{\ast} \approx \left( 0.114614, 0.555649, 0.852547 \right),$$

with \(f\left (\mathbf {x}^{\ast } \right ) \approx -3.862782\).

In the case of Hartman-6, there are four local minima,

$$\mathbf{x}_{local} \approx \left( p_{i1},p_{i2},p_{i3},p_{i4},p_{i5},p_{i6}\right), \qquad i = 1, \ldots, 4, $$

with \(f_{local} \approx -c_{i}\), and the global minimum is located at

$$\mathbf{x}^{\ast} \approx (0.201690,\ 0.150011,\ 0.476874,\ 0.275332,\ 0.311652,\ 0.657301), $$

with \(f\left (\mathbf {x}^{\ast } \right ) \approx -3.322368\).
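The sketch below implements (19) for the Hartman-3 case; the constants are the values commonly tabulated in the literature for this function (Table 3), and Hartman-6 follows the same pattern with the data of Tables 4 and 5:

```python
import numpy as np

# Hartman-3 constants as commonly tabulated in the literature (Table 3).
C = np.array([1.0, 1.2, 3.0, 3.2])
A = np.array([[3.0, 10.0, 30.0],
              [0.1, 10.0, 35.0],
              [3.0, 10.0, 30.0],
              [0.1, 10.0, 35.0]])
P = np.array([[0.36890, 0.11700, 0.26730],
              [0.46990, 0.43870, 0.74700],
              [0.10910, 0.87320, 0.55470],
              [0.03815, 0.57430, 0.88280]])

def hartman(x, c=C, a=A, p=P):
    # eq. (19): negative sum of Gaussian-like bumps centred near the rows of p
    x = np.asarray(x)
    return -np.sum(c * np.exp(-np.sum(a * (x - p)**2, axis=1)))

print(hartman([0.114614, 0.555649, 0.852547]))   # ~ -3.862782 (global minimum)
```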

Giunta-Watson

This is the “noise-free” version of the function used by Giunta and Watson (1998)

$$ y(\mathbf{x})=\sum\limits_{i=1}^{n_{v}}\left[ \frac{3}{10}+\sin \left( \frac{16}{15}x_{i}-1\right) +\sin^{2}\left( \frac{16}{15}x_{i}-1\right) \right] , $$
(20)

where \(\mathbf {x} \in \left [-2, 4\right ]^{n_{v}}\).
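A direct transcription of (20), for illustration:

```python
import numpy as np

def giunta_watson(x):
    # Noise-free Giunta-Watson function, eq. (20)
    x = np.asarray(x)
    t = (16.0 / 15.0) * x - 1.0
    return np.sum(0.3 + np.sin(t) + np.sin(t)**2)

print(giunta_watson([0.0, 0.0]))   # value at the origin for nv = 2
```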

Appendix C: SURROGATES Toolbox

The SURROGATES Toolbox (Viana 2009) is a Matlab-based toolbox that aggregates and extends several open-source tools previously developed in the literature for the design and analysis of computer experiments, i.e., metamodeling and optimization. We used version 2.0; version 3.0 already includes EGO variants (see note 5).

The SURROGATES Toolbox builds on the following published third-party software: SVM by Gunn (1997), DACE by Lophaven et al. (2002), GPML by Rasmussen and Williams (2006), RBF by Jekabsons (2009), and SHEPPACK by Thacker et al. (2010). This compilation into a single framework was implemented and applied in previous research by Viana and co-workers, for example Viana et al. (2009) and Viana (2011).

Appendix D: A note on sequential sampling vs. the one-stage approach

In Ferreira (2016) we investigated some examples with analytical engineering functions. We repeated the one-stage optimization ten times, with different initial DOEs, using a very large number of sampling points relative to the number of variables (see note 6): for f_1(x) of the Three-Bar Truss, N = 120 (60 n_v); for the Cantilever Beam, N = 120 (60 n_v); for the Helical Spring, N = 360 (120 n_v); and for the Pressure Vessel, N = 460 (120 n_v).

The results of this experiment are presented in Fig. 18. For the cases investigated, there is no guarantee of reaching the exact optimum with a one-stage approach, even when the optimization starts with a high density of sampling points in the design space. Most of these points probably serve only to improve the overall quality of the metamodels (exploration) and are not effective in helping to find the exact minimum (exploitation), which is clearly a waste of resources when optimization is the objective.

Fig. 18

Boxplots of the converged results for the analytical engineering functions with one-stage optimization. Each problem was repeated 10 times with a different initial DOE: for f_1(x) of the Three-Bar Truss, N = 120 (60 n_v); for the Cantilever Beam, N = 120 (60 n_v); for the Helical Spring, N = 360 (120 n_v); and for the Pressure Vessel, N = 460 (120 n_v). Even with a very dense set of initial sampling points, there is no guarantee of reaching the exact optimum. For details refer to Ferreira (2016)

These results confirm our belief that it is worthwhile to apply sequential sampling approaches such as EGO-type algorithms, or some hybrid approach (allied to clustering, for instance), in order to add points slowly and “surgically” in regions of the design space with a real chance or expectation of improvement in the objective and constraint responses.

In this sense, we reinforce the comment of Forrester and Keane (2009) that metamodel-based optimization must always include some form of iterative search and repeated infill to ensure accuracy in the areas of interest in the design space. In the same direction, we agree with the recommendation that a reasonable number of points for starting a sequential-sampling metamodel-based optimization is about one third (33%) of the available budget of true function/model evaluations (or processing time) to be spent over the whole optimization cycle, as illustrated in the sketch below.
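The sketch below makes the sequential-infill loop concrete: a generic single-point EGO run on the Branin-Hoo function, reusing the kriging and branin sketches from Appendices A and B, with the standard expected improvement of Jones et al. (1998) maximized by multistart local search. It is a generic illustration under fixed, assumed hyperparameters, not the LSEGO algorithm of this paper.

```python
# Generic single-point EGO loop (a sketch, not the paper's LSEGO):
# fit kriging, maximize expected improvement, evaluate, repeat.
import numpy as np
from scipy.stats import norm, qmc
from scipy.optimize import minimize

def expected_improvement(model, x, y_min):
    # Standard EI for minimization (Jones et al. 1998)
    f_hat, s2 = kriging_predict(model, x)
    s = np.sqrt(s2)
    if s < 1e-12:
        return 0.0
    z = (y_min - f_hat) / s
    return (y_min - f_hat) * norm.cdf(z) + s * norm.pdf(z)

lower, upper = np.array([-5.0, 0.0]), np.array([10.0, 15.0])
f = lambda x: branin(x[0], x[1])

# Initial DOE: roughly one third of the 30-evaluation budget (see text above).
X = qmc.scale(qmc.LatinHypercube(d=2, seed=0).random(10), lower, upper)
y = np.array([f(x) for x in X])

for cycle in range(20):       # remaining budget, one infill point per cycle
    # theta and p fixed here for simplicity (an assumption of this sketch)
    model = kriging_fit(X, y, theta=np.array([0.05, 0.05]), p=np.array([2.0, 2.0]))
    best_x, best_ei = None, -1.0
    starts = qmc.scale(qmc.LatinHypercube(d=2, seed=cycle + 1).random(5),
                       lower, upper)
    for x0 in starts:         # multistart maximization of EI
        res = minimize(lambda x: -expected_improvement(model, x, y.min()),
                       x0, bounds=list(zip(lower, upper)))
        if -res.fun > best_ei:
            best_x, best_ei = res.x, -res.fun
    X = np.vstack([X, best_x])          # add the infill point, re-fit next cycle
    y = np.append(y, f(best_x))

print("best value found:", y.min())     # Branin global optimum is ~0.397887
```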


Cite this article

Ferreira, W.G., Serpa, A.L. Ensemble of metamodels: extensions of the least squares approach to efficient global optimization. Struct Multidisc Optim 57, 131–159 (2018). https://doi.org/10.1007/s00158-017-1745-x

