Global optimization in machine learning: the design of a predictive analytics application
Global optimization, and Bayesian optimization in particular, has become the tool of choice for hyperparameter tuning and algorithm configuration aimed at optimizing the generalization capability of machine learning algorithms. The contribution of this paper is to extend this approach to a complex algorithmic pipeline for predictive analytics, based on time-series clustering and artificial neural networks. The R software environment is used with mlrMBO, a comprehensive and flexible toolbox for sequential model-based optimization. A random forest is adopted as the surrogate model, given the nature of the decision variables (i.e., conditional and discrete hyperparameters) in the case studies considered. Two acquisition functions are considered and their results compared: expected improvement and the lower confidence bound. The computational results, on a benchmark dataset and a real-world dataset, show that even in a complex search space, with up to 80 dimensions consisting of integer, categorical, and conditional variables (i.e., hyperparameters), sequential model-based optimization is an effective solution, with the lower confidence bound requiring fewer function evaluations than expected improvement to find the same optimal solution.
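The sequential model-based optimization loop referred to above can be sketched as follows. This is a minimal, self-contained toy example, not the authors' mlrMBO/random-forest pipeline: the quadratic objective, the k-nearest-neighbour surrogate, and the candidate grid are all illustrative stand-ins. It shows how the two acquisition functions compared in the paper, expected improvement (EI) and the lower confidence bound (LCB), select the next point to evaluate.

```python
import math
import random

def objective(x):
    # Toy black-box objective to minimize; stands in for the validation
    # error of an ML pipeline as a function of a hyperparameter x in [0, 1].
    return (x - 0.3) ** 2

def surrogate(x, X, Y, k=3):
    # Crude k-nearest-neighbour surrogate returning a mean and spread over
    # the k closest observed points (a stand-in for a random-forest surrogate).
    nearest = sorted(zip(X, Y), key=lambda p: abs(p[0] - x))[:k]
    ys = [y for _, y in nearest]
    mu = sum(ys) / len(ys)
    var = sum((y - mu) ** 2 for y in ys) / len(ys)
    return mu, math.sqrt(var) + 1e-9  # small floor avoids division by zero

def lcb(mu, sigma, kappa=2.0):
    # Lower confidence bound: an optimistic estimate, minimized over candidates.
    return mu - kappa * sigma

def expected_improvement(mu, sigma, y_best):
    # EI for minimization: (y_best - mu) * Phi(z) + sigma * phi(z),
    # with z = (y_best - mu) / sigma; maximized over candidates.
    z = (y_best - mu) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2 * math.pi)
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2)))
    return (y_best - mu) * cdf + sigma * pdf

def smbo(acquisition, n_init=5, n_iter=20, seed=0):
    # Sequential model-based optimization: fit surrogate, pick the candidate
    # favoured by the acquisition function, evaluate it, repeat.
    rng = random.Random(seed)
    X = [rng.random() for _ in range(n_init)]
    Y = [objective(x) for x in X]
    candidates = [i / 200 for i in range(201)]
    for _ in range(n_iter):
        y_best = min(Y)
        if acquisition == "lcb":
            x_next = min(candidates, key=lambda x: lcb(*surrogate(x, X, Y)))
        else:  # "ei"
            x_next = max(
                candidates,
                key=lambda x: expected_improvement(*surrogate(x, X, Y), y_best),
            )
        X.append(x_next)
        Y.append(objective(x_next))
    return min(zip(Y, X))  # (best objective value, best x)

best_y, best_x = smbo("lcb")
print(best_x, best_y)
```

Swapping the acquisition argument between `"lcb"` and `"ei"` lets one compare how many evaluations each strategy needs before reaching a comparable optimum, which is the kind of comparison the paper reports for the real pipeline.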
Keywords: Hyperparameter optimization · Global optimization · Machine learning
Compliance with ethical standards
Conflict of interest
Antonio Candelieri and Francesco Archetti declare that they have no conflict of interest.
This article does not contain any studies with human participants or animals performed by any of the authors.