Cooperative Bayesian Optimization for Imperfect Agents

Khoshvishkaie, Ali; Mikkola, Petrus; Murena, Pierre-Alexandre; Kaski, Samuel

doi:10.1007/978-3-031-43412-9_28

Ali Khoshvishkaie¹²,
Petrus Mikkola¹²,
Pierre-Alexandre Murena^12,13 &
…
Samuel Kaski^12,14

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 14169))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

1488 Accesses
1 Altmetric

Abstract

We introduce a cooperative Bayesian optimization problem for optimizing black-box functions of two variables where two agents choose together at which points to query the function but have only control over one variable each. This setting is inspired by human-AI teamwork, where an AI-assistant helps its human user solve a problem, in this simplest case, collaborative optimization. We formulate the solution as sequential decision-making, where the agent we control models the user as a computationally rational agent with prior knowledge about the function. We show that strategic planning of the queries enables better identification of the global maximum of the function as long as the user avoids excessive exploration. This planning is made possible by using Bayes Adaptive Monte Carlo planning and by endowing the agent with a user model that accounts for conservative belief updates and exploratory sampling of the points to query.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

A Two-Stage Online Approach for Collaborative Multi-agent Planning Under Uncertainty

The Bayesian Search Game

Exploration costs as a means for improving performance in multiagent systems

Article 19 October 2014

Notes

1.
Implementation of our method and source code for the experiments are available at https://github.com/ChessGeek95/AI-assisted-Bayesian-optimization/.

References

Bard, N., et al.: The hanabi challenge: a new frontier for AI research. Artif. Intell. 280, 103216 (2020)
Article MathSciNet MATH Google Scholar
Borji, A., Itti, L.: Bayesian optimization explains human active search. In: Burges, C., Bottou, L., Welling, M., Ghahramani, Z., Weinberger, K. (eds.) Advances in Neural Information Processing Systems, vol. 26. Curran Associates, Inc. (2013)
Google Scholar
Chalkiadakis, G., Elkind, E., Wooldridge, M.: Cooperative game theory: basic concepts and computational challenges. IEEE Intell. Syst. 27(3), 86–90 (2012)
Article MATH Google Scholar
Cox, D.D., John, S.: A statistical method for global optimization. In: Proceedings of the 1992 IEEE International Conference on Systems, Man, and Cybernetics, pp. 1241–1246. IEEE (1992)
Google Scholar
Duan, Q., Shao, C., Qu, L., Shi, Y., Niu, B.: When cooperative co-evolution meets coordinate descent: theoretically deeper understandings and practically better implementations. In: 2019 IEEE Congress on Evolutionary Computation (CEC), pp. 721–730. IEEE (2019)
Google Scholar
El-Gamal, M.A., Grether, D.M.: Are people Bayesian? uncovering behavioral strategies. J. Am. Stat. Assoc. 90(432), 1137–1145 (1995)
Article MATH Google Scholar
Etel, E., Slaughter, V.: Theory of mind and peer cooperation in two play contexts. J. Appl. Dev. Psychol. 60, 87–95 (2019)
Article Google Scholar
Gershman, S.J., Horvitz, E.J., Tenenbaum, J.B.: Computational rationality: a converging paradigm for intelligence in brains, minds, and machines. Science 349(6245), 273–278 (2015)
Article MathSciNet MATH Google Scholar
Guez, A., Silver, D., Dayan, P.: Scalable and efficient bayes-adaptive reinforcement learning based on monte-carlo tree search. J. Artif. Intell. Res. 48, 841–883 (2013)
Article MathSciNet MATH Google Scholar
Helander, M.G.: Handbook of human-computer interaction. Elsevier (2014)
Google Scholar
Hildreth, C.: A quadratic programming procedure. Naval Res. Logistics Q. 4(1), 79–85 (1957)
Article MathSciNet Google Scholar
Jiang, P., Cheng, Y., Liu, J.: Cooperative Bayesian optimization with hybrid grouping strategy and sample transfer for expensive large-scale black-box problems. Knowl.-Based Syst. 254, 109633 (2022)
Article Google Scholar
Kovach, M.: Conservative updating. arXiv preprint arXiv:2102.00152 (2021)
Larson, L., DeChurch, L.A.: Leading teams in the digital age: four perspectives on technology and what they mean for leading teams. Leadersh. Q. 31(1), 101377 (2020)
Article Google Scholar
Mikkola, P., Todorović, M., Järvi, J., Rinke, P., Kaski, S.: Projective preferential bayesian optimization. In: Proceedings of the 37th International Conference on Machine Learning, pp. 6884–6892. PMLR (2020)
Google Scholar
O’Neill, T., McNeese, N., Barron, A., Schelble, B.: Human-autonomy teaming: a review and analysis of the empirical literature. Hum. Factors 64(5), 904–938 (2022)
Article Google Scholar
Potter, M.A., De Jong, K.A.: A cooperative coevolutionary approach to function optimization. In: Davidor, Y., Schwefel, H.-P., Männer, R. (eds.) PPSN 1994. LNCS, vol. 866, pp. 249–257. Springer, Heidelberg (1994). https://doi.org/10.1007/3-540-58484-6_269
Chapter Google Scholar
Rasmussen, C.E., Williams, C.K.I.: Gaussian processes for machine learning. MIT Press, Adaptive Computation and Machine Learning (2006)
MATH Google Scholar
Sears, A., Jacko, J.A.: Human-Computer Interaction Fundamentals. CRC Press (2009)
Google Scholar
Sim, R.H.L., Zhang, Y., Low, B.K.H., Jaillet, P.: Collaborative Bayesian optimization with fair regret. In: Proceedings of the International Conference on Machine Learning, pp. 9691–9701. PMLR (2021)
Google Scholar
Sundin, I., et al.: Human-in-the-loop assisted de novo molecular design. J. Cheminformatics 14(1), 1–16 (2022)
Article Google Scholar
Thurstone, L.L.: A law of comparative judgment. Psychol. Rev. 101(2), 266 (1994)
Article Google Scholar
Tversky, A., Kahneman, D.: Judgment under uncertainty: heuristics and biases: biases in judgments reveal some heuristics of thinking under uncertainty. Science 185(4157), 1124–1131 (1974)
Article Google Scholar

Download references

Acknowledgements

This research was supported by EU Horizon 2020 (HumanE AI NET, 952026) and UKRI Turing AI World-Leading Researcher Fellowship (EP/W002973/1). Computational resources were provided by the Aalto Science-IT project from Computer Science IT. The authors would like to thank Prof. Frans Oliehoek and Dr. Mert Celikok for their help in setting up the project and the reviewers for their insightful comments.

Author information

Authors and Affiliations

Department of Computer Science, Aalto University, Helsinki, Finland
Ali Khoshvishkaie, Petrus Mikkola, Pierre-Alexandre Murena & Samuel Kaski
Hamburg University of Technology, Hamburg, Germany
Pierre-Alexandre Murena
Department of Computer Science, University of Manchester, Manchester, UK
Samuel Kaski

Authors

Ali Khoshvishkaie
View author publications
You can also search for this author in PubMed Google Scholar
Petrus Mikkola
View author publications
You can also search for this author in PubMed Google Scholar
Pierre-Alexandre Murena
View author publications
You can also search for this author in PubMed Google Scholar
Samuel Kaski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pierre-Alexandre Murena .

Editor information

Editors and Affiliations

University of Michigan, Ann Arbor, MI, USA
Danai Koutra
University of Vienna, Vienna, Austria
Claudia Plant
Max Planck Institute for Software Systems, Kaiserslautern, Germany
Manuel Gomez Rodriguez
Politecnico di Torino, Turin, Italy
Elena Baralis
CENTAI, Turin, Italy
Francesco Bonchi

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 2607 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Khoshvishkaie, A., Mikkola, P., Murena, PA., Kaski, S. (2023). Cooperative Bayesian Optimization for Imperfect Agents. In: Koutra, D., Plant, C., Gomez Rodriguez, M., Baralis, E., Bonchi, F. (eds) Machine Learning and Knowledge Discovery in Databases: Research Track. ECML PKDD 2023. Lecture Notes in Computer Science(), vol 14169. Springer, Cham. https://doi.org/10.1007/978-3-031-43412-9_28

Download citation

DOI: https://doi.org/10.1007/978-3-031-43412-9_28
Published: 17 September 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43411-2
Online ISBN: 978-3-031-43412-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the ECML PKDD community (opens in a new tab)

Cooperative Bayesian Optimization for Imperfect Agents

Abstract

Access this chapter

Similar content being viewed by others

A Two-Stage Online Approach for Collaborative Multi-agent Planning Under Uncertainty

The Bayesian Search Game

Exploration costs as a means for improving performance in multiagent systems

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 2607 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Cooperative Bayesian Optimization for Imperfect Agents

Abstract

Access this chapter

Similar content being viewed by others

A Two-Stage Online Approach for Collaborative Multi-agent Planning Under Uncertainty

The Bayesian Search Game

Exploration costs as a means for improving performance in multiagent systems

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

1 Electronic supplementary material

Supplementary material 1 (pdf 2607 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation