Orchestrating Multiagent Learning of Penalty Games

Bazzan, Ana L. C.

doi:10.1007/978-3-642-34459-6_15

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7589)

Included in the following conference series:

Brazilian Symposium on Artificial Intelligence

Abstract

Compared to single-agent learning, reinforcement learning in a multiagent scenario is more challenging, because the space of action combinations that may have to be explored before agents learn an efficient policy is larger. One approach proposed to address this problem is to bias the exploration. We follow this track using an organizational structure in which low-level agents mainly use reinforcement learning while also receiving recommendations from agents that have a broader view. These agents keep a base of cases from which they give such recommendations, thereby orchestrating the process. We show that this approach is able to accelerate and improve learning in penalty games, a special case of coordination games.
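
The idea of biasing exploration with recommendations can be illustrated with a minimal Python sketch. It is not the chapter's algorithm: the 3x3 payoff matrix with penalty `k`, the `trust` probability of following a recommendation, and the one-entry "case base" that remembers the best joint action seen so far are all assumptions made for illustration, as are the function names `penalty_game`, `choose`, and `run`.

```python
import random


def penalty_game(k=-10):
    """Joint payoff matrix of a 3x3 penalty game (assumed form, penalty k <= 0)."""
    return [[10, 0, k],
            [0, 2, 0],
            [k, 0, 10]]


def choose(q, epsilon, rng, recommended=None, trust=0.0):
    """Epsilon-greedy choice; with probability `trust`, follow the recommendation."""
    if recommended is not None and rng.random() < trust:
        return recommended
    if rng.random() < epsilon:
        return rng.randrange(len(q))
    return max(range(len(q)), key=lambda a: q[a])


def run(episodes=2000, k=-10, alpha=0.1, epsilon=0.1, trust=0.0, seed=0):
    """Two independent Q-learners; a hypothetical supervisor recommends
    the best joint action it has stored so far."""
    rng = random.Random(seed)
    payoff = penalty_game(k)
    q1, q2 = [0.0] * 3, [0.0] * 3
    best_case = None  # assumed case base: ((a1, a2), reward) of the best case seen
    total = 0.0
    for _ in range(episodes):
        rec = best_case[0] if best_case else (None, None)
        a1 = choose(q1, epsilon, rng, rec[0], trust)
        a2 = choose(q2, epsilon, rng, rec[1], trust)
        r = payoff[a1][a2]
        # Stateless Q-update: each agent sees only its own action and the shared reward.
        q1[a1] += alpha * (r - q1[a1])
        q2[a2] += alpha * (r - q2[a2])
        if best_case is None or r > best_case[1]:
            best_case = ((a1, a2), r)
        total += r
    return total / episodes, q1, q2
```

Calling `run(trust=0.0)` gives plain independent learners, while a positive value such as `run(trust=0.5)` lets the recommendation steer part of the action choices; comparing the returned average rewards gives a rough feel for the effect under these assumptions.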

Author information

Authors and Affiliations

PPGC / Instituto de Informática, Universidade Federal do Rio Grande do Sul (UFRGS), Caixa Postal 15.064, 91.501-970, Porto Alegre, RS, Brazil
Ana L. C. Bazzan

Editor information

Editors and Affiliations

IME, Department of Computer Science, University of São Paulo, Brazil
Leliane N. Barros
IME, Department of Computer Science, University of São Paulo, São Paulo, Brazil
Marcelo Finger
DInf - Federal University of Paraná, CEP 19031-970, Curitiba, Brazil
Aurora T. Pozo
Federal University of Technology, Paraná, Brazil
Gustavo A. Giménez-Lugo
Department of Informatics, Federal University of Paraná, Curitiba, Brazil
Marcos Castilho

About this paper

Cite this paper

Bazzan, A.L.C. (2012). Orchestrating Multiagent Learning of Penalty Games. In: Barros, L.N., Finger, M., Pozo, A.T., Giménez-Lugo, G.A., Castilho, M. (eds) Advances in Artificial Intelligence - SBIA 2012. SBIA 2012. Lecture Notes in Computer Science, vol 7589. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34459-6_15

DOI: https://doi.org/10.1007/978-3-642-34459-6_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34458-9
Online ISBN: 978-3-642-34459-6
eBook Packages: Computer Science, Computer Science (R0)