Using the Max-Plus Algorithm for Multiagent Decision Making in Coordination Graphs

Kok, Jelle R.; Vlassis, Nikos

doi:10.1007/11780519_1

Jelle R. Kok²² &
Nikos Vlassis²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4020))

Included in the following conference series:

Robot Soccer World Cup

2619 Accesses
18 Citations

Abstract

Coordination graphs offer a tractable framework for cooperative multiagent decision making by decomposing the global payoff function into a sum of local terms. Each agent can in principle select an optimal individual action based on a variable elimination algorithm performed on this graph. This results in optimal behavior for the group, but its worst-case time complexity is exponential in the number of agents, and it can be slow in densely connected graphs. Moreover, variable elimination is not appropriate for real-time systems as it requires that the complete algorithm terminates before a solution can be reported. In this paper, we investigate the max-plus algorithm, an instance of the belief propagation algorithm in Bayesian networks, as an approximate alternative to variable elimination. In this method the agents exchange appropriate payoff messages over the coordination graph, and based on these messages compute their individual actions. We provide empirical evidence that this method converges to the optimal solution for tree-structured graphs (as shown by theory), and that it finds near optimal solutions in graphs with cycles, while being much faster than variable elimination.

Download to read the full chapter text

Chapter PDF

Computational social choice for coordination in agent networks

Article 13 June 2015

Graph Patterns, Reinforcement Learning and Models of Reputation for Improving Coalition Formation in Collaborative Multi-agent Systems

Multi-Agent Control: A Graph-Theoretic Perspective

Article 26 October 2021

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Weiss, G. (ed.): Multiagent Systems: a Modern Approach to Distributed Artificial Intelligence. MIT Press, Cambridge (1999)
Google Scholar
Vlassis, N.: A concise introduction to multiagent systems and distributed AI, Informatics Institute, University of Amsterdam (2003)
Google Scholar
Kitano, H., Asada, M., Kuniyoshi, Y., Noda, I., Osawa, E.: RoboCup: The Robot World Cup Initiative. In: Proc. of the IJCAI 1995 Workshop on Entertainment and AI/Alife (1995)
Google Scholar
Guestrin, C., Koller, D., Parr, R.: Multiagent planning with factored MDPs. In: Advances in Neural Information Processing Systems, vol. 14. MIT Press, Cambridge (2002)
Google Scholar
Kok, J.R., Spaan, M.T.J., Vlassis, N.: Non-communicative multi-robot coordination in dynamic environments. Robotics and Autonomous Systems 50, 99–114 (2005)
Article Google Scholar
Vlassis, N., Elhorst, R., Kok, J.R.: Anytime algorithms for multiagent decision making using coordination graphs. In: Proc. of the International Conference on Systems, Man and Cybernetics, The Hague, The Netherlands (2004)
Google Scholar
Pearl, J.: Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Francisco (1988)
Google Scholar
Yedidia, J., Freeman, W., Weiss, Y.: Understanding belief propagation and its generalizations. In: Exploring Artificial Intelligence in the New Millennium, pp. 239–269. Morgan Kaufmann Publishers Inc., San Francisco (2003)
Google Scholar
Wainwright, M., Jaakkola, T., Willsky, A.: Tree consistency and bounds on the performance of the max-product algorithm and its generalizations. Statistics and Computing 14, 143–166 (2004)
Article MathSciNet Google Scholar
Zhang, N.L., Poole, D.: Exploiting causal independence in bayesian network inference. Journal of Artificial Intelligence Research 5, 301–328 (1996)
MATH MathSciNet Google Scholar
Bertelé, U., Brioschir, F.: Nonserial dynamic programming. Academic Press, London (1972)
MATH Google Scholar
Wainwright, M., Jaakkola, T., Willsky, A.: Tree consistency and bounds on the performance of the max-product algorithm and its generalizations. Technical report, P-2554, LIDS-MIT (2002)
Google Scholar
Crick, C., Pfeffer, A.: Loopy belief propagation as a basis for communication in sensor networks. In: Proc. of the 19th Conference on Uncertainty in AI (2003)
Google Scholar
Murphy, K., Weiss, Y., Jordan, M.: Loopy belief propagation for approximate inference: An empirical study. In: Proc. 15th Conf. on Uncertainty in Artificial Intelligence, Stockholm, Sweden (1999)
Google Scholar
Loeliger, H.A.: An introduction to factor graphs. IEEE Signal Proc. Mag., 28–41 (2004)
Google Scholar
Kok, J.R., Vlassis, N.: Sparse Cooperative Q-learning. In: Greiner, R., Schuurmans, D. (eds.) Proc. of the 21st Int. Conf. on Machine Learning, pp. 481–488. ACM Press, New York (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Informatics Institute, University of Amsterdam, The Netherlands
Jelle R. Kok & Nikos Vlassis

Authors

Jelle R. Kok
View author publications
You can also search for this author in PubMed Google Scholar
Nikos Vlassis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Fraunhofer Institute for Autonomous Intelligent Systems (AIS), D-53754, Sankt Augustin, Germany
Ansgar Bredenfeld
Intelligent Systems Division, National Institute of Standards and Technology, USA
Adam Jacoff
Information Technology Research Institute National Institute of Advanced Industrial Science and Technology,, 1-1-1 Umezono, Tsukuba, Ibaraki, Japan
Itsuki Noda
Dept. of Adaptive Machine Systems, Graduate School of Engineering, Osaka University,
Yasutake Takahashi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kok, J.R., Vlassis, N. (2006). Using the Max-Plus Algorithm for Multiagent Decision Making in Coordination Graphs. In: Bredenfeld, A., Jacoff, A., Noda, I., Takahashi, Y. (eds) RoboCup 2005: Robot Soccer World Cup IX. RoboCup 2005. Lecture Notes in Computer Science(), vol 4020. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11780519_1

Download citation

DOI: https://doi.org/10.1007/11780519_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-35437-6
Online ISBN: 978-3-540-35438-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Using the Max-Plus Algorithm for Multiagent Decision Making in Coordination Graphs

Abstract

Chapter PDF

Similar content being viewed by others

Computational social choice for coordination in agent networks

Graph Patterns, Reinforcement Learning and Models of Reputation for Improving Coalition Formation in Collaborative Multi-agent Systems

Multi-Agent Control: A Graph-Theoretic Perspective

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Using the Max-Plus Algorithm for Multiagent Decision Making in Coordination Graphs

Abstract

Chapter PDF

Similar content being viewed by others

Computational social choice for coordination in agent networks

Graph Patterns, Reinforcement Learning and Models of Reputation for Improving Coalition Formation in Collaborative Multi-agent Systems

Multi-Agent Control: A Graph-Theoretic Perspective

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation