Bounded Rationality in Multiagent Systems Using Decentralized Metareasoning

Carlin, Alan; Zilberstein, Shlomo

doi:10.1007/978-3-642-24647-0_1

Alan Carlin⁶ &
Shlomo Zilberstein⁶

Part of the book series: Intelligent Systems Reference Library ((ISRL,volume 28))

2 Citations

Abstract

Metareasoning has been used as a means for achieving bounded rationality by optimizing the tradeoff between the cost and value of the decision making process. Effective monitoring techniques have been developed to allow agents to stop their computation at the “right” time so as to optimize the overall time-dependent utility of the decision. However, these methods were designed for a single decision maker. In this chapter, we analyze the problems that arise when several agents solve components of a larger problem, each using an anytime algorithm. Metareasoning is more challenging in this case because each agent is uncertain about the progress made so far by the others. We develop a formal framework for decentralized monitoring of decision making, establish the complexity of several interesting variants of the problem, and propose solution techniques for each case.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Robust Networked Multiagent Optimization: Designing Agents to Repair Their Own Utility Functions

Article 01 September 2022

Severity-sensitive norm-governed multi-agent planning

Article Open access 07 July 2017

Distributed Line Search for Multiagent Convex Optimization

References

Anderson, M.: A review of recent research in metareasoning and metalearning. AI Magazine 28(1), 7–16 (2007)
Google Scholar
Becker, R., Carlin, A., Lesser, V., Zilberstein, S.: Analyzing myopic approaches for multi-agent communication. Computational Intelligence 25(1), 31–50 (2009)
Article MathSciNet Google Scholar
Becker, R., Zilberstein, S., Lesser, V., Goldman, C.: Solving transition independent decentralized Markov decision processes. Journal of Artificial Intelligence Research 22, 423–455 (2004)
MathSciNet MATH Google Scholar
Bernstein, D., Givan, R., Immerman, N., Zilberstein, S.: The complexity of decentralized control of Markov decision processes. Mathematics of Operations Research 27(4), 819–840 (2002)
Article MathSciNet MATH Google Scholar
Carlin, A., Zilberstein, S.: Myopic and non-myopic communication under partial observability. In: Proceedings of the 2009 IEEE/WIC/ACM International Conference on Intelligent Agent Technology (2009)
Google Scholar
Cheng, S., Raja, A., Lesser, V.: Multiagent Meta-level Control for a Network of Weather Radars. In: Proceedings of 2010 IEEE/WIC/ACM International Conference on Intelligent Agent Technology, pp. 157–164 (2010)
Google Scholar
Cox, M., Raja, A.: Metareasoning: Thinking about thinking. MIT Press, Cambridge (2011)
Google Scholar
Dean, T., Boddy, M.: An analysis of time-dependent planning. In: Proceedings of the Seventh National Conference on Artificial Intelligence, pp. 49–54 (1988)
Google Scholar
Ford, L., Fulkerson, D.: Maximal flow through a network. Canadian Journal of Mathematics 8, 399–404 (1956)
Article MathSciNet MATH Google Scholar
Gigerenzer, G., Todd, P.: ABC Research Group: Simple Heuristics That Make Us Smart. Oxford University Press, Oxford (1999)
Google Scholar
Goldman, C., Zilberstein, S.: Decentralized control of cooperative systems: Categorization and complexity analysis. Journal of Artificial Intelligence Research 22, 143–174 (2004)
MathSciNet MATH Google Scholar
Hansen, E., Zilberstein, S.: Monitoring and control of anytime algorithms: A dynamic programming approach. Artificial Intelligence 126(1-2), 139–157 (2001)
Article MathSciNet MATH Google Scholar
Horvitz, E.: Reasoning about beliefs and actions under computational resource constraints. In: Proceedings of Third Workshop on Uncertainty in Artificial Intelligence, pp. 429–444 (1987)
Google Scholar
Laasri, B., Laasri, H., Lesser, V.: An analysis of negotiation and its role for coordinating cooperative distributed problem solvers. In: Proceedings of General Conference on Second Generation Expert Systems; Eleventh International Conference on Expert Systems and their Applications, vol. 2, pp. 81–94 (1991)
Google Scholar
Petrik, M., Zilberstein, S.: A bilinear approach for multiagent planning. Journal of Artificial Intelligence Research 35, 235–274 (2009)
MathSciNet MATH Google Scholar
Puterman, M.: Markov decision processes, Discrete stochastic dynamic programming. John Wiley and Sons Inc., Chichester (2005)
MATH Google Scholar
Raja, A., Lesser, V.: A framework for meta-level control in multi-agent systems. Autonomous Agents and Multi-Agent Systems 15, 147–196 (2007)
Article Google Scholar
Russell, S., Subramanian, D., Parr, R.: Provably bounded optimal agents. In: Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence, pp. 575–609 (1993)
Google Scholar
Russell, S., Wefald, E.: Principles of metareasoning. In: Proceedings of the First International Conference on Principles of Knowledge Representation and Reasoning, pp. 400–411 (1989)
Google Scholar
Smith, T., Simmons, R.: Heuristic search value iIteration for POMDPs. In: Proceedings of the International Conference on Uncertainty in Artificial Intelligence, pp. 520–527 (2004)
Google Scholar
Sandholm, T.W.: Terminating decision algorithms optimally. In: Rossi, F. (ed.) CP 2003. LNCS, vol. 2833, pp. 950–955. Springer, Heidelberg (2003)
Chapter Google Scholar
Schut, M., Wooldridge, M.: The control of reasoning in resource-bounded agents. Knowledge Engineering Review 16(3), 215–240 (2001)
Article Google Scholar
Simon, H.: A behavioral model of rational choice. Quaterly Journal of Economics 69, 99–118 (1955)
Article Google Scholar
Tsitsiklis, J., Athans, M.: On the complexity of decentralized decision making and detection problems. IEEE Transactions on Automatic Control 30(5), 440–446 (1985)
Article MathSciNet MATH Google Scholar
Wald, A.: Sequential tests of statistical hypotheses. The Annals of Mathematical Statistics 16, 117–186 (1945)
Article MathSciNet MATH Google Scholar
Wellman, M.: Formulation of Tradeoffs in Planning under Uncertainty. Pitman, London (1990)
Google Scholar
Xuan, P., Lesser, V., Zilberstein, S.: Communication decisions in multi-agent cooperation: model and experiments. In: Proceedings of the Fifth International Conference on Autonomous Agents, pp. 616–623 (2001)
Google Scholar
Zilberstein, S.: Operational rationality through compilation of anytime algorithms. Ph.D. Dissertation, Computer Science Division. University of California, Berkeley (1993)
Google Scholar
Zilberstein, S., Russell, S.: Optimal composition of real-time systems. Artificial Intelligence 82(1-2), 181–213 (1996)
Article MathSciNet Google Scholar
Zilberstein, S.: Metareasoning and bounded rationality. In: Cox, M., Raja, A. (eds.) Metareasoning: Thinking about Thinking. MIT Press, Cambridge (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Massachusetts, Amherst, MA, 01003
Alan Carlin & Shlomo Zilberstein

Authors

Alan Carlin
View author publications
You can also search for this author in PubMed Google Scholar
Shlomo Zilberstein
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Adaptive Systems, Institute of Information Theory and Automation of the ASCR, PO Box 18, 18208, Praha 8, Czech Republic
Tatiana Valentine Guy
Institute of Information Theory and Automation of the ASCR, PO Box 18, 18208, Praha 8, Czech Republic
Miroslav Kárný
Intelligent Systems Division Ames Research Center, NASA, Mail Stop 269-1, 94035, Moffett Field, CA, USA
David H. Wolpert

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Carlin, A., Zilberstein, S. (2012). Bounded Rationality in Multiagent Systems Using Decentralized Metareasoning. In: Guy, T.V., Kárný, M., Wolpert, D.H. (eds) Decision Making with Imperfect Decision Makers. Intelligent Systems Reference Library, vol 28. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24647-0_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-24647-0_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24646-3
Online ISBN: 978-3-642-24647-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Bounded Rationality in Multiagent Systems Using Decentralized Metareasoning

Abstract

Access this chapter

Preview

Similar content being viewed by others

Robust Networked Multiagent Optimization: Designing Agents to Repair Their Own Utility Functions

Severity-sensitive norm-governed multi-agent planning

Distributed Line Search for Multiagent Convex Optimization

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Bounded Rationality in Multiagent Systems Using Decentralized Metareasoning

Abstract

Access this chapter

Preview

Similar content being viewed by others

Robust Networked Multiagent Optimization: Designing Agents to Repair Their Own Utility Functions

Severity-sensitive norm-governed multi-agent planning

Distributed Line Search for Multiagent Convex Optimization

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation