Abstract
One of the most common applications of human intelligence is social interaction, where people must make effective decisions despite uncertainty about the potential behavior of others around them. Reinforcement learning (RL) provides one method for agents to acquire knowledge about such interactions. We investigate different methods of multiagent reinforcement learning within the Sigma cognitive architecture. We leverage Sigma’s architectural mechanism for gradient descent to realize four different approaches to multiagent learning: (1) with no explicit model of the other agent, (2) with a model of the other agent as following an unknown stationary policy, (3) with prior knowledge of the other agent’s possible reward functions, and (4) through inverse reinforcement learning (IRL) of the other agent’s reward function. While the first three variations re-create existing approaches from the literature, the fourth represents a novel combination of RL and IRL for social decision-making. We show how all four styles of adaptive Theory of Mind are realized through the same gradient descent algorithm within Sigma, and we illustrate their behavior within an abstract negotiation task.
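To make the second approach concrete, the following is a minimal tabular sketch of Q-learning combined with an empirical model of the other agent as a stationary policy (in the spirit of fictitious play). The two-state coordination game, payoffs, and the hidden opponent policy here are illustrative stand-ins, not the paper's negotiation task, and Sigma realizes this style of learning via gradient descent over a factor graph rather than the tabular updates shown.

```python
import random
from collections import defaultdict

def learn(episodes=2000, alpha=0.1, epsilon=0.1, seed=0):
    """Q-learning with a learned stationary model of the other agent.

    Illustrative single-step game: two states, two actions per agent,
    reward 1 for matching the opponent's action, 0 otherwise.
    """
    rng = random.Random(seed)
    actions = [0, 1]
    # Laplace-smoothed counts of the opponent's observed actions per state.
    opp_counts = defaultdict(lambda: [1, 1])
    # Q[(state, my_action, opp_action)], initialized to 0.
    Q = defaultdict(float)

    def opp_policy(state):
        # Hidden (assumed stationary) opponent: mirrors the state.
        return 1 if state == 1 else 0

    def payoff(my_a, opp_a):
        # Coordination payoff: reward for matching the opponent.
        return 1.0 if my_a == opp_a else 0.0

    for _ in range(episodes):
        state = rng.choice([0, 1])
        counts = opp_counts[state]
        total = sum(counts)

        def expected_value(a):
            # Expected Q of my action a under the learned opponent model.
            return sum(counts[o] / total * Q[(state, a, o)] for o in actions)

        # Epsilon-greedy action selection against the opponent model.
        if rng.random() < epsilon:
            my_a = rng.choice(actions)
        else:
            my_a = max(actions, key=expected_value)

        opp_a = opp_policy(state)
        r = payoff(my_a, opp_a)
        # Single-step task, so the TD target is just the immediate reward.
        Q[(state, my_a, opp_a)] += alpha * (r - Q[(state, my_a, opp_a)])
        # Update the empirical opponent model.
        opp_counts[state][opp_a] += 1

    return Q, opp_counts
```

Under these assumptions, the agent's opponent model concentrates on the action the hidden policy actually plays in each state, and the Q-values for the matching responses converge toward 1, so the greedy policy learns to coordinate.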
© 2014 Springer International Publishing Switzerland
Pynadath, D.V., Rosenbloom, P.S., Marsella, S.C. (2014). Reinforcement Learning for Adaptive Theory of Mind in the Sigma Cognitive Architecture. In: Goertzel, B., Orseau, L., Snaider, J. (eds) Artificial General Intelligence. AGI 2014. Lecture Notes in Computer Science(), vol 8598. Springer, Cham. https://doi.org/10.1007/978-3-319-09274-4_14
Print ISBN: 978-3-319-09273-7
Online ISBN: 978-3-319-09274-4