Networked reinforcement learning

Oku, Makito; Aihara, Kazuyuki

doi:10.1007/s10015-008-0565-x

Networked reinforcement learning

Original Article
Published: 14 December 2008

Volume 13, pages 112–115, (2008)
Cite this article

Artificial Life and Robotics Aims and scope Submit manuscript

Makito Oku¹ &
Kazuyuki Aihara^1,2,3

67 Accesses
Explore all metrics

Abstract

Recently, many models of reinforcement learning with hierarchical or modular structures have been proposed. They decompose a task into simpler subtasks and solve them by using multiple agents. However, these models impose certain restrictions on the topological relations of agents and so on. By relaxing these restrictions, we propose networked reinforcement learning, where each agent in a network acts autonomously by regarding the other agents as a part of its environment. Although convergence to an optimal policy is no longer assured, by means of numerical simulations, we show that our model functions appropriately, at least in certain simple situations.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Emergence in complex networks of simple agents

Article Open access 23 May 2023

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A practical guide to multi-objective reinforcement learning and planning

Article Open access 13 April 2022

References

Sutton RS, Barto AG (1998) Reinforcement learning: An introduction. MIT Press
Bakker B, Schmidhuber J (2004) Hierarchical reinforcement learning based on subgoal discovery and subpolicy specialization. In: Proceedings of the 8-th Conference on Intelligent Autonomous Systems, pp 438–445
Dayan P, Hinton GE (1993) Feudal reinforcement learning. Adv Neural Inf Process Syst 5:271–278
Google Scholar
Dietterich TG (2000) Hierarchical reinforcement learning with the MAXQ value function decomposition. J Artif Intell Res 13:227–303
MATH MathSciNet Google Scholar
Doya K, Samejima K, Katagiri K, et al (2002) Multiple model-based reinforcement learning. Neural Comput 14:1347–1369
Article MATH Google Scholar
Parr R, Russell S (1998) Reinforcement learning with hierarchies of machines. Adv Neural Inf Process Syst 10:1043–1049
Google Scholar
Singh SP (1992) Transfer of learning by composing solutions of elemental sequential tasks. Mach Learn 8:323–339
MATH Google Scholar
Sutton RS, Precup D, Singh S (1999) Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artif Intell 112:181–211
Article MATH MathSciNet Google Scholar
Cassandra AR, Kaelbling LP, Littman ML (1994) Acting optimally in partially observable stochastic domains. In: Proceedings of the Twelfth National Conference on Artificial Intelligence, pp 1023–1028
Dolgov D, Durfee E (2004) Graphical models in local, asymmetric multi-agent Markov decision processes. In: Proceedings of the Third International Joint Conference on Autonomous Agents and Multiagent Systems, pp 956–963
Nair R, Varakantham P, Tambe M, et al (2005) Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In: Proceedings of the Twentieth National Conference on Artificial Intelligence, pp 133–139

Download references

Author information

Authors and Affiliations

Department of Mathematical Informatics Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan
Makito Oku & Kazuyuki Aihara
Institute of Industrial Science, The University of Tokyo, Tokyo, Japan
Kazuyuki Aihara
Aihara Complexity Modelling Project, ERATO, Japan Science and Technology Agency (JST), Tokyo, Japan
Kazuyuki Aihara

Authors

Makito Oku
View author publications
You can also search for this author in PubMed Google Scholar
Kazuyuki Aihara
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Makito Oku.

Additional information

This work was presented in part at the 13th International Symposium on Artificial Life and Robotics, Oita, Japan, January 31–February 2, 2008

About this article

Cite this article

Oku, M., Aihara, K. Networked reinforcement learning. Artif Life Robotics 13, 112–115 (2008). https://doi.org/10.1007/s10015-008-0565-x

Download citation

Received: 03 July 2008
Accepted: 03 July 2008
Published: 14 December 2008
Issue Date: December 2008
DOI: https://doi.org/10.1007/s10015-008-0565-x

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Networked reinforcement learning

Abstract

Access this article

Similar content being viewed by others

Emergence in complex networks of simple agents

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A practical guide to multi-objective reinforcement learning and planning

References

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Key words

Navigation

Networked reinforcement learning

Abstract

Access this article

Similar content being viewed by others

Emergence in complex networks of simple agents

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

A practical guide to multi-objective reinforcement learning and planning

References

Author information

Authors and Affiliations

Corresponding author

Additional information

About this article

Cite this article

Share this article

Key words

Search

Navigation