
Improvement of the Relaxation Procedure in Concurrent Q-Learning

  • Conference paper
Neural Information Processing (ICONIP 2013)

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 8227)


Abstract

In this paper, we point out problems in concurrent Q-learning (CQL), a technique for adapting to dynamic environments in reinforcement learning, and propose a modification of its relaxation procedure. We apply the proposed algorithm to maze navigation tasks and examine how the original CQL and the proposed algorithm behave under environmental changes such as relocated goals and newly introduced obstacles.
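
The abstract does not reproduce the update rules, so the sketch below only illustrates the general idea behind concurrent Q-learning and its relaxation step, in the spirit of Kaelbling's goal-conditioned learning and Ollington and Vamplew's CQL (references 4 and 5 below): action values are learned toward every state as a potential goal, and a relaxation step propagates shortest-path consistency between goals. The grid size, reward convention, deterministic-transition assumption, and the exact form of the relaxation are illustrative choices, not the modified procedure proposed in this paper.

import numpy as np

# Illustrative sketch only: a tabular agent learns action values toward every
# state as a potential goal, and a relaxation step enforces a triangle-
# inequality-style consistency between goals. Sizes, rewards, and the
# relaxation rule are assumptions, not the paper's proposed procedure.

N_STATES, N_ACTIONS = 25, 4          # e.g. a 5x5 grid maze
ALPHA, GAMMA = 0.1, 0.95

# Q[s, a, g]: value of taking action a in state s when state g is the goal.
Q = np.zeros((N_STATES, N_ACTIONS, N_STATES))

def concurrent_update(s, a, s_next):
    """One observed transition updates the values toward every goal at once.
    With this convention the converged value behaves like
    V(s, g) ~ gamma**d(s, g), where d is the shortest-path distance."""
    for g in range(N_STATES):
        target = GAMMA * (1.0 if s_next == g else Q[s_next, :, g].max())
        Q[s, a, g] += ALPHA * (target - Q[s, a, g])

def relax(s, a, s_next):
    """Relaxation sketch (deterministic transitions assumed): since
    V(s, g) ~ gamma**d(s, g), the triangle inequality
    d(s, g) <= d(s, g') + d(g', g) becomes V(s, g) >= V(s, g') * V(g', g).
    Raise any Q(s, a, .) that violates the bound implied by s -> s_next."""
    V = Q.max(axis=1)                                   # V[s, g]
    # bound[g] = gamma * max over g' of V(s_next, g') * V(g', g)
    bound = GAMMA * (V[s_next][:, None] * V).max(axis=0)
    Q[s, a, :] = np.maximum(Q[s, a, :], bound)

# Typical use after each real step in the maze:
#   concurrent_update(s, a, s_next)
#   relax(s, a, s_next)
# When the goal moves to g_new, the agent acts greedily on Q[:, :, g_new]
# without relearning from scratch.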

References

  1. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press (1998)

  2. Morris, R.G.M.: Spatial localization does not require the presence of local cues. Learning and Motivation 12, 239–260 (1981)

  3. Foster, D.J., Morris, R.G.M., Dayan, P.: A model of hippocampally dependent navigation, using the temporal difference learning rule. Hippocampus 10, 1–16 (2000)

  4. Kaelbling, L.P.: Learning to achieve goals. In: Proceedings of the Thirteenth International Joint Conference on Artificial Intelligence (1993)

  5. Ollington, R.B., Vamplew, P.W.: Concurrent Q-learning: Reinforcement learning for dynamic goals and environments. International Journal of Intelligent Systems 20, 1037–1052 (2005)

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Murakami, K., Ozeki, T. (2013). Improvement of the Relaxation Procedure in Concurrent Q-Learning. In: Lee, M., Hirose, A., Hou, ZG., Kil, R.M. (eds) Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, vol 8227. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-42042-9_11

  • DOI: https://doi.org/10.1007/978-3-642-42042-9_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-42041-2

  • Online ISBN: 978-3-642-42042-9

  • eBook Packages: Computer Science, Computer Science (R0)
