One-Shot Supervised Reinforcement Learning for Multi-targeted Tasks: RL-SAS

Takeuchi, Johane; Tsujino, Hiroshi

doi:10.1007/978-3-642-15822-3_26

One-Shot Supervised Reinforcement Learning for Multi-targeted Tasks: RL-SAS

Johane Takeuchi¹⁹ &
Hiroshi Tsujino¹⁹

Conference paper

1798 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6353))

Abstract

Our ultimate goal is to realize artificial agents, which can be taught and can behave appropriately in volatile environments. Supervised reinforcement learning (SRL) will play a crucial role in this endeavor as SRL enables agents to function in situations that partly deviate from what has been taught. Currently reinforcement learning (RL) is typically implemented for single tasks, which restricts teaching plural behavioral sequences. Herein we introduce a SRL scheme, which exploits explicit state-action lists to facilitate reuse of learned behavioral sequences. By combining the constructed learning system with a standard RL algorithm, the system could solve a problem in one-shot for the supervised portions and use RL to compensate for the unsupervised portions.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Google Scholar
Singh, S., Barto, A.G., Chentanez, N.: Intrinsically motivated reinforcement learning. In: Advances in Neural Information Processing Systems, vol. 17, pp. 1281–1288. MIT Press, Cambridge (2005)
Google Scholar
Sutton, R.S., Precup, D., Singh, S.P.: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112, 181–211 (1999)
Article MATH MathSciNet Google Scholar
Pearl, J.: Heuristics: intelligent search strategies for computer problem solving. Addison-Wesley Pub., Boston (1984)
Google Scholar

Download references

Author information

Authors and Affiliations

Honda Research Institute Japan Co., Ltd., 8-1 Honcho, Wako-shi, Saitama, 351-0188, Japan
Johane Takeuchi & Hiroshi Tsujino

Authors

Johane Takeuchi
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Tsujino
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, TEI of Thessaloniki, 57400, Sindos, Greece
Konstantinos Diamantaras
School of Physics, Astronomy, and Informatics, Department of Informatics, Nicolaus Copernicus University, ul. Grudziadzka 5, 87-100, Torun, Poland
Wlodek Duch
Department of Forestry and Management of the Environment and Natural Resources, Democritus University of Thrace, Pantazidou 193, 68200, Orestiada Thrace, Greece
Lazaros S. Iliadis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Takeuchi, J., Tsujino, H. (2010). One-Shot Supervised Reinforcement Learning for Multi-targeted Tasks: RL-SAS. In: Diamantaras, K., Duch, W., Iliadis, L.S. (eds) Artificial Neural Networks – ICANN 2010. ICANN 2010. Lecture Notes in Computer Science, vol 6353. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15822-3_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-15822-3_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15821-6
Online ISBN: 978-3-642-15822-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics