Emergence of Higher Exploration in Reinforcement Learning Using a Chaotic Neural Network

Goto, Yuki; Shibata, Katsunari

doi:10.1007/978-3-319-46687-3_5

Conference paper
First Online: 29 September 2016

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 9947)

Abstract

Aiming for the emergence of higher functions such as “logical thinking”, our group has proposed a novel form of reinforcement learning in which exploration is driven by the internal dynamics of a chaotic neural network. In this paper, using an obstacle avoidance task, we examine whether the level of exploration changes from “lower” to “higher”, that is, from the “motor level” to a “more abstract level”, as the dynamics grow through learning. The agent learned to reach the goal while avoiding the obstacle, and there was an area in which the agent appeared to choose randomly between passing the obstacle on its right side and on its left side. This result suggests the possibility of “higher exploration”, although as learning progressed the agent sometimes collided with the obstacle and was trapped for a while.
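
The idea of using a network's own dynamics as the source of exploration can be illustrated with a minimal sketch. It is not the authors' model: the update rule x <- tanh(W x), the network size, the gain values, and all function names below are assumptions chosen for illustration only. The sketch runs two copies of a random recurrent network from almost identical states, with no external noise. At low gain the two copies converge, so the network produces no variability of its own. At high gain such networks are typically chaotic, so the tiny difference is amplified and the outputs become irregular, which is the kind of internally generated variability that could stand in for random exploration.

```python
import math
import random


def make_weights(n, gain, rng):
    """Random recurrent weights with standard deviation gain / sqrt(n)."""
    scale = gain / math.sqrt(n)
    return [[rng.gauss(0.0, scale) for _ in range(n)] for _ in range(n)]


def step(weights, state):
    """One discrete-time update x <- tanh(W x); no external noise is added."""
    return [math.tanh(sum(w * x for w, x in zip(row, state))) for row in weights]


def distance(a, b):
    """Euclidean distance between two network states."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))


def divergence(gain, n=60, steps=200, eps=1e-8, seed=0):
    """Run two copies of the same network from states that differ by eps
    in a single unit and return their distance after `steps` updates."""
    rng = random.Random(seed)
    weights = make_weights(n, gain, rng)
    a = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    b = list(a)
    b[0] += eps  # tiny perturbation of one unit
    for _ in range(steps):
        a = step(weights, a)
        b = step(weights, b)
    return distance(a, b)


if __name__ == "__main__":
    # Hypothetical gain values: 0.2 (contracting) and 3.0 (strongly coupled).
    for gain in (0.2, 3.0):
        print(f"gain={gain}: final distance = {divergence(gain):.3e}")
```

In a learning setup of the kind the abstract describes, some units of such a network would be read out as actions and the weights would be adapted by reinforcement learning; the sketch covers only the noise-free variability, not the learning rule.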

Acknowledgement

This work was supported by JSPS KAKENHI Grant Number 15K00360.

Author information

Authors and Affiliations

Department of Electrical and Electronic Engineering, Oita University, 700 Dannoharu, Oita, 870-1192, Japan
Yuki Goto & Katsunari Shibata

Corresponding author

Correspondence to Yuki Goto.

Editor information

Editors and Affiliations

The University of Tokyo, Tokyo, Japan
Akira Hirose
Kobe University, Kobe, Japan
Seiichi Ozawa
Okinawa Institute of Science and Technology Graduate University, Onna, Japan
Kenji Doya
Nara Institute of Science and Technology, Ikoma, Japan
Kazushi Ikeda
Kyungpook National University, Daegu, Korea (Republic of)
Minho Lee
Chinese Academy of Sciences, Beijing, China
Derong Liu

About this paper

Cite this paper

Goto, Y., Shibata, K. (2016). Emergence of Higher Exploration in Reinforcement Learning Using a Chaotic Neural Network. In: Hirose, A., Ozawa, S., Doya, K., Ikeda, K., Lee, M., Liu, D. (eds) Neural Information Processing. ICONIP 2016. Lecture Notes in Computer Science, vol 9947. Springer, Cham. https://doi.org/10.1007/978-3-319-46687-3_5

DOI: https://doi.org/10.1007/978-3-319-46687-3_5
Published: 29 September 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46686-6
Online ISBN: 978-3-319-46687-3
eBook Packages: Computer Science, Computer Science (R0)
