Developing reinforcement learning for adaptive co-construction of continuous high-dimensional state and action spaces

Nagayoshi, Masato; Murao, Hajime; Tamaki, Hisashi

doi:10.1007/s10015-012-0041-5

Original Article
Published: 09 August 2012

Volume 17, pages 204–210, (2012)

Artificial Life and Robotics

Masato Nagayoshi¹,
Hajime Murao² &
Hisashi Tamaki³

Abstract

Engineers and researchers are paying increasing attention to reinforcement learning (RL) as a key technique for realizing adaptive and autonomous decentralized systems. In general, however, it is not easy to put RL into practical use. Our approach mainly deals with the problem of designing state and action spaces. Previously, an adaptive state space construction method called a “state space filter” and an adaptive action space construction method called “switching RL” have been proposed, each under the assumption that the other space has been fixed. We then reconstituted these two construction methods as a single combined method that mimics an infant’s perceptual and motor development, and proposed a method based on introducing and referring to “entropy”. In this paper, a computational experiment was conducted using a so-called “robot navigation problem” with a three-dimensional continuous state space and a two-dimensional continuous action space, which is more complicated than a so-called “path planning problem”. The results confirm the validity of the proposed method.
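The abstract does not say how “entropy” is computed or used, so the following is only a minimal sketch of one plausible reading: the normalized Shannon entropy of an action-selection distribution within a single discretized state. The function names, the softmax selection rule, and the `temperature` parameter are all assumptions made for illustration and are not taken from the paper.

```python
import math


def softmax(preferences, temperature=1.0):
    """Turn action preferences into selection probabilities.

    Assumption: a Boltzmann (softmax) rule; the paper's actual
    action-selection rule is not given in the abstract.
    """
    highest = max(preferences)  # subtract the max for numerical stability
    exps = [math.exp((p - highest) / temperature) for p in preferences]
    total = sum(exps)
    return [e / total for e in exps]


def normalized_entropy(probs):
    """Shannon entropy of a discrete distribution, scaled to [0, 1].

    1.0 means the actions are chosen uniformly (no preference learned);
    0.0 means a single action is always chosen.
    """
    n = len(probs)
    if n < 2:
        return 0.0
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(n)


if __name__ == "__main__":
    # Hypothetical preference values for four discrete actions in one state.
    undecided = softmax([0.0, 0.0, 0.0, 0.0])
    decided = softmax([5.0, 0.0, 0.0, 0.0])
    print(normalized_entropy(undecided))
    print(normalized_entropy(decided))
```

A value near 1 indicates that the agent has not yet distinguished between actions in that state, and a value near 0 indicates a nearly deterministic choice. How the proposed method turns such a measure into changes to the state or action space is not stated in the abstract, so the sketch stops at the measurement.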

References

Sutton RS, Barto AG (1998) Reinforcement Learning. A Bradford Book. MIT Press, Cambridge
Kimura H, Kobayashi S (2000) An analysis of actor–critic algorithms using eligibility traces: reinforcement learning with imperfect value functions. JSAI J 15(2):267–275 (in Japanese)
Nagayoshi M, Murao H, Tamaki H (2006) A state space filter for reinforcement learning. Proc AROB 11th’06, pp 615–618 (GS1-3)
Nagayoshi M, Murao H, Tamaki H (2010) A reinforcement learning with switching controllers for continuous action space. Artif Life Robotics 15(1):97–100
Nagayoshi M, Murao H, Tamaki H (2011) Adaptive co-construction of state and action spaces in reinforcement learning. Artif Life Robotics 16(1):48–52

Author information

Authors and Affiliations

Niigata College of Nursing, 240, Shinnan, Joetsu, 943-0147, Japan
Masato Nagayoshi
Faculty of Cross-Cultural Studies, Kobe University, 1-2-1, Tsurukabuto, Nada-ku, Kobe, 657-8501, Japan
Hajime Murao
Graduate School of Engineering, Kobe University, Rokko-dai, Nada-ku, Kobe, 657-8501, Japan
Hisashi Tamaki

Corresponding author

Correspondence to Masato Nagayoshi.

About this article

Cite this article

Nagayoshi, M., Murao, H. & Tamaki, H. Developing reinforcement learning for adaptive co-construction of continuous high-dimensional state and action spaces. Artif Life Robotics 17, 204–210 (2012). https://doi.org/10.1007/s10015-012-0041-5

Received: 22 February 2012
Accepted: 09 July 2012
Published: 09 August 2012
Issue Date: December 2012
DOI: https://doi.org/10.1007/s10015-012-0041-5
