Abstract
In this article, we examine the learning performance of Q-learning under various conditions using reward-based Voronoi Q-value elements (VQEs), which determine how an agent acts in a given state in a single-agent environment. To test our hypotheses, we performed computational experiments in several situations: rotating a lattice arrangement of VQEs through various angles, rotating the action directions of an agent with four actions through various angles, and arranging the VQEs randomly. In each situation, we evaluated how accurately the optimal Q-values for state-action pairs are estimated from continuous-valued inputs. The results show that the learning performance changes with the relative angle between the VQE arrangement and the agent's action directions.
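The abstract describes Q-learning over a Voronoi partition of a continuous two-dimensional state space but gives no implementation details. The following Python code is a minimal sketch of the general idea under stated assumptions: the VQELearner class and its parameter names are hypothetical, a nearest-neighbor lookup maps a continuous state to its VQE, action selection is epsilon-greedy over four actions, and the update is standard one-step Q-learning; the hyperparameters and the rotated-lattice construction are illustrative, not taken from the paper.

```python
import numpy as np

class VQELearner:
    """Hypothetical sketch of Q-learning over Voronoi Q-value elements."""

    def __init__(self, centers, n_actions, alpha=0.1, gamma=0.9, epsilon=0.1):
        # centers: (n_vqe, 2) array of VQE positions in the 2-D state space.
        # Each VQE stores one Q-value per discrete action.
        self.centers = np.asarray(centers, dtype=float)
        self.q = np.zeros((len(self.centers), n_actions))
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def nearest(self, state):
        # A continuous-valued input falls in the Voronoi region of the
        # closest VQE center (nearest-neighbor rule).
        state = np.asarray(state, dtype=float)
        return int(np.argmin(np.linalg.norm(self.centers - state, axis=1)))

    def act(self, state, rng):
        # Epsilon-greedy selection over the nearest VQE's Q-values.
        i = self.nearest(state)
        if rng.random() < self.epsilon:
            return int(rng.integers(self.q.shape[1]))
        return int(np.argmax(self.q[i]))

    def update(self, state, action, reward, next_state):
        # One-step Q-learning update applied to the VQE containing the state.
        i, j = self.nearest(state), self.nearest(next_state)
        target = reward + self.gamma * np.max(self.q[j])
        self.q[i, action] += self.alpha * (target - self.q[i, action])

# Illustrative setup: a 10x10 lattice of VQEs rotated by an angle theta,
# loosely mirroring the rotation experiments described above.
theta = np.deg2rad(15.0)
rot = np.array([[np.cos(theta), -np.sin(theta)],
                [np.sin(theta),  np.cos(theta)]])
grid = np.array([[x, y] for x in range(10) for y in range(10)], dtype=float)
learner = VQELearner(grid @ rot.T, n_actions=4)

rng = np.random.default_rng(0)
a = learner.act([3.2, 4.7], rng)
learner.update([3.2, 4.7], a, reward=1.0, next_state=[3.2, 5.7])
```

A random VQE arrangement, the third condition in the experiments, would simply replace the rotated lattice with uniformly sampled center positions.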
Cite this article
Aung, K.T., Fuchida, T. A comparison of learning performance in two-dimensional Q-learning by the difference of Q-values alignment. Artif Life Robotics 16, 473–477 (2012). https://doi.org/10.1007/s10015-011-0961-5