Simulation-Based Evaluations of Reinforcement Learning Algorithms for Autonomous Mobile Robot Path Planning

Viet, Hoang Huu; Kyaw, Phyo Htet; Chung, TaeChoong

doi:10.1007/978-94-007-2598-0_49

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 107)

Abstract

This work evaluates the efficiency of five fundamental reinforcement learning algorithms, namely Q-learning, Sarsa, Watkins’s Q(λ), Sarsa(λ), and Dyna-Q, and identifies which of the five is the most efficient for the path planning problem of autonomous mobile robots. Among reinforcement learning algorithms, Q-learning is the most popular and seems to be the most effective model-free algorithm for a learning robot. However, our experimental results show that the Dyna-Q algorithm, a method that combines model learning from past experience with direct reinforcement learning, is particularly efficient for this problem in environments with a large number of states.
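To make the distinction between direct reinforcement learning and model-based planning concrete, the following is a minimal sketch of tabular Dyna-Q on a small deterministic grid. It is not the authors' experimental setup: the grid size, the reward of 1 at the goal, the hyperparameters, and the names `dyna_q`, `n_planning`, and `walls` are all assumptions chosen for illustration.

```python
import random

def dyna_q(width=6, height=6, start=(0, 0), goal=(5, 5), walls=frozenset(),
           episodes=50, n_planning=10, alpha=0.1, gamma=0.95, epsilon=0.1,
           max_steps=10_000, seed=0):
    """Tabular Dyna-Q on a hypothetical grid; returns (Q, steps_per_episode)."""
    rng = random.Random(seed)
    actions = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # four axis-aligned moves
    Q = {}        # (state, action index) -> estimated value, default 0.0
    model = {}    # (state, action index) -> (reward, next state)
    seen = []     # state-action pairs recorded in the model so far

    def step(state, a):
        # Assumed environment: deterministic moves, reward 1 on reaching the goal.
        nxt = (state[0] + actions[a][0], state[1] + actions[a][1])
        if not (0 <= nxt[0] < width and 0 <= nxt[1] < height) or nxt in walls:
            nxt = state  # blocked moves leave the robot where it is
        return (1.0 if nxt == goal else 0.0), nxt

    def q(s, a):
        return Q.get((s, a), 0.0)

    def greedy(s):
        best = max(q(s, a) for a in range(4))
        return rng.choice([a for a in range(4) if q(s, a) == best])

    def update(s, a, r, s2):
        # One-step Q-learning backup; the goal is terminal, so no bootstrap there.
        target = r if s2 == goal else r + gamma * max(q(s2, b) for b in range(4))
        Q[(s, a)] = q(s, a) + alpha * (target - q(s, a))

    steps_per_episode = []
    for _ in range(episodes):
        s, steps = start, 0
        while s != goal and steps < max_steps:
            a = rng.randrange(4) if rng.random() < epsilon else greedy(s)
            r, s2 = step(s, a)
            update(s, a, r, s2)              # direct reinforcement learning
            if (s, a) not in model:
                seen.append((s, a))
            model[(s, a)] = (r, s2)          # model learning from real experience
            for _ in range(n_planning):      # planning with simulated experience
                ps, pa = rng.choice(seen)
                pr, ps2 = model[(ps, pa)]
                update(ps, pa, pr, ps2)
            s, steps = s2, steps + 1
        steps_per_episode.append(steps)
    return Q, steps_per_episode

if __name__ == "__main__":
    _, steps = dyna_q()
    print("first episode steps:", steps[0], "last episode steps:", steps[-1])
```

Setting `n_planning=0` disables the planning loop, so the same function then performs plain one-step tabular Q-learning. Comparing `steps_per_episode` for the two settings is one simple way to explore the effect of planning, though any numbers obtained this way describe only this toy grid.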

Acknowledgments

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science, and Technology (2010-0012609).

Author information

Authors and Affiliations

Artificial Intelligence Lab, Department of Computer Engineering, School of Electronics and Information, Kyung Hee University, 1-Seocheon, Giheung, Yongin, Gyeonggi, 446–701, South Korea
Hoang Huu Viet, Phyo Htet Kyaw & TaeChoong Chung

Corresponding author

Correspondence to Hoang Huu Viet.

Editor information

Editors and Affiliations

SeoulTech, Computer Science and Engineering, Seoul University of Science & Technology, Gongreung 2-dong 172, Seoul, 139-743, South Korea
James J. Park
Computer Science, University of Georgia, GSRC 415, Athens, 30602-7404, Georgia, USA
Hamid Arabnia
Business Administration, Daejin University, Hogukro 1007, Pocheon-Si, 487-711, Kyonggi-do, South Korea
Hang-Bae Chang
Division of Information and Computer Eng, Ajou University, San 5, Suwon, Gyeonggido, 443-749, South Korea
Taeshik Shon

Cite this paper

Viet, H.H., Kyaw, P.H., Chung, T. (2011). Simulation-Based Evaluations of Reinforcement Learning Algorithms for Autonomous Mobile Robot Path Planning. In: Park, J., Arabnia, H., Chang, HB., Shon, T. (eds) IT Convergence and Services. Lecture Notes in Electrical Engineering, vol 107. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-2598-0_49

DOI: https://doi.org/10.1007/978-94-007-2598-0_49
Published: 01 November 2011
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-2597-3
Online ISBN: 978-94-007-2598-0
eBook Packages: Engineering, Engineering (R0)
