Study of Cooperation Strategy of Robot Based on Parallel Q-Learning Algorithm

  • Shuda Wang
  • Feng Si
  • Jing Yang
  • Shuoning Wang
  • Jun Yang
Conference paper

DOI: 10.1007/978-3-540-88513-9_68

Part of the Lecture Notes in Computer Science book series (LNCS, volume 5314)
Cite this paper as:
Wang S., Si F., Yang J., Wang S., Yang J. (2008) Study of Cooperation Strategy of Robot Based on Parallel Q-Learning Algorithm. In: Xiong C., Huang Y., Xiong Y., Liu H. (eds) Intelligent Robotics and Applications. ICIRA 2008. Lecture Notes in Computer Science, vol 5314. Springer, Berlin, Heidelberg

Abstract

How to solve MR (Multi-Robots) in a dynamic environment of the study of knowledge, and to complete a task or solve a problem, the robot can have the same goal , also different goals. Therefore, to put forward two architectures, which are more suitable for MR studying, according to the architecture, to design the improved learning methods algorithm Q for MR, which solve the problems of coordination and cooperation, such as the credit distribution, distribution of resources, tasks and conflict resolution. MR may be learning in independent environment, and fusing results after learning cycle, and the final results is going to be shared by all the robots, and as the basis of reference passing into next learning cycle, increase learning chances between MR and environment. Simulation results show that the learning algorithm enables MR learning rapidly and quickly surrounded by a mobile group, complying with better effective.

Keywords

Multi-Robots Reinforcement Learning Q-learning Dynamic Programming Parallel learning 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Shuda Wang
    • 1
  • Feng Si
    • 1
  • Jing Yang
    • 1
  • Shuoning Wang
    • 1
  • Jun Yang
    • 1
  1. 1.College of Computer and Information EngineeringHarbin University of CommerceHarbinChina

Personalised recommendations