Emergency-Response Locomotion of Hexapod Robot with Heuristic Reinforcement Learning Using Q-Learning
The locomotion of legged robots is often controlled by predefined gaits, an approach that works well when all joints and motors operate normally. However, walking legged robots run a high risk of being damaged during operation, which can cause their joints to break down. In this paper, we introduce a reinforcement-learning-based approach that lets a legged robot generate a real-time locomotion response when such a breakdown occurs. Our approach detects which joints are still functional, replaces the predefined gait with a suitable gait function, and updates the gait-generation function by Q-Learning to achieve proper locomotion.
Keywords: Reinforcement learning, Q-Learning, Hexapod robot, Emergency response
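
The Q-Learning step mentioned in the abstract can be illustrated with a minimal tabular sketch. This is not the paper's implementation: the state encoding (a tuple of six per-leg health flags), the gait names in `GAITS`, the hyperparameter values, and the optional `heuristic` bonus are all assumptions made for illustration. The update itself is the standard Q-Learning rule, Q(s,a) ← Q(s,a) + α·(r + γ·max Q(s',a') − Q(s,a)).

```python
import random
from collections import defaultdict

ALPHA = 0.1    # learning rate (assumed value)
GAMMA = 0.9    # discount factor (assumed value)
EPSILON = 0.1  # exploration rate (assumed value)

# Hypothetical candidate gaits; the paper's actual action set is not given here.
GAITS = ["tripod", "wave", "ripple"]


def q_update(q, state, action, reward, next_state, actions,
             alpha=ALPHA, gamma=GAMMA):
    """Apply one standard tabular Q-Learning update and return the new value."""
    best_next = max(q[(next_state, a)] for a in actions)
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


def choose_action(q, state, actions, heuristic=None,
                  epsilon=EPSILON, rng=random):
    """Epsilon-greedy selection; an optional heuristic bonus biases the greedy pick."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    bonus = heuristic or (lambda s, a: 0.0)
    return max(actions, key=lambda a: q[(state, a)] + bonus(state, a))


# Usage: leg 3 is reported as failed (hypothetical state encoding).
q_table = defaultdict(float)
damaged = (True, True, False, True, True, True)
gait = choose_action(q_table, damaged, GAITS)
# The reward would come from the robot or a simulator; 1.0 is a placeholder.
q_update(q_table, damaged, gait, reward=1.0, next_state=damaged, actions=GAITS)
```

Keying the table on the joint-health tuple means each damage pattern learns its own gait preferences, which is one simple way to model substituting a gait according to the joints that are still available.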