Abstract
In this work, a behavior-based adaptive dynamic programming (BADP) method is proposed for coordination control of unmanned ground vehicle-manipulator systems (UGVMs). Through a null-space-based behavioral control (NSBC) framework, the multi-objective coordination control is transformed into a single-objective tracking control at the mission layer. Since cost functions and control constraints are simplified at control layer, the complexity of controller design is reduced. Then, an identifier-actor-critic reinforcement learning algorithm framework is introduced to learn the optimal control policy by balancing the control performance and consumption. Simulation results show that control costs are reduced around 13.5% per sampling period compared to existing multiple objective control methods. Finally, the BADP method is experimentally validated using four real UGVMs.
Similar content being viewed by others
References
G. B. Dai and Y. C. Liu, “Distributed coordination and cooperation control for networked mobile manipulators,” IEEE Transactions on Industrial Electronics, vol. 64, no. 6, pp. 5065–5074, June 2017.
G. B. Dai and Y. C. Liu, “Distributed coordination and cooperation control for networked mobile manipulators,” IEEE Transactions on Industrial Electronics, vol. 64, no. 6, pp. 5065–5074, June 2017.
A. Brahmi, M. Saad, G. Gauthier, W. H. Zhu, and J. Ghommam, “Adaptive control of multiple mobile manipulators transporting a rigid object,” International Journal of Control, Automation and Systems, vol. 15, no. 4, pp. 1779–1789, September 2017.
T. Zhao, Y. Y. Liu, Z. J. Li, C. Y. Su, and Y. Feng, “Adaptive control and optimization of mobile manipulation subject to input saturation and switching constraints,” IEEE Transactions on Automation Science and Engineering, vol. 16, no. 4, pp. 1543–1555, December 2018.
W. Li and R. Xiong, “Dynamical obstacle avoidance of task-constrained mobile manipulation using model predictive control,” IEEE Access, vol. 7, no. 1, pp. 88301–88311, July 2019.
J. Liao, F. Huang, and Z. Chen, “Optimization-based motion planning of mobile manipulator with high degree of kinematic redundancy,” International Journal of Intelligent Robotics and Applications, vol. 3, no. 2, pp. 115–130, April 2019.
U. Ahmad, Y. J. Pan, and H. H. Shen, “Robust control design for teleoperation of multiple mobile manipulators under time delays,” International Journal of Robust and Nonlinear Control, vol. 30, no. 16, pp. 6454–6472, May 2019.
C. Wu, H. Fang, and X. L. Zeng, “Distributed object transport of mobile manipulators with optimal manipulable coordination,” 37th Chinese Control Conference (CCC), pp. 7043–7048, 2018.
Y. Q. Zhang, Z. H. Deng, and Y. G. Hong, “Distributed optimal coordination for multiple heterogeneous Euler-Lagrangian systems,” Automatica, vol. 79, no. 1, pp. 207–213, May 2017.
J. Alonso-Mora, S. Baker, and D. Rus, “Multi-robot formation control and object transport in dynamic environments via constrained optimization,” The International Journal of Robotics Research, vol. 36, no. 9, pp. 1000–1021, September 2017.
J. Chen, S. X. Kai, “Cooperative transportation control of multiple mobile manipulators through distributed optimization,” Science China Information Sciences, vol. 61, no. 12, pp. 1–17, November 2018.
Y. J. Du, B. L. Ma, and D. Y. Meng, “Distributed control for signed networks of nonlinear agents,” International Journal of Control, Automation, and Systems, vol. 18, no. 2, pp. 271–281, July 2019.
D. Liu, S. Xue, Bo Zhao, and Q. L. Wei, “Adaptive dynamic programming for control: a survey and recent advances,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 51, no. 1, pp. 142–160, January 2021.
H. Y. Dong, X. W. Zhao, and Bo Zhao, “Optimal tracking control for uncertain nonlinear systems with prescribed performance via critic-only ADP,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, vol. 52, no. 1, pp. 561–573, January 2022.
Z. F. Ye, Z. J. Li, C. Y. Su, and B. Huang, “Adaptive tracking control of a class of constrained Euler-Lagrange systems by factorization of dynamic mass matrix,” IEEE Transactions on Industrial Electronics, vol. 66, no. 10, pp. 7831–7840, October 2019.
G. X. Wen, C. L. P. Chen, J. Feng, and N. Zhou, “Optimized multi-agent formation control based on an identifier-actor-critic reinforcement learning algorithm,” IEEE Transactions on Fuzzy Systems, vol. 26, no. 5, pp. 2719–2731, October 2018.
G. X. Wen, C. L. P. Chen, and B. Li, “Optimized formation control using simplified reinforcement learning for a class of multiagent systems with unknown dynamics,” IEEE Transactions on Fuzzy Systems, vol. 67, no. 9, pp. 7879–7888, September 2020.
G. X. Wen, C. L. P. Chen, and Y. J. Liu, “Formation control with obstacle avoidance for a class of stochastic multiagent systems,” IEEE Transactions on Industrial Electronics, vol. 65, no. 7, pp. 5847–5855, July 2018.
Z. H. Pan, D. Wang, H. B. Deng and K. W. Li, “A virtual spring method for the multi-robot path planning and formation control,” International Journal of Control, Automation and Systems, vol. 17, no. 5, pp. 1272–1282, December 2018.
V. G. Lopez and F. L. Lewis, “Dynamic multiobjective control for continuous-time systems using reinforcement learning,” IEEE Transactions on Automatic Control, vol. 64, no. 7, pp. 2869–2874, July 2019.
M. Mazouchi, Y. L. Yang, and H. Modares, “Data-driven dynamic multiobjective optimal control: an aspiration-satisfying reinforcement learning approach,” IEEE Transactions on Neural Networks and Learning Systems, vol. 1, no. 1, pp. 1–11, April 2021.
S. Khankalantary, I. Izadi, and F. Sheikholeslam, “Robust ADP-based solution of a class of nonlinear multi-agent systems with input saturation and collision avoidance constraints,” ISA transactions, vol. 107, no. 1, pp. 52–62, July 2020.
M. L. Greene, P. Deptula, S. Nivison and W. E. Dixon, “Sparse learning-based approximate dynamic programming with barrier constraints,” IEEE Control Systems Letters, vol. 4, no. 3, pp. 743–748, July 2020.
Y. L. Yang, K. G. Vamvoudakis, H. Modares, W. He, Y. X. Yin, and D. C. Donald, “Safety-aware reinforcement learning framework with an actor-critic-barrier structure,” 2019 American Control Conference (ACC), pp. 2352–2358, 2018.
X. J. Lan, L. Lei, and Y. J. Wang, “ADP-based intelligent decentralized control for multi-agent systems moving in obstacle environment,” IEEE Access, vol. 7, no. 1, pp. 59624–59630, May 2019.
K. Baizid, G. Giglio, F. Pierri, M. A. Trujillo, G. Antonelli, F. Caccavale, A. Viguria, S. Chiaverini, and A. Ollero, “Behavioral control of unmanned aerial vehicle manipulator systems,” Autonomous Robots, vol. 14, no. 1, pp. 1203–1220, July 2017.
D. W. Wang and L. C. Boon, “Modeling and analysis of skidding and slipping in wheeled mobile robots: Control design perspective,” IEEE Transactions on Robotics, vol. 24, no. 3, pp. 676–687, June 2008.
J. Huang, N. Zhou, and M. Cao, “Adaptive fuzzy behavioral control of second-order autonomous agents with prioritized missions: Theory and experiments,” IEEE Transactions on Industrial Electronics, vol. 66, no. 12, pp. 9612–9622, December 2019.
C. Ott, A. Dietrich, and A. Albu-Schäffer, “Prioritized multi-task compliance control of redundant manipulators,” Automatica, vol. 53, no. 1, pp. 416–423, February 2015.
G. Antonelli, “Stability analysis for prioritized closed-loop inverse kinematic algorithms for redundant robotic systems,” IEEE Transactions on Robotics, vol. 25, no. 5, pp. 985–994, April 2009.
Y. T. Chen, Z. Y. Zhang, and J. Huang, “Dynamic Task Priority Planning for Null-Space Behavioral Control of Multi-Agent Systems,” IEEE Access, vol. 8, no. 1, pp. 2169–3536, August 2020.
Z. Y. Zhang, Z. B. Mo, Y. T. Chen, and J. Huang, “Reinforcement Learning Behavioral Control for Nonlinear Autonomous System,” IEEE/CAA Journal of Automatica Sinica, vol. 9, no. 9, pp. 1561–1573, August 2022.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This work was supported in part by the National Natural Science Foundation of China under Grants 61603094.
Zhenyi Zhang received his B.E. degree in mechanical and electronic engineering, and an M.E. degree in mechanical engineering from Zhejiang Sci-Tech University, Hangzhou, China, in 2016 and 2019. He is now a Ph.D. student at Fuzhou University, China. His research interests include intelligent robot ethology and multiagent learning systems.
Jianfei Chen received his B.E. degree in measurement and control technology and instrumentation from Qingdao University of Science and Technology, Shandong, China, in 2020. He is now an M.E. student at Fuzhou University, Fuzhou, China. His research interests include aerial-ground cooperative systems optimization and multi-agent systems.
Zhibin Mo received his B.E. degree in Shantou University, Shantou, China, in 2018. He is pursuing an M.E. degree in Fuzhou University, Fuzhou, China. His research interests include multi-agent reinforcement learning and human-robot coordination systems.
Yutao Chen received his B.E. degree in automation from Hunan University, China, in 2012, a master’s degree from the School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University, China, in 2014, and a Ph.D. degree from the Department of Information Engineering, University of Padova, Italy, in 2018. From 2019 to 2020, he was a Post-doctoral Researcher with the Department of Electrical Engineering, Eindhoven University of Technology, The Netherlands. He is currently an Assistant Professor with the College of Electrical Engineering and Automation, Fuzhou University, China. His research interests include model predictive control algorithms and unmanned intelligent systems and applications.
Jie Huang received his B.E. degree in electrical engineering and automation, an M.E. degree in control engineering from Fuzhou University, China, and a Ph.D. degree in control science and engineering from Beijing Institute of Technology, Beijing, China, in 2005, 2010, and 2015, respectively. From 2005 to 2015, he was a lecturer with Fujian Institute of Education, Fuzhou, China. From 2014 to 2016, he was postdoctoral researcher with the Faculty of Mathematics and Natural Sciences, University of Groningen, the Netherlands. From 2016 to 2018, he held lecturer appointments with the Faculty of Science and Engineering, University of Groningen, the Netherlands. He is currently a full professor of robotic and control with the College of Electrical Engineering and Automation, Fuzhou University, China and the director of the 5G+ Industrial Internet Institute, Fuzhou University, China. He is the vice-president of the Fujian Automation Association, Fujian Province, China. His research interests include autonomous robots, complex network dynamics, and multi-agent systems.
Rights and permissions
About this article
Cite this article
Zhang, Z., Chen, J., Mo, Z. et al. A Behavior-based Adaptive Dynamic Programming Method for Multiple Mobile Manipulators Coordination Control. Int. J. Control Autom. Syst. 21, 3022–3035 (2023). https://doi.org/10.1007/s12555-021-0904-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12555-021-0904-4