This paper applies a synchronously approximate dynamic programming (ADP) scheme to solve the Nash controls of the dual-driven load system (DDLS) with different motor properties based on game theory. First, a neural network (NN) is applied to approximate the dual-driven servo unknown system model. Because the properties of two motors are different, they have different performance indexes. Another NN is used to approximate performance index function of each motor. In order to minimize the performance index, the Hamilton function is constructed to solve the approximate optimal controls of the load system. Based on parameter error information, an adaptive law is designed to estimate NN weights. Finally, the practical DDLS is simulated to demonstrate that the optimal control inputs can be studied by ADP algorithm.
Servo system Approximate dynamic programming Nash equilibrium Multi-input system Neural networks
This is a preview of subscription content, log in to check access.
The work was supported by National Natural Science Foundation of China (No. 61433003 and No. 61273150).
S. Wang, J. Na, X. Ren, RISE-based asymptotic prescribed performance tracking control of nonlinear servo mechanisms. IEEE Trans. Syst. Man Cybernet. Syst (2017)Google Scholar
W. Zhao, X. Ren, X. Gao, Synchronization and tracking control for multi-motor driving servo systems with backlash and friction. Int. J. Robust Nonlinear Control 26(13), 2745–2766 (2016)MathSciNetCrossRefGoogle Scholar
W. Zhao, X. Ren, S. Wang, Parameter estimation-based time-varying sliding mode control for multimotor driving servo systems. IEEE/ASME Trans. Mechatron. 22(5), 2330–2341 (2017)CrossRefGoogle Scholar
Y. Jia, Robust control with decoupling performance for steering and traction of 4WS vehicles under velocity-varying motion. IEEE Trans. Control Syst. Technol. 8(3), 554–569 (2000)CrossRefGoogle Scholar
Y. Jia, Alternative proofs for improved LMI representations for the analysis and the design of continuous-time systems with polytopic type uncertainty: a predictive approach. Automat. Control IEEE Trans. 48(8), 1413–1416 (2003)MathSciNetCrossRefGoogle Scholar
M. Wang, X. Ren, Q. Chen, S. Wang, X. Gao, Modified dynamic surface approach with bias torque for multi-motor servomechanism. Control Eng. Pract. 50, 57–68 (2016)CrossRefGoogle Scholar
M. Wang, X. Ren, Q. Chen, Cascade optimal control for tracking and synchronization of a multimotor driving system. IEEE Trans. Control Syst. Technol. (2018)Google Scholar
Y. Lv, J. Na, Q. Yang, X. Wu, Y. Guo, Online adaptive optimal control for continuous-time nonlinear systems with completely unknown dynamics. Int. J. Control 89(1), 99–112 (2016)MathSciNetCrossRefGoogle Scholar
Y. Lv, X. Ren, J. Na, Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics. Neurocomputing (2017)Google Scholar
A. Al-Tamimi, F.L. Lewis, M. Abu-Khalaf, Discrete-time nonlinear HJB solution using approximate dynamic programming: convergence proof. IEEE Trans. Syst. Man Cybernet. Part B (Cybernetics) 38(4), 943–949 (2008)CrossRefGoogle Scholar
J. Nash, Non-cooperative games. Ann. Math. 286–295 (1951)Google Scholar
D. Liu, H. Li, D. Wang, Online synchronous approximate optimal learning algorithm for multi-player non-zero-sum games with unknown dynamics. IEEE Trans. Syst. Man Cybernet. Syst. 44(8), 1015–1027 (2014)CrossRefGoogle Scholar
D. Zhao, Q. Zhang, D. Wang, Y. Zhu, Experience replay for optimal control of nonzero-sum game systems with unknown dynamics. IEEE Trans. Cybernet. 46(3), 854–865 (2016)CrossRefGoogle Scholar
J. Na, M.N. Mahyuddin, G. Herrmann, X. Ren, P. Barber, Robust adaptive finite-time parameter estimation and control for robotic systems. Int. J. Robust Nonlinear Control 25(16), 3045–3071 (2015)MathSciNetCrossRefGoogle Scholar
J. Na, G. Herrmann, Online adaptive approximate optimal tracking control with simplified dual approximation structure for continuous-time unknown nonlinear systems. IEEE/CAA J. Automat. Sinica 1(4), 412–422 (2014)CrossRefGoogle Scholar
Y. Lv, J. Na, X. Ren, Online H∞ control for completely unknown nonlinear systems via an identifier–critic-based ADP structure. Int. J. Control, 1–12 (2017)Google Scholar