Abstract
In this paper, an approximate dynamic programming (ADP)-based approach is developed to handle the robust optimal tracking control problem for switched systems with uncertainties in the finite horizon. The switched systems with unknown matched uncertainties are formulated by virtue of system dynamics and reference trajectory, where the complicated tracking problem is converted to a stabilizing robust optimal control problem. To avoid the requirement of system dynamics knowledge, a neural network (NN)-based identifier is utilized to estimate the unknown switched systems dynamics. The actor-critic NNs are constructed to approximate the optimal control input and the corresponding performance index, where the weights are trained backward-in-time in an off-line manner. Benefiting from the Lipschitz continuous condition, the convergence of the proposed approach is proved, which illustrates the iteration approach will converge to the unique solution under a small enough sampling time interval. Finally, two numerical simulation cases are employed to verify the effectiveness of the proposed approach.
Similar content being viewed by others
References
X. Wu, K. Zhang, and C. Ming, “Optimal control of constrained switched systems and application to electrical vehicle energy management,” Nonlinear Analysis-Hybrid Systems, vol. 30, pp. 171–188, 2018.
R. He, X. Tian, Y. Ni, and Y. Xu, “Mode transition coordination control for parallel hybrid electric vehicle based on switched system,” Advances in Mechanical Engineering, vol. 9, no. 8, pp. 1413–1424, 2017.
T. Sardarmehni and A. Heydari, “Sub-optimal switching in anti-lock brake systems using approximate dynamic programming,” IET Control Theory and Applications, vol. 13, no. 9, pp. 1413–1424, 2019.
B. Niu, X. Zhao, X. Fan, and Y. Cheng, “A new control method for state-constrained nonlinear switched systems with application to chemical process,” International Journal of Control, vol. 88, no. 9, pp. 1693–1701, 2015.
Y. Xu, H. Yang, and B. Jiang, “Fault tolerant time optimization of switched systems with application to multi-agent flight control,” International Journal of Control, Automation, and Systems, vol. 17, no. 2, pp. 380–390, 2019.
Z. Xiang and R. Wang, “Robust control for uncertain switched non-linear systems with time delay under asynchronous switching,” IET Control Theory and Applications, vol. 3, no. 8, pp. 1041–1050, 2009.
D. Yang and X. Li, “Robust stability analysis of stochastic switched neural networks with parameter uncertainties via state-dependent switching law,” Neurocomputing, vol. 452, pp. 813–819, 2020.
W. Yang and S. Tong, “Robust stabilization of switched fuzzy systems with actuator dead zone,” Neurocomputing, vol. 173, pp. 1028–1033, 2016.
D. Du, Y. Yang, H. Zhao, and Y. Tan, “Robust fault diagnosis observer design for uncertain switched systems,” International Journal of Control, Automation, and Systems, vol. 18, no. 12, pp. 3159–3166, 2020.
D. Görges, M. Izak, and S. Liu, “Optimal control and scheduling of switched systems,” IEEE Transactions on Automatic Control, vol. 56, no. 1, pp. 135–140, 2010.
B. Lincoln and A. Rantzer, “Relaxing dynamic programming,” IEEE Transactions on Automatic Control, vol. 51, no. 8, pp. 1249–1260, 2006.
W. Zhang, J. Hu, and A. Abate, “On the value functions of the discrete-time switched LQR problem,” IEEE Transactions on Automatic Control, vol. 54, no. 11, pp. 2669–2674, 2009.
N. Cao, H. Zhang, Y. Luo, and D. Feng, “Infinite horizon optimal control of affine nonlinear discrete switched systems using two-stage approximate dynamic programming,” International Journal of Systems Science, vol. 43, no. 9, pp. 1673–1682, 2012.
H. Zhang, C. Qin, and Y. Luo, “Neural-network-based constrained optimal control scheme for discrete-time switched nonlinear system using dual heuristic programming,” IEEE Transactions on Automation Science and Engineering, vol. 11, no. 3, pp. 839–849, 2014.
K. Zhang, H. Zhang, Y. Liang, and Y. Wen, “A new robust output tracking control for discrete-time switched constrained input systems with uncertainty via a critic-only iteration learning method,” Neurocomputing, vol. 396, no. 5, pp. 162–171, 2020.
A. Heydari, “Optimal scheduling for reference tracking or state regulation using reinforcement learning,” Journal of the Franklin Institute, vol. 352, no. 8, pp. 3285–3303, 2015.
A. Heydari and S. N. Balakrishnan, “Optimal switching and control of nonlinear switching systems using approximate dynamic programming,” IEEE Transactions on Neural Networks and Learning Systems, vol. 25, no. 6, pp. 1106–1117, 2013.
A. Heydari and S. N. Balakrishnan, “Optimal switching between controlled subsystems with free mode sequence,” Neurocomputing, vol. 149, pp. 1620–1630, 2015.
F. L. Lewis, D. Vrabie, and V. L. Syrmos, Optimal Control, John Wiley & Sons, 2012.
D. Wang, D. Liu, and H. Li, “Policy iteration algorithm for online design of robust control for a class of continuous-time nonlinear systems,” IEEE Transactions on Automation Science and Engineering, vol. 11, no. 2, pp. 627–632, 2014.
K. Keesman, System Identification: An Introduction, Springer, 2011.
L. Ljung, System Identification, Wiley Encyclopedia of Electrical and Electronics Engineering, 2001.
A. Heydari and S. N. Balakrishnan, “Fixed-final-time optimal control of nonlinear systems with terminal constraints,” Neural Networks, vol. 48, pp. 61–71, 2013.
M. Stone and P. Goldbart, Mathematics for Physics: A Guided Tour for Graduate Students, Cambridge University Press, 2009.
K. Hornik, “Approximation capabilities of multilayer feedforward networks,” Neural Networks, vol. 4, no. 2, pp. 251–257, 1991.
R. Abraham, J. E. Marsden, and T. Ratiu, Manifolds, Tensor Analysis, and Applications, Springer Science & Business Media, 2012.
L. Li, “A new complexity bound for the least-squares problem,” Computers & Mathematics with Applications, vol. 31, no. 12, pp. 15–16, 1996.
G. S. Deaecto, M. Souza, and J. C. Geromel, “Discrete-time switched linear systems state feedback design with application to networked control,” IEEE Transactions on Automatic Control, vol. 60, no. 3, pp. 877–881, 2015.
Y. Ebihara, D. Peaucelle, and D. Arzelier, “Periodically time-varying memory state-feedback controller synthesis for discrete-time linear systems,” Automatica, vol. 47, no. 1, pp. 14–25, 2011.
S. Wang, M. Wu, and Z. Jia, Matrix Inequality, Science Press, 2006.
Funding
This work is supported by National Natural Science Foundation of China (No. 61633019).
Author information
Authors and Affiliations
Corresponding author
Additional information
Shangwei Zhao received his B.S. degree in automation from Northeastern University, Shenyang, China, in 2018. He is currently pursuing a Ph.D. degree with the Department of Automation, Shanghai Jiao Tong University, Shanghai, China from 2018. His current research interests include adaptive dynamic programming and switched systems.
Jingcheng Wang received his B.S. and M.S. degrees from Northwestern Polytechnic University, Xi’an, China, in 1992 and 1995, respectively, and a Ph.D. degree from Zhejiang University, Hangzhou, China, in 1998. Now, he is a Professor with Shanghai Jiao Tong University, Shanghai, China. His research interests include robust control and intelligent control.
Haotian Xu received his B.S. degree from the School of Mathematics, Shandong University, China, in 2016. He is currently pursuing a Ph.D. degree with the Department of Automation, Shanghai Jiao Tong University, Shanghai, China from 2016. His current research interests include distributed control and observer design, and multi-agent system.
Hongyuan Wang received his B.S. degree in automation, Hunan University, Changsha, China, in 2015. He is currently pursuing a Ph.D. degree in the Department of Automation, Shanghai Jiao Tong University, Shanghai, China. Now, his research focuses on stochastic/robust model predictive control.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhao, S., Wang, J., Xu, H. et al. Finite Horizon Robust Optimal Tracking Control Based on Approximate Dynamic Programming for Switched Systems with Uncertainties. Int. J. Control Autom. Syst. 20, 1051–1062 (2022). https://doi.org/10.1007/s12555-020-0982-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12555-020-0982-8