Abstract
In this paper, we propose an actor-critic neuro-control for a class of continuous-time nonlinear systems under nonlinear abrupt faults, which is combined with an adaptive fault diagnosis observer (AFDO). Together with its estimation laws, an AFDO scheme, which estimates the faults in real time, is designed based on Lyapunov analysis. Then, based on the designed AFDO, a fault tolerant actor- critic control scheme is proposed where the critic neural network (NN) is used to approximate the value function and the actor NN updates the fault tolerant policy based on the approximated value function in the critic NN. The weight update laws for critic NN and actor NN are designed using the gradient descent method. By Lyapunov analysis, we prove the uniform ultimately boundedness (UUB) of all the states, their estimation errors, and NN weights of the fault tolerant system under the unpredictable faults. Finally, we verify the effectiveness of the proposed method through numerical simulations.
Similar content being viewed by others
References
Y. Zhang and J. Jiang, “Issues on integration of fault diagnosis and reconfigurable control in active fault tolerant control systems,” Proc. of the 6th IFAC Symposium of Fault Detection Supervision and Safety for Technical Processes, pp. 1513–524, 2006.
H. Niemann and J. Stoustrup, “Passive fault tolerant control of a double inverted pendulum-a case study,” Control Engineering Practice, vol. 13, no. 8, pp. 1047–1059, 2005.
M. M. Polycarpou and J. Helmicki, “Automated fault detection and accommodation: a learning systems approach,” IEEE Trans. Systems, Man, And Cybernetics, vol. 25, no. 11, pp. 1447–1458, 1995.
B. Jiang, J. L. Wang, and Y. C. Soh, “An adaptive technique for robust diagnosis of faults with independent effects on system outputs,” International Journal of Control, vol. 75, no. 11, pp. 792–802, 2002.
C. P. Tan and C. Edwards, “Sliding mode observers for robust detection and reconstruction of actuator and sensor faults,” International Journal of Robust and Nonlinear Control, vol. 13, no. 5, pp. 443–463, 2003.
R. Sreedhar, B. Fernandez, and G. Y. Masada, “Robust fault detection in nonlinear systems using sliding mode observers,” Proc. of IEEE Conf. on Control Applications, pp. 715–721, 1993.
F. L. Lewis, Optimal Control, John Wiley, 1986.
M. M. Polycarpou, “Stable adaptive neural control scheme for nonlinear Systems,” IEEE Trans. on Automatic Control, vol. 41, no. 3, pp. 447–451, 1996.
V. Nevistib and J. A. Primbs, Constrained nonlinear optimal control: a converse HJB approach, California Institute of Technology Pasadena, CA 91125, Tech rep. CIT-CDS 96-021.
Y. Wang, D. Zhou, S. J. Qin, and H. Wang, “Active fault-tolerant control for a class of nonlinear systems with sensor faults,” International Journal of Control, Automation, and System, vol. 6, no. 3, pp. 339–350, 2008.
Z. F. Gao, B. Jiang, P. Shi, and Y. H. Cheng, “Sensor fault estimation and compensation for Microsatellite attitude control systems,” International Journal of Control, Automation, and System, vol. 8, no. 2, pp. 228–237, 2010.
P. Werbos, “Approximate dynamic programming for real-time control and neural modeling,” in Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches, D. A. White and D. A. Sofge, Eds. Van Nostrand Reinhold, New York, 1992.
A. Al-Tamimi and F. L. Lewis, “Discrete-time Nonlinear HJB solution using approximate dynamic programming: convergence proof,” IEEE Trans. Syst. Man Cybern. Part B Cybern, vol. 38, no. 4, pp. 943–949, August 2008.
D. V. Prokhorov and D. C. Wunsch, “Adaptive critic designs,” IEEE Trans. Neural Networks, vol. 8, no. 5, pp. 997–1007, September 1997.
R. E. Bellman, Dynamic Programming, Princeton Univ. Press, Princeton, NJ, 1957.
S. E. Dreyfus and A. M. Law, The Art and Theory of Dynamic Programming, Academic, New York, NY, 1977.
W. B. Powell, Approximate Dynamic Programming Solving the Curses of Dimensionality, Wiley, Princeton, NJ, 2007.
S. Mohahegi, G. K. Venayagamoorth, and R. G. Harley, “Fully evolvable optimal neurofuzzy controller using adaptive critic designs,” IEEE Trans. Fuzzy Syst., vol. 16, no. 6, pp. 1450–1461, 2008.
S. Mohahegi, G. K. Venayagamoorth, and R. G. Harley, “Adaptive critic design based neuro-fuzzy controller for a static compensator in a multi machine power system,” IEEE Trans. Power Syst., vol. 21, no. 4, pp. 1744–1754, 2006.
J. J. Murray, C. J. Cox, G. G. Lendaris, and R. Saeks, “Adaptive dynamic programming,” IEEE Trans. Syst., Man., Cybern. C, vol. 32, no. 2, pp. 140–153, 2002.
J. Campos and F. Lewis, “Adaptive critic neural network for feedforward compensation,” Proc. IEEE Am. Control Conf., vol. 4, 1999.
S. Bhasin, M. Johnson, and W. E. Dixon, “A modelfree robust policy iteration algorithm for optimal control of nonlinear systems,” Proc. IEEE Conf. Decis. Control, 2010.
Y. Xu, B. Jiang, G. Tao, and Z. Gao, “Fault tolerant control for a class of nonlinear systems with application to near space vehicle,” Circuits. Syst. Signal. Process, vol. 30, no. 3, pp. 655–672, 2011.
S. M. Nail, P. R. Kumar, and B. E. Ydstie, “Robust continuous-time adaptive control by parameter projection,” IEEE Trans. Automat. Contr., vol. 37, no. 2, pp. 182–197, 1992.
K. Doya, “Reinforcement learning in continuous time and space,” Neural Comput., vol. 12, no. 1, pp. 219–245, 2000.
Y. Xu, Y. Li, and S. Tong, “Fuzzy adaptive actuator failure compensation dynamic surface control of multi-input and multi-output nonlinear systems,” Int. J. Innovative Comput. Inf. Control., vol. 9, no. 12, pp. 4875–4888, 2013.
C. Dobre, “A cluster-enhanced fault tolerant peerto-peer system,” Int. J. Innovative Comput. Inf. Control., vol. 10, no. 2, pp. 417–436, 2014.
C. Dobre, “Dynamic route guidance algorithm based on improved hopfield neural network and genetic algorithm,” Int. J. Innovative Comput. Inf. Control., vol. 10, no. 2, pp. 811–822, 2014.
L. Wu, Z. Feng, and W. X. Zheng, “Exponential stability analysis for delayed neural networks with switching parameters: average dwell time approach,” IEEE Trans. Neural Net, vol. 21, no. 9, pp. 1396–1407, 2010.
L. Wu, Z. Feng, and J. Lam, “Stability and synchronization of discrete-time neural networks with switching parameters and time-varying delays,” IEEE Trans. Neural Net, vol. 24, no. 12, pp. 1957–1972, 2013.
Author information
Authors and Affiliations
Corresponding author
Additional information
Seung Jin Chang received his B.S. degree in Electrical and Electronic Engineering from Yonsei University, Seoul, Korea, in 2007. Since 2010, he has been working as a Research Assistant in Control Engineering Laboratory, Yonsei University, Seoul, where he is currently working toward a Ph.D. degree in Electrical and Electronic Engineering. His research interests include dynamic programming applied to fault tolerant control and condition monitoring/diagnosis of cables and signal processing techniques, time-frequency analysis, and estimation theory.
Jae Young Lee received his B.S. degree in Information and Control Engineering from Kwangwoon University, Seoul, Korea, in 2006. He is currently pursuing a Ph.D. degree in Electrical and Electronic Engineering with the Control Engineering Laboratory, Yonsei University, Seoul. He has been a Research Assistant with the Control Engineering Laboratory since 2006. His current research interests include approximate dynamic programming/reinforcement learning, optimal/adaptive control, nonlinear control theories, neural networks, and applications to unmanned vehicles, multiagent systems, robotics, and power systems.
Jin Bae Park received his B.S. degree in Electrical Engineering from Yonsei University, Seoul, Korea, and his M.S. and Ph.D. degrees in Electrical Engineering from Kansas State University, Manhattan, KS, USA, in 1977, 1985, and 1990, respectively. Since 1992, he has been with the Department of Electrical and Electronic Engineering, Yonsei University, where he is currently a Professor. His major research interests include robust control and filtering, nonlinear control, intelligent mobile robot, fuzzy logic control, neural networks, chaos theory, and genetic algorithms. He served as the Editor-in- Chief (2006-2010) for the International Journal of Control, Automation, and Systems, the Vice-President (2009-2011) for Institute of Control, Robot, and Systems Engineers (ICROS), and the President for the ICROS (2013).
Yoon Ho Choi received his B.S., M.S., and Ph.D. degrees in Electrical Engineering from Yonsei University, Seoul, Korea, in 1980, 1982, and 1991, respectively. Since 1993, he has been with Department of Electronic Engineering, Kyonggi University, Suwon, Korea, where he is currently a Professor. He was with Department of Electrical Engineering, The Ohio State University, where he was a Visiting Scholar (2000–2002, 2009–2010). His research interests include nonlinear control, intelligent control, multi-legged and mobile robots, networked control systems, and ADP based control. Prof. Choi was the Director (2003–2004, 2007–2008) of the Institute of Control, Robotics and Systems (ICROS). He is serving as the Vice-President for the ICROS (2012-present).
Rights and permissions
About this article
Cite this article
Chang, S.J., Lee, J.Y., Park, J.B. et al. An online fault tolerant actor-critic neuro-control for a class of nonlinear systems using neural network HJB approach. Int. J. Control Autom. Syst. 13, 311–318 (2015). https://doi.org/10.1007/s12555-014-0034-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12555-014-0034-3