On-Line Learning Control for Discrete Nonlinear Systems Via an Improved ADDHP Method

  • Huaguang Zhang
  • Qinglai Wei
  • Derong Liu
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4491)

Abstract

This paper mainly discusses a generic scheme for on-line adaptive critic design for nonlinear system based on neural dynamic programming (NDP), more exactly, an improved action-depended dual heuristic dynamic programming (ADDHP) method. The principal merit of the proposed method is to avoid the model neural network which predicts the state of next time step, and only use current and previous states in the method, as makes the algorithm more suitable for real-time or on-line application for process control. In this paper, convergence proof of the method will also be given to guarantee the control to reach the optimal. At last, simulation result verifies the performance.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Seong, C.Y., Bermard, W.: Neural Dynamic Optimization for Control Systems-Part I: Background. IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics 31(4), 482–489 (2001)CrossRefGoogle Scholar
  2. 2.
    Danil, P., Don, W.: Adaptive Critic Designs. IEEE Transactions on Neural Networks 8(5), 997–1007 (1997)CrossRefGoogle Scholar
  3. 3.
    Murray, J.J., Cox, C.J., Lendaris, G.G., Saeks, R.: Adaptive Dynamic Programming. IEEE Transactions on Systems, Man, and Cybernetics-Part C: Applications and Reviews 32(2), 140–153 (2002)CrossRefGoogle Scholar
  4. 4.
    Zhang, H.G., Luo, Y.H., Liu, D.R.: A New Fuzzy Identification Method Based on Adaptive Critic Designs. In: Wang, J., Yi, Z., Żurada, J.M., Lu, B.-L., Yin, H. (eds.) ISNN 2006. LNCS, vol. 3971, pp. 804–809. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  5. 5.
    Liu, D.R., Xiong, X.X., Zhang, Y.: Action-Dependent Adaptive Critic Designs. In: IEEE Neural Networks Proceedings, pp. 990–995 (2001)Google Scholar
  6. 6.
    Liu, D.R., Zhang, H.G.: A Neural Dynamic Programming Approach for Learning Control of Failure Avoidance Problems. International Journal of Intelligence Control and Systems 10(1), 21–32 (2005)Google Scholar
  7. 7.
    Liu, D.R., Zhang, Y., Zhang, H.G.: A Self-learning Call Admission Control Scheme for CDMA Cellular Networks. IEEE Transactions on Neural Network 16(5), 804–809 (2006)Google Scholar
  8. 8.
    Jennie, S., Wang, Y.T.: On-Line Learning Control by Association and Reinforcement. IEEE Transactions on Neural Networks 12(2), 264–276 (2001)CrossRefGoogle Scholar
  9. 9.
    George, G.L., Christian, P.: Training Strategies for Critic and Action Neural Networks in Dual Heuristic Programming Method. In: IEEE Neural Networks International Conference, vol. 2, pp. 712–717 (1997)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Huaguang Zhang
    • 1
    • 2
  • Qinglai Wei
    • 1
  • Derong Liu
    • 3
  1. 1.School of Information Science and Engineering, Northeastern University, Shenyang, Liaoning, 110004People’s Republic of China
  2. 2.Key Laboratory of Process Industry Automation, Ministry of EducationPeople’s Republic of China
  3. 3.Department of Electrical and Computer Engineering University of Illinois at Chicago 60607-7053 ChicagoUSA

Personalised recommendations