Skip to main content

A Method of Attitude Control Based on Deep Deterministic Policy Gradient

  • Conference paper
  • First Online:
Cognitive Systems and Signal Processing (ICCSIP 2018)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1006))

Included in the following conference series:

  • 1285 Accesses

Abstract

The traditional methods of attitude control of satellite are represented by PID control, adaptive control, optimal control and intelligent control, etc. With these methods, lots of work of parameter adjustment and simulation needs to do on the earth. We proposed a method based on Deep Deterministic Policy Gradient (DDPG) to learn attitude control strategy in orbit in order to reduce the work and establish the ability of adapting to space environment. Through constructing training environment by using the attitude control system of satellite platform (ACSoSP), we trained an attitude control model and used the model to generate the strategy of attitude control. Validate the method by experiments in simulation environment.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Lillicrap, T.P.: Continuous control with deep reinforcement learning. In: ICLR (2016)

    Google Scholar 

  • Lu, J.: Study on attitude control algorithm of three-axis stabilization satellite. Ph.D. thesis, Harbin Institute of Technology (2007)

    Google Scholar 

  • RUMMERY: On-Line Q-Learning Using Connectionist Systems. University of Cambridge, Cambridge (1994)

    Google Scholar 

  • Silver, D.: Deterministic policy gradient algorithms. In: ICML (2014)

    Google Scholar 

  • Sutton, R.S.: Reinforcement Learning: An Introduction. The MIT Press, Cambridge (1998)

    MATH  Google Scholar 

  • Watkins, C.J.: Learning from delayed rewards. Ph.D. thesis. University of Cambridge, Cambridge (1989)

    Google Scholar 

  • Williams, R.J.: Simple statistical gradient-following algorithms forconnectionist reinforcement learning. Mach. Learn. 8(2), 229–256 (1992)

    MathSciNet  MATH  Google Scholar 

  • Yu, J.: Modern Micro-Sat Technology and Application. Shanghai Popular Technology Press, Shanghai (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jian Zhang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Zhang, J., Wu, F., Zhao, J., Xu, F. (2019). A Method of Attitude Control Based on Deep Deterministic Policy Gradient. In: Sun, F., Liu, H., Hu, D. (eds) Cognitive Systems and Signal Processing. ICCSIP 2018. Communications in Computer and Information Science, vol 1006. Springer, Singapore. https://doi.org/10.1007/978-981-13-7986-4_18

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-7986-4_18

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-7985-7

  • Online ISBN: 978-981-13-7986-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics