Abstract
Traditional command-and-control, which depends on a ground station, is difficult to adapt to the highly dynamic and uncertain environment of UAV air combat. Previous research on autonomous UAV maneuver decision-making has been limited by oversimplified assumptions, large and complex computations, and a lack of flexibility. For an air combat scenario between a Red and a Blue UAV, a three-dimensional UAV air combat model based on a Markov Decision Process (MDP) is established. We train the Red UAV with the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm to complete air combat missions autonomously, and improve its performance through scenario-transfer training and self-play training. As the training scenario is gradually transferred from simple to complex, the Red UAV steadily builds on previous experience and improves its capabilities, and its level of intelligence is further raised through self-play. Simulation results show that the proposed maneuver decision-making model and training method enable the drone to learn effective strategies that gain an advantage and defeat opponents in confrontation.
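The scenario-transfer idea described above can be sketched as a curriculum loop in which one agent's parameters are carried across progressively harder scenarios. The sketch below is a minimal toy illustration, not the authors' implementation: `ToyAgent`, `make_scenario`, and `scenario_transfer_train` are hypothetical names, and the one-parameter "policy" merely stands in for a TD3 actor-critic to show how experience transfers between stages.

```python
# Minimal sketch of scenario-transfer (curriculum) training.
# All names here are illustrative assumptions, not the paper's code.
import random

class ToyAgent:
    """Stand-in for a TD3 agent: a single-parameter threshold policy on a 1-D state."""
    def __init__(self):
        self.threshold = 0.0  # policy parameter carried across scenarios

    def act(self, state):
        return 1.0 if state > self.threshold else -1.0

    def update(self, state, reward):
        # Crude update rule: move the threshold toward states that were rewarded.
        if reward > 0:
            self.threshold += 0.1 * (state - self.threshold)

def make_scenario(difficulty):
    """Each scenario is a reward function; higher difficulty adds more noise."""
    def reward_fn(state, action):
        target = 1.0 if state > 0.5 else -1.0
        if action == target:
            return 1.0
        return -1.0 + random.uniform(-difficulty, difficulty)
    return reward_fn

def scenario_transfer_train(agent, difficulties, episodes_per_stage=200):
    """Train on scenarios ordered from simple to complex, reusing the same agent
    so that experience from earlier stages seeds the later, harder ones."""
    for d in difficulties:
        reward_fn = make_scenario(d)
        for _ in range(episodes_per_stage):
            s = random.random()
            r = reward_fn(s, agent.act(s))
            agent.update(s, r)
    return agent
```

In the paper's setting the stages would be increasingly capable Blue opponents (culminating in self-play against a copy of the trained Red policy), and the agent would be a full TD3 actor-critic rather than a threshold rule; only the outer transfer loop is meant to carry over.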
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Cite this paper
Jin, Q., Gao, X., Guo, Z., Hou, Z. (2022). Autonomous Maneuver Decision of UAV in Air Combat Based on Scenario-Transfer Deep Reinforcement Learning. In: Wu, M., Niu, Y., Gu, M., Cheng, J. (eds) Proceedings of 2021 International Conference on Autonomous Unmanned Systems (ICAUS 2021). ICAUS 2021. Lecture Notes in Electrical Engineering, vol 861. Springer, Singapore. https://doi.org/10.1007/978-981-16-9492-9_257
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-9491-2
Online ISBN: 978-981-16-9492-9
eBook Packages: Intelligent Technologies and Robotics (R0)