Abstract
The orbital pursuit-evasion game is typically formulated as a complete-information game, which assumes the payoff functions of the two players are common knowledge. However, realistic pursuit-evasion games typically have incomplete information, in which the lack of payoff information limits the player’s ability to play optimally. To address this problem, this paper proposes a currently optimal escape strategy based on estimation for the evader. In this strategy, the currently optimal evasive controls are first derived based on the evader’s guess of the pursuer’s payoff weightings. Then an online parameter estimation method based on a modified strong tracking unscented Kalman filter is employed to modify the guess and update the strategy during the game. As the estimation becomes accurate, the currently optimal strategy gets closer to the actually optimal strategy. Simulation results show the proposed strategy can achieve optimal evasive controls progressively and the evader’s payoff of the strategy is lower than that of the zero-sum escape strategy. Meanwhile, the proposed strategy is also effective in the case where the pursuer changes its payoff function halfway during the game.
Similar content being viewed by others
References
Buslov E P, Komarov I S, Selivanov V V, et al. Protection of inflatable modules of orbital stations against impacts of particles of space debris. Acta Astronaut, 2019, 163: 54–61
Gong B, Li W, Li S, et al. Angles-only initial relative orbit determination algorithm for non-cooperative spacecraft proximity operations. Astrodynamics, 2018, 2: 217–231
Yang Y, Wang L, Modares H, et al. Data-driven integral reinforcement learning for continuous-time non-zero-sum games. IEEE Access, 2019, 7: 82901–82912
Sun C H, Duan H B. Markov decision evolutionary game theoretic learning for cooperative sensing of unmanned aerial vehicles. Sci China Tech Sci, 2015, 58: 1392–1400
Prince E R, Hess J A, Cobb R G, et al. Elliptical orbit proximity operations differential games. J Guid Control Dyn, 2019, 42: 1458–1472
Shen H X, Casalino L. Revisit of the three-dimensional orbital pursuit-evasion game. J Guid Control Dyn, 2018, 41: 1823–1831
Basar T, Olsder G J. Dynamic Noncooperative Game Theory. Philadelphia: Society for Industrial and Applied Mathematics, 1999. 10–50
Salimi M, Ferrara M. Differential game of optimal pursuit of one evader by many pursuers. Int J Game Theory, 2019, 48: 481–490
Ibragimov G I, Salimi M, Amini M. Evasion from many pursuers in simple motion differential game with integral constraints. Eur J Oper Res, 2012, 218: 505–511
Satak N. Behavior learning in differential game and reorientation maneuvers. Dissertation for Doctoral Degree. Galveston. Qatar: Texa A&M University, 2013. 1–11
Luo Y Z, Li Z Y, Zhu H. Survey on spacecraft orbital pursuit-evasion differential games (in Chinese with English abstract). Sci Sin Tech, 2020, doi: https://doi.org/10.1360/SST-2019-0174
Pontani M, Conway B A. Numerical solution of the three dimensional orbital pursuit-evasion game. J Guid Control Dyn, 2009, 32: 474–487
Hafer W T, Reed H L, Turner J D, et al. Sensitivity methods applied to orbital pursuit evasion. J Guid Control Dyn, 2015, 38: 1118–1126
Li Z Y, Zhu H, Yang Z, et al. A dimension-reduction solution of free-time differential games for spacecraft pursuit-evasion. Acta Astronaut, 2019, 163: 201–210
Li Z Y, Zhu H, Yang Z, et al. Saddle point of orbital pursuit-evasion game under J2-perturbed dynamics. J Guid Control Dyn, 2020, 43: 1733–1739
Liu Y F, Li R F, Wang S Q. Orbital three-player differential game using semi-direct collocation with nonlinear programming. In: 2nd International Conference on Control Science and Systems Engineering. Singapore, 2016. 217–222
Engwerda J. LQ Dynamic Optimization and Differential Games. Hoboken: John Wiley & Sons, Ltd, 2005. 264–340
Sun Y, Ding C, Zhu Y. Saddle point equilibrium under uncertain environment. Math Meth Appl Sci, 2018, 41: 2063–2073
Tur A V. Strategic stability in linear-quadratic differential games with nontransferable payoffs. Autom Remote Control, 2017, 78: 379–388
Mageirou E. Values and strategies for infinite time linear quadratic games. IEEE Trans Automat Contr, 1976, 21: 547–550
Jagat A, Sinclair A J. Optimization of spacecraft pursuit-evasion game trajectories in the Euler-Hill reference frame. In: Proceedings of AIAA/AAS Astrodynamics Specialist Conference 2014. San Diego, 2014. 1–20
Jagat A, Sinclair A J. Nonlinear control for spacecraft pursuit-evasion game using state-dependent Riccati equation method. IEEE Trans Aerosp Electron Syst, 2017, 53: 3032–3042
Satak N, Hurtado J E. A framework for behavior learning in differential games and safe strategies. In: Proceedings of 52nd Aerospace Sciences Meeting AIAA Science and Technology Forum and Exposition. National Harbor, 2014. 1–15
Woodbury T, Hurtado J E. Adaptive play via estimation in uncertain nonzero-sum orbital pursuit-evasion games. In: Proceedings of AIAA SPACE and Astronautics Forum and Exposition. Orlando, 2017. 1–15
Zhu H, Luo Y Z, Li Z Y, et al. Orbital pursuit-evasion game with incomplete information in the Hill reference frame. In: Proceedings of 27th International Symposium on Space Flight Dynamics (ISSFD). Melbourne, 2019. 1–6
Yang Z, Luo Y Z, Zhang J. Second-order analytical solution of relative motion in J2-perturbed elliptic orbits. J Guid Control Dyn, 2018, 41: 2258–2270
Sullivan J, Grimberg S, D’Amico S. Comprehensive survey and assessment of spacecraft relative motion dynamics models. J Guid Control Dyn, 2017, 40: 1837–1859
Bryson A E, Ho Y C. Applied Optimal Control: Optimization, Estimation, and Control, Hemisphere. New York: John Wiley & Sons, 1975: 20–100
Yang J N, Lin S, Huang H, et al. An adaptive extended Kalman filter for structural damage identification. Struct Control Health Monit, 2006, 13: 849–867
Song Y, Campa G, Napolitano M, et al. Online parameter estimation techniques comparison within a fault tolerant flight control system. J Guid Control Dyn, 2002, 25: 528–537
Liu Y, Yu A X, Zhu J B, et al. Unscented Kalman filtering in the additive noise case. Sci China Tech Sci, 2010, 53: 929–941
Wen S, Qi H, Wang Y F, et al. Prediction of the coupled heat radiation and conduction parameters and boundary condition using the unscented Kalman filter. Sci China Tech Sci, 2020, 63: 422–433
Ge Z X, Yang Y M, Wen X S. Strong tracking UKF method and its application in fault identification (in Chinese). Chin J Sci Instr, 2008, 29: 1670–1674
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was supported by the National Natural Science Foundation of China (Grant Nos. 11572345 & 11972044) and the Program of National University of Defense Technology (Grant No. ZK18-03-07).
Rights and permissions
About this article
Cite this article
Li, Z., Zhu, H. & Luo, Y. An escape strategy in orbital pursuit-evasion games with incomplete information. Sci. China Technol. Sci. 64, 559–570 (2021). https://doi.org/10.1007/s11431-020-1662-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11431-020-1662-0