Abstract
The classical setting of optimal control theory assumes full knowledge of the process dynamics and the costs associated with every control strategy. The problem becomes much harder if the controller only knows a finite set of possible running cost functions, but has no way of checking which of these running costs is actually in place. In this paper we address this challenge for a class of evasive path planning problems on a continuous domain, in which an evader needs to reach a target while minimizing his exposure to an enemy observer, who is in turn selecting from a finite set of known surveillance plans. Our key assumption is that both the evader and the observer need to commit to their (possibly probabilistic) strategies in advance and cannot immediately change their actions based on any newly discovered information about the opponent’s current position. We consider two types of evader behavior: in the first one, a completely risk-averse evader seeks a trajectory minimizing his worst-case cumulative observability, and in the second, the evader is concerned with minimizing the average-case cumulative observability. The latter version is naturally interpreted as a semi-infinite strategic game, and we provide an efficient method for approximating its Nash equilibrium. The proposed approach draws on methods from game theory, convex optimization, optimal control, and multiobjective dynamic programming. We illustrate our algorithm using numerical examples and discuss the computational complexity, including for the generalized version with multiple evaders.
Similar content being viewed by others
Notes
Fast sweeping [39] is another popular approach for gaining efficiency in solving Eikonal equations. We refer readers to [7, 8] for a review of many other “fast” techniques, including the hybrid marching/sweeping methods that aim to combine the best features of both approaches. Even though our own implementation is based on fast marching, any of these methods can be used to solve isotropic control problems arising in subsequent sections. Which one will be faster depends on the domain geometry and the particular pointwise observability functions.
Here, we describe these methods in terms of exposure to different observer’s positions, but both of them were introduced for much more general multiobjective control problems. In many applications, it is necessary to balance completely different criteria, e.g., time vs fuel vs money vs threat, etc. Other methods for approximating the full PF can be found in [13] and [18].
This result assumes that the set \(S=\{ (s_1, s_2, \ldots , s_r) \mid s_i = P({\hat{\varvec{x}}}_i, \varvec{a}( \cdot )); i = 1,2,\dots ,r; \varvec{a}( \cdot )\in {\mathcal {A}}\} \subset {\mathbb {R}}^r \) is bounded and co(S) is closed. In our case, S is not bounded for the full set of control functions in \({\mathcal {A}}\) but becomes bounded if we restrict our attention to Pareto optimal control functions.
References
Alton K, Mitchell IM (2012) An ordered upwind method with precomputed stencil and monotone node acceptance for solving static convex Hamilton–Jacobi equations. J Sci Comput 51:313–348
Bardi M, Capuzzo-Dolcetta I (2008) Optimal control and viscosity solutions of Hamilton–Jacobi-Bellman equations. Springer, Berlin
Beck A (2017) First-order methods in optimization, vol 25. SIAM, Philadelphia
Brucker P (1984) An \(O(n)\) algorithm for quadratic knapsack problems. Oper Res Lett 3:163–166
Carmona R, Delarue F (2017) Probabilistic theory of mean field games with applications I–II. Springer, Berlin
Cartee E, Lai L, Song Q, Vladimirsky A (2019) Time-dependent surveillance-evasion games, preprint arXiv:1903.01332
Chacon A, Vladimirsky A (2012) Fast two-scale methods for Eikonal equations. SIAM J Sci Comput 34:A547–A578
Chacon A, Vladimirsky A (2015) A parallel two-scale method for Eikonal equations. SIAM J Sci Comput 37:A156–A180
Clawson Z, Chacon A, Vladimirsky A (2014) Causal domain restriction for Eikonal equations. SIAM J Sci Comput 36:A2478–A2505
Clawson Z, Ding X, Englot B, Frewen TA, Sisson WM, Vladimirsky A (2015) A bi-criteria path planning algorithm for robotics applications, preprint arXiv:1511.01166
Crandall MG, Lions P-L (1983) Viscosity solutions of Hamilton–Jacobi equations. Trans Am Math Soc 277:1–42
Das I, Dennis JE (1997) A closer look at drawbacks of minimizing weighted sums of objectives for Pareto set generation in multicriteria optimization problems. Struct Optim 14:63–69
Desilles A, Zidani H (2018) Pareto front characterization for multi-objective optimal control problems using Hamilton-Jacobi approach, preprint
Dijkstra EW (1959) A note on two problems in connexion with graphs. Numer Math 1:269–271
Dobbie J (1966) Solution of some surveillance-evasion problems by the methods of differential games. In: Proceedings of the 4th international conference on operational research. MIT, Wiley, New York
Falcone M, Ferretti R (2014) Semi-Lagrangian approximation schemes for linear and Hamilton–Jacobi equations, vol 133. SIAM, Philadelphia
Guéant O, Lasry J-M, Lions P-L (2010) Mean field games and applications. Paris–Princeton lectures on mathematical finance. Springer, Berlin, pp 205–266
Guigue A (2014) Approximation of the pareto optimal set for multiobjective optimal control problems using viability kernels. ESAIM COCV 20:95–115
Kumar A, Vladimirsky A (2010) An efficient method for multiobjective optimal control and optimal control subject to integral constraints. J Comput Math 1:517–551
Lewin J (1973) Decoy in pursuit-evasion games. PhD thesis, Department of Aeronautics and Astronautics, Stanford University
Lewin J, Breakwell JV (1975) The surveillance-evasion game of degree. J Optim Theory Appl 16:339–353
Lewin J, Olsder GJ (1979) Conic surveillance evasion. J Optim Theory Appl 27:107–125
Marler RT, Arora JS (2004) Survey of multi-objective optimization methods for engineering. Struct Multidiscip Optim 26:369–395
Mirebeau J-M (2014) Efficient fast marching with Finsler metrics. Numer Math 126:515–557
Mitchell IM, Sastry S (2003) Continuous path planning with multiple constraints. In: 2003 42nd IEEE conference on decision and control, vol 5, pp 5502–5507
Osborne MJ, Rubinstein A (1994) A course in game theory. MIT press, Cambridge
Qi D, Vladimirsky A (2019) Corner cases, singularities, and dynamic factoring. J Sci Comput 79:1456–1476
Raghavan T (1994) Zero-sum two-person games. Handb Game Theory Econ Appl 2:735–768
Sethian JA (1996) A fast marching level set method for monotonically advancing fronts. In: Proceedings of the National Academy of Sciences, vol 93, pp 1591–1595
Sethian JA (1999) Level set methods and fast marching methods: evolving interfaces in computational geometry, fluid mechanics, computer vision, and materials science, vol 3. Cambridge University Press, Cambridge
Sethian JA, Vladimirsky A (2003) Ordered upwind methods for static Hamilton–Jacobi equations: theory and algorithms. SIAM J Numer Anal 41:325–363
Shu C-W (2007) High order numerical methods for time dependent Hamilton–Jacobi equations. Mathematics and computation in imaging science and information processing. World Scientific, Singapore, pp 47–91
Soner H (1986) Optimal control with state-space constraint. I SIAM J Control Optim 24:552–561
Takei R, Chen W, Clawson Z, Kirov S, Vladimirsky A (2015) Optimal control with budget constraints and resets. SIAM J Control Optim 53:712–744
Tijs S (1976) Semi-infinite linear programs and semi-infinite matrix games. Katholieke Universiteit Nijmegen, Mathematisch Instituut, Nijmegen
Tsai Y-HR, Cheng L-T, Osher S, Zhao H-K (2003) Fast sweeping algorithms for a class of Hamilton–Jacobi equations. SIAM J Numer Anal 41:673–694
Tsitsiklis JN (1995) Efficient algorithms for globally optimal trajectories. IEEE Trans Autom Control 40:1528–1538
Wang W, Carreira-Perpinán MA (2013) Projection onto the probability simplex: an efficient algorithm with a simple proof, and an application, arXiv preprint arXiv:1309.1541
Zhao H (2005) A fast sweeping method for Eikonal equations. Math Comput 74:603–627
Acknowledgements
The authors would like to thank Alex Townsend and anonymous reviewers for their helpful suggestions.
Funding
This work is supported in part by the National Science Foundation Grant DMS-1738010. The second author’s work is also supported by the Simons Foundation Fellowship.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gilles, M.A., Vladimirsky, A. Evasive Path Planning Under Surveillance Uncertainty. Dyn Games Appl 10, 391–416 (2020). https://doi.org/10.1007/s13235-019-00327-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13235-019-00327-x
Keywords
- Path planning
- Semi-infinite games
- Nash equilibrium
- Surveillance evasion
- Convex optimization
- Hamilton–Jacobi PDEs