Stochastic Optimal Coordination of Small UAVs for Target Tracking using Regression-based Dynamic Programming

Quintero, Steven A. P.; Ludkovski, Michael; Hespanha, João P.

doi:10.1007/s10846-015-0270-7

Stochastic Optimal Coordination of Small UAVs for Target Tracking using Regression-based Dynamic Programming

Published: 05 October 2015

Volume 82, pages 135–162, (2016)
Cite this article

Journal of Intelligent & Robotic Systems Aims and scope Submit manuscript

Steven A. P. Quintero¹,
Michael Ludkovski² &
João P. Hespanha¹

533 Accesses
23 Citations
Explore all metrics

Abstract

We study the problem of optimally coordinating multiple fixed-wing UAVs to perform vision-based target tracking, which entails that the UAVs are tasked with gathering the best joint vision-based measurements of an unpredictable ground target. We utilize an analytic expression for the error covariance associated with the fused measurements of the target’s position, and we employ stochastic fourth-order models for all vehicles, thereby incorporating a high degree of realism into the problem formulation. While dynamic programming can generate an optimal control policy that minimizes the expected value of the fused geolocation error covariance over time, it is accompanied by significant computational challenges due to the curse of dimensionality. In order to circumvent this challenge, we present a novel policy generation technique that combines simulation-based policy iteration with a robust regression scheme. The resulting control policy offers a significant advantage over alternative approaches and shows that the optimal control strategy involves coordinating the UAVs’ distances to the target rather than their viewing angles, which had been a common practice in target tracking.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

UAV Routing and Coordination in Stochastic, Dynamic Environments

Cooperative Mission and Path Planning for a Team of UAVs

An anytime Visibility–Voronoi graph-search algorithm for generating robust and feasible unmanned surface vehicle paths

Article 23 September 2022

References

Mallick, M.: Geolocation using video sensor measurements. In: IEEE Int. Conf. Information Fusion. Quebec, Canada (2007)
Collins, G.E., Stankevitz, C.R., Liese, J.: Implementation of a sensor guided flight algorithm for target tracking by small UAS. In: Ground/Air Multi-Sens. Interoperability, Integration, Netw. Persistent ISR II, vol. 8047, SPIE (2011)
Kingston, D.B.: Decentralized Control of Multiple UAVs for perimeter and target surveillance. PhD thesis, Brigham Young University (2007)
Gu, G., Chandler, P.R., Schumacher, C.J., Sparks, A., Pachter, M.: Optimum cooperative UAV sensing using a team of UAVs. IEEE Trans. Aerosp. Electron. Syst. 42, 1446–1458 (2006)
Article Google Scholar
Rysdyk, R.: UAV path following for constant line-of-sight. In: Proc. 2nd AIAA Unmanned Unltd. Syst. Technol. Operations Aerosp, Land Sea Conf (2003)
Frew, E.W.: Lyapunov guidance vector fields for unmanned aircraft applications. In: Am. Control Conf. (2007)
Kim, S., Oh, H., Tsourdos, A.: Nonlinear model predictive coordinated standoff tracking of a moving ground vehicle. J. Guid. Control Dyn. 36(2), 557–566 (2013)
Article Google Scholar
Ma, L., Hovakimyan, N.: Cooperative target tracking in balanced circular formation: Multiple UAVs tracking a ground vehicle. In: Am. Control Conf., pp 5386–5391. IEEE (2013)
Summers, T.H.: Cooperative Shape and Orientation Control of Autonomous Vehicle Formations. PhD thesis, University of Texas at Austin (2010)
Oh, H., Kim, S., Tsourdos, A., White, B.A.: Decentralised standoff tracking of moving targets using adaptive sliding mode control for UAVs. J. Intell. Robot. Syst., 1– 15 (2013)
Peterson, C., Paley, D.A.: Multivehicle coordination in an estimated time-varying flowfield. J. Guid. Control Dyn. 34(1), 177–191 (2011)
Article Google Scholar
Anderson, R., Milutinović, D.: A stochastic approach to Dubins feedback control for target tracking. In: IEEE / RSJ Conf. Intell. Robots Syst., pp 3917–3922 (2011)
Quintero, S.A.P., Hespanha, J.P.: Vision-based target tracking with a small UAV: Optimization-based control strategies. Control Eng. Pract. 32, 28–42 (2014)
Article Google Scholar
Miller, S.A., Harris, Z.A., Chong, E.K.P.: A POMDP framework for coordinated guidance of autonomous UAVs for multitarget tracking, EURASIP. J. Adv. Signal Process., 1–17 (2009)
Thrun, S., Burgard, W., Fox, D.: Probabilistic Robotics. The MIT Press (2005)
Stachura, M., Carfang, A., Frew, E.W.. In: Workshop Robotic Wirel. Sens. Netw., (Marina Del Ray, CA) Cooperative target tracking with a communication limited active sensor network (2009)
Ding, C., Morye, A.A., Farrell, J.A., Roy-Chowdhury, A. K.: Coordinated sensing and tracking for mobile camera platforms. In: Am. Control Conf., pp. 5114–5119, IEEE (2012)
Quintero, S.A.P., Papi, F., Klein, D.J., Chisci, L., Hespanha, J.P.: Optimal UAV coordination for target tracking using dynamic programming. In: IEEE Conf. Decis. Control, (Atlanta, GA) (2010)
Lalish, E., Morgansen, K., Tsukamaki, T.: Oscillatory control for constant-speed unicycle-type vehicles. In: IEEE Conf. Decis. Control (2007)
Regina, N., Zanzi, M.: UAV guidance law for ground-based target trajectory tracking and loitering. In: Aerosp. Conf., IEEE (2011)
Quintero, S.A.P., Collins, G.E., Hespanha, J.P.: Flocking with fixed-wing UAVs for distributed sensing: A stochastic optimal control approach. In: Am. Control Conf., (Washington, D.C.) (2013)
Bouchard, B., Warin, X.: Monte-carlo valorisation of American options: facts and new algorithms to improve existing methods. In: R. Carmona, P. Del Moral, P. Hu, N. Oudjane (eds.) Numer. Methods Finance,Springer Proc. Math. (2011)
Lewis, F. L., Vrabie, D., Syrmos, V.L.: 3rd. John Wiley, Hoboken, New Jersey (2012)
Guestrin, C., Hauskrecht, M., Kveton, B.: Solving factored mdps with continuous and discrete variables. In: Proc. 20th Conf. on Uncertain. in Artif. Intell., 235–242, AUAI Press (2004)
Wiegerinck, W., Broek, B.v.d., Kappen, H.: Stochastic optimal control in continuous space-time multi-agent systems. In: Proc. 22nd Conf. on Uncertain. in Artif. Intell. (2006)
Longstaff, F.A., Schwartz, E.S.: Valuing American options by simulation: A simple least-squares approach. Rev. Financial Stud. 14(1), 113–147 (2001)
Article Google Scholar
Egloff, D.: Monte carlo algorithms for optimal stopping and statistical learning. Ann. Appl. Probab. 15(2), 1396–1432 (2005)
Article MathSciNet MATH Google Scholar
Ludkovski, M., Niemi, J.: Optimal dynamic policies for influenza management, Statistical Commun. Infect. Dis. (2010)
Bertsekas, D.P., Dynamic Programming and Optimal Control, vol. 2. Belmont, MA: Athena Scientific (2012)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning, 2nd edn. Springer (2009)
Belomestny, D., Kolodko, A., Schoenmakers, J.: Regression methods for stochastic control problems and their convergence analysis. SIAM J. on Control and Optim. 48(5), 3562–3588 (2010)
Article MathSciNet MATH Google Scholar
Vrabie, D., Vamvoudakis, K.G., Lewis, F.L.: Optimal adaptive control and differential games by reinforcement learning principles, vol. 81. IET (2013)
Oh, H., Kim, S., Shin, H.-S., Tsourdos, A., White, B.: Coordinated standoff tracking of groups of moving targets using multiple UAVs. In: Control & Automation (MED), 2013 21st Mediterranean Conf. on, 969–977, IEEE (2013)
Gramacy, R.B., Ludkovski, M.: Sequential design for optimal stopping problems,” SIAM J. on Financial Math. Note Accepted subject to minor revision (2015)
Carrillo, L., Russell, W., Hespanha, J., Collins, G.: State estimation of multiagent systems under impulsive noise and disturbances. IEEE Trans. Control Syst. Technol. 23, 13–26 (2015)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, University of California, Santa Barbara, CA 93106, USA
Steven A. P. Quintero & João P. Hespanha
Department of Statistics and Applied Probability, University of California, Santa Barbara, CA 93106, USA
Michael Ludkovski

Authors

Steven A. P. Quintero
View author publications
You can also search for this author in PubMed Google Scholar
Michael Ludkovski
View author publications
You can also search for this author in PubMed Google Scholar
João P. Hespanha
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Steven A. P. Quintero.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Quintero, S.A.P., Ludkovski, M. & Hespanha, J.P. Stochastic Optimal Coordination of Small UAVs for Target Tracking using Regression-based Dynamic Programming. J Intell Robot Syst 82, 135–162 (2016). https://doi.org/10.1007/s10846-015-0270-7

Download citation

Received: 02 April 2015
Accepted: 31 August 2015
Published: 05 October 2015
Issue Date: April 2016
DOI: https://doi.org/10.1007/s10846-015-0270-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Stochastic Optimal Coordination of Small UAVs for Target Tracking using Regression-based Dynamic Programming

Abstract

Access this article

Similar content being viewed by others

UAV Routing and Coordination in Stochastic, Dynamic Environments

Cooperative Mission and Path Planning for a Team of UAVs

An anytime Visibility–Voronoi graph-search algorithm for generating robust and feasible unmanned surface vehicle paths

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Stochastic Optimal Coordination of Small UAVs for Target Tracking using Regression-based Dynamic Programming

Abstract

Access this article

Similar content being viewed by others

UAV Routing and Coordination in Stochastic, Dynamic Environments

Cooperative Mission and Path Planning for a Team of UAVs

An anytime Visibility–Voronoi graph-search algorithm for generating robust and feasible unmanned surface vehicle paths

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation