SVM Approximation of Value Function Contours in Target Hitting Problems

Chapel, Laetitia; Deffuant, Guillaume

doi:10.1007/978-3-642-31353-0_3

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 174)

Abstract

In a target hitting problem, the capture basin at cost c is the set of states that can reach the target at a cost lower than or equal to c without breaking the viability constraints. The boundary of a c-capture basin is the c-contour of the value function of the problem. In this paper, we propose a new algorithm that solves target hitting problems by iteratively approximating capture basins at successive costs. We show that, by a simple change of variables, minimising a cost may be reduced to a time minimisation problem, and hence a recursive backward procedure can be set up. Two variants of the algorithm are derived, one providing an approximation from inside (the approximation is included in the actual capture basin) and one providing an outer approximation, which allows one to assess the approximation error. We use a machine learning algorithm (as a particular case, we consider Support Vector Machines) trained on the points of a grid with Boolean labels, and we state the conditions on the machine learning procedure that guarantee the convergence of the approximations towards the actual capture basin when the resolution of the grid tends to 0. Moreover, we define a control procedure which uses the set of capture basin approximations to drive a point into the target. When the inner approximation is used, the procedure is guaranteed to hit the target, and when the resolution of the grid tends to 0, the controller tends to the optimal one (minimising the cost to hit the target). We illustrate the method on two simple examples, the Zermelo and car-on-the-hill problems.
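
The backward recursion and the controller described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the one-dimensional dynamics, the constants, and every function name below are hypothetical choices made for illustration, the cost is taken to be elapsed time (the form the abstract says a general cost can be reduced to), and a nearest-grid-point lookup stands in for the SVM classifier. The sketch does not reproduce the inner or outer approximation guarantees.

```python
# Toy 1D target hitting problem. All constants and names are illustrative
# assumptions, not taken from the chapter.
H = 0.05                      # grid resolution
N = 20                        # grid points x_i = i * H cover K = [0, 1]
DT = 0.1                      # time step of the backward recursion
CONTROLS = (-1.0, 0.0, 1.0)   # admissible controls
DRIFT = -0.5                  # constant current pushing the state to the left
EPS = 1e-9                    # tolerance for floating-point comparisons


def step(x, u):
    """One Euler step of the assumed dynamics x' = u + DRIFT."""
    return x + (u + DRIFT) * DT


def in_constraint_set(x):
    """Viability constraint K = [0, 1]."""
    return -EPS <= x <= 1.0 + EPS


def in_target(x):
    """Target set [0.9, 1]."""
    return in_constraint_set(x) and x >= 0.9 - EPS


def make_classifier(labels):
    """Stand-in for the learned classifier: label of the nearest grid point."""
    def classify(x):
        if not in_constraint_set(x):
            return False
        return labels[min(N, max(0, round(x / H)))]
    return classify


def capture_basins(n_steps):
    """Boolean grid labels of the capture basins at horizons 0, DT, 2*DT, ..."""
    basins = [[in_target(i * H) for i in range(N + 1)]]
    for _ in range(n_steps):
        previous = basins[-1]
        classify = make_classifier(previous)
        # A grid point joins the next basin if it already belongs to the
        # previous one or if some control moves it into the previous one.
        basins.append([
            previous[i] or any(classify(step(i * H, u)) for u in CONTROLS)
            for i in range(N + 1)
        ])
    return basins


def horizon(x, basins):
    """Smallest k such that x is classified inside the k-th basin, else None."""
    for k, labels in enumerate(basins):
        if make_classifier(labels)(x):
            return k
    return None


def drive_to_target(x, basins):
    """Repeatedly pick a control that lands in the basin one level lower."""
    trajectory = [x]
    while True:
        k = horizon(x, basins)
        if k is None:
            raise ValueError("state is outside every computed basin")
        if k == 0:
            return trajectory
        classify = make_classifier(basins[k - 1])
        x = next(step(x, u) for u in CONTROLS if classify(step(x, u)))
        trajectory.append(x)


basins = capture_basins(18)
trajectory = drive_to_target(0.0, basins)
print(len(trajectory) - 1, "steps, final state", round(trajectory[-1], 3))
```

In this toy setting each recursion step adds one grid point to the basin, so the state x = 0 needs 18 steps of length DT, which matches the distance 0.9 divided by the net rightward speed 0.5. To move closer to the approach described in the abstract, `make_classifier` would be replaced by a classifier trained on the labelled grid points.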

Author information

Authors and Affiliations

Lab-STICC, Université Européenne de Bretagne, Université de Bretagne Sud, 56017, Vannes Cedex, France
Laetitia Chapel
Laboratoire d’Ingénierie pour les Systèmes Complexes, Cemagref, 63172, Aubière Cedex, France
Guillaume Deffuant

Corresponding author

Correspondence to Laetitia Chapel.

Editor information

Editors and Affiliations

Institut des Sciences et Techniques de l’Ingénieur d’Angers (ISTIA), Labo. d’Ingénierie des Systèmes, avenue Notre Dame du Lac 62, Angers, 49000, France
Jean-Louis Ferrier
CNRS, UMR 6597, IRCCN, Ecole Centrale de Nantes, rue de la Noe 1, Nantes CX 03, 44321, France
Alain Bernard
Ford Research & Adv. Engineering, RIC Building, 2101 Village Rd., Dearborn, 48124, Michigan, USA
Oleg Gusikhin
Images, Signals and Intelligence, University Paris-Est Creteil (UPEC), LISSI EA 3956, Paris, 77127, France
Kurosh Madani

About this chapter

Cite this chapter

Chapel, L., Deffuant, G. (2013). SVM Approximation of Value Function Contours in Target Hitting Problems. In: Ferrier, JL., Bernard, A., Gusikhin, O., Madani, K. (eds) Informatics in Control, Automation and Robotics. Lecture Notes in Electrical Engineering, vol 174. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31353-0_3

DOI: https://doi.org/10.1007/978-3-642-31353-0_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31352-3
Online ISBN: 978-3-642-31353-0
eBook Packages: Engineering, Engineering (R0)
