Abstract
In a problem of target hitting, the capture basin at cost c is the set of states that can reach the target with a cost lower or equal than c, without breaking the viability constraints. The boundary of a c-capture basin is the c-contour of the problem value function. In this paper, we propose a new algorithm that solves target hitting problems, by iteratively approximating capture basins at successive costs. We show that, by a simple change of variables, minimising a cost may be reduced to the problem of time minimisation, and hence a recursive backward procedure can be set. Two variants of the algorithm are derived, one providing an approximation from inside (the approximation is included in the actual capture basin) and one providing a outer approximation, which allows one to assess the approximation error. We use a machine learning algorithm (as a particular case, we consider Support Vector Machines) trained on points of a grid with boolean labels, and we state the conditions on the machine learning procedure that guarantee the convergence of the approximations towards the actual capture basin when the resolution of the grid decreases to 0. Moreover, we define a control procedure which uses the set of capture basin approximations to drive a point into the target. When using the inner approximation, the procedure guarantees to hit the target, and when the resolution of the grid tends to 0, the controller tends to the optimal one (minimizing the cost to hit the target). We illustrate the method on two simple examples, Zermelo and car on the hill problems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aubin, J.P.: Viability theory. Birkhäuser (1991)
Bayen, A.M., Crück, E., Tomlin, C.J.: Guaranteed Overapproximations of Unsafe Sets for Continuous and Hybrid Systems: Solving the Hamilton-Jacobi Equation Using Viability Techniques. In: Tomlin, C.J., Greenstreet, M.R. (eds.) HSCC 2002. LNCS, vol. 2289, pp. 90–104. Springer, Heidelberg (2002)
Cardaliaguet, P., Quincampoix, M., Saint-Pierre, P.: Optimal times for constrained nonlinear control problems without local optimality. Applied Mathematics & Optimization 36, 21–42 (1997)
Cardaliaguet, P., Quincampoix, M., Saint-Pierre, P.: Set-Valued Numerical Analysis for Optimal control and Differential Games. Annals of the International Society of Dynamic Games (1998)
Chang, C.C., Lin, C.J.: Libsvm: a library for support vector machines (2001)
Deffuant, G., Chapel, L., Martin, S.: Approximating viability kernels with support vector machines. IEEE Transactions on Automatic Control 52(5), 933–937 (2007)
Frankowska, H.: Optimal trajectories associated with a solution of the contingent hamilton-jacobi equation. Applied Mathematics and Optimization 19(1), 291–311 (1989)
Lhommeau, M., Jaulin, L., Hardouin, L.: Inner and outer approximation of capture basin using interval analysis. In: 4th International Conference on Informatics in Control, Automation and Robotics, ICINCO 2007 (2007)
Lygeros, J.: On reachability and minimum cost optimal control. Automatica 40, 917–927 (2004)
Mitchell, I., Bayen, A., Tomlin, C.: A time-dependent Hamilton-Jacobi formulation for reachable sets for continuous dynamic games. IEEE Transactions on Automatic Control 50(7), 947–957 (2005)
Moore, A., Atkeson, C.: The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces. Machine Learning 21, 199–233 (1995)
Saint-Pierre, P.: Approximation of the viability kernel. Applied Mathematics & Optimization 29(2), 187–209 (1994)
Saint-Pierre, P.: Approche ensembliste des systèmes dynamiques, regards qualitatifs et quantitatifs. Société de Mathématiques Appliquées et Industrielles, 66 (2001)
Scholkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond. MIT Press, Cambridge (2002)
Vapnik, V.: The nature of statistical learning theory. Springer (1995)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Chapel, L., Deffuant, G. (2013). SVM Approximation of Value Function Contours in Target Hitting Problems. In: Ferrier, JL., Bernard, A., Gusikhin, O., Madani, K. (eds) Informatics in Control, Automation and Robotics. Lecture Notes in Electrical Engineering, vol 174. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31353-0_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-31353-0_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31352-3
Online ISBN: 978-3-642-31353-0
eBook Packages: EngineeringEngineering (R0)