Learning Throttle Valve Control Using Policy Search

Bischoff, Bastian; Nguyen-Tuong, Duy; Koller, Torsten; Markert, Heiner; Knoll, Alois

doi:10.1007/978-3-642-40988-2_4

Learning Throttle Valve Control Using Policy Search

Bastian Bischoff²³,
Duy Nguyen-Tuong²³,
Torsten Koller²³,
Heiner Markert²³ &
…
Alois Knoll²⁴

Conference paper

3963 Accesses
12 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8188))

Abstract

The throttle valve is a technical device used for regulating a fluid or a gas flow. Throttle valve control is a challenging task, due to its complex dynamics and demanding constraints for the controller. Using state-of-the-art throttle valve control, such as model-free PID controllers, time-consuming and manual adjusting of the controller is necessary. In this paper, we investigate how reinforcement learning (RL) can help to alleviate the effort of manual controller design by automatically learning a control policy from experiences. In order to obtain a valid control policy for the throttle valve, several constraints need to be addressed, such as no-overshoot. Furthermore, the learned controller must be able to follow given desired trajectories, while moving the valve from any start to any goal position and, thus, multi-targets policy learning needs to be considered for RL. In this study, we employ a policy search RL approach, Pilco [2], to learn a throttle valve control policy. We adapt the Pilco algorithm, while taking into account the practical requirements and constraints for the controller. For evaluation, we employ the resulting algorithm to solve several control tasks in simulation, as well as on a physical throttle valve system. The results show that policy search RL is able to learn a consistent control policy for complex, real-world systems.

Download to read the full chapter text

Chapter PDF

References

Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning). The MIT Press (1998)
Google Scholar
Deisenroth, P.M., Rasmussen, C.E.: PILCO: A Model-Based and Data-Efficient Approach to Policy Search. In: ICML, pp. 465–472 (2011)
Google Scholar
Wiering, M., van Otterlo, M.: Reinforcement Learning: State-of-the-Art. Adaptation, Learning, and Optimization. Springer (2012)
Google Scholar
Deisenroth, M.P.: Efficient Reinforcement Learning using Gaussian Processes. PhD Thesis, Karlsruhe (2010)
Google Scholar
Yuan, X., Wang, Y., Wu, L.: SVM-Based Approximate Model Control for Electronic Throttle Valve. Transactions on Vehicular Technology 57(5) (2008)
Google Scholar
Nentwig, M., Mercorelli, P.: Throttle valve control using an inverse local linear model tree based on a fuzzy neural network. In: 7th International Conference on Cybernetic Intelligent Systems (2008)
Google Scholar
Yuan, X., Wang, Y., Lianghong, W., Xizheng, X., Sun, W.: Neural Network Based Self-Learning Control Strategy for Electronic Throttle Valve. Transactions on Vehicular Technology 59(8) (2010)
Google Scholar
Seeger, M., Williams, C.K.I., Lawrence, N.D.: Fast Forward Selection to Speed Up Sparse Gaussian Process Regression. In: 9th International Workshop on Artificial Intelligence and Statistics (2003)
Google Scholar
Leontaritis, I.J., Billings, S.A.: Input-output Parametric Models for Nonlinear Systems. International Journal of Control 41, 303–344 (1985)
Article MathSciNet MATH Google Scholar
Griffiths, P.G.: Embedded Software Control Design for an Electronic Throttle Body. Master’s Thesis, Berkeley, California (2000)
Google Scholar
Deisenroth, M.P., Fox, D.: Multiple-Target Reinforcement Learning with a Single Policy. In: ICML Workshop on Planning and Acting with Uncertain Models (2011)
Google Scholar
Nakamura, H., Masashi, M.: Thottle valve positioning control apparatus. United States Patent 5,852,996 (1998)
Google Scholar
Al-samarraie, S.A., Abbas, Y.K.: Design of Electronic Throttle Valve Position Control System using Nonlinear PID Controller. International Journal of Computer Applications 59, 27–34 (2012)
Article Google Scholar
Wang, H., Yuan, X., Wang, Y., Yang, Y.: Harmony search algorithm-based fuzzy-PID controller for electronic throttle valve. Neural Computing and Applications 22, 329–336 (2013)
Article Google Scholar
Deisenroth, M.P., Rasmussen, C.E., Fox, D.: Learning to Control a Low-Cost Manipulator using Data-Efficient Reinforcement Learning. RSS (2011)
Google Scholar
Fisher Controls International LLC: Control Valve Handbook, 4th edn. (2005)
Google Scholar
Ting, J., D’Souza, A., Schaal, S.: A Bayesian Approach to Nonlinear Parameter Identification for Rigid-Body Dynamics. Neural Networks (2009)
Google Scholar
Garcia, C.: Comparison of Friction Models Applied to a Control Valve. Control Eng. Pract. 16(10), 1231–1243 (2008)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Corporate Research, Robert Bosch GmbH, Robert-Bosch-Str. 2, 71701, Schwieberdingen, Germany
Bastian Bischoff, Duy Nguyen-Tuong, Torsten Koller & Heiner Markert
Robotics and Embedded Systems, TU Munich, Boltzmannstr. 3, 85748, Garching at Munich, Germany
Alois Knoll

Authors

Bastian Bischoff
View author publications
You can also search for this author in PubMed Google Scholar
Duy Nguyen-Tuong
View author publications
You can also search for this author in PubMed Google Scholar
Torsten Koller
View author publications
You can also search for this author in PubMed Google Scholar
Heiner Markert
View author publications
You can also search for this author in PubMed Google Scholar
Alois Knoll
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan 200A, 3001, Leuven, Belgium
Hendrik Blockeel
Fraunhofer IAIS, Department of Knowledge Discovery, University of Bonn, Schloss Birlinghoven, 53754, Sankt Augustin, Germany
Kristian Kersting
LIACS, Universiteit Leiden, Niels Bohrweg 1, 2333 CA, Leiden, The Netherlands
Siegfried Nijssen
Department of Computer Science and Engineering, Czech Technical University, Technicka 2, 16627, Prague 6, Czech Republic
Filip Železný

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bischoff, B., Nguyen-Tuong, D., Koller, T., Markert, H., Knoll, A. (2013). Learning Throttle Valve Control Using Policy Search. In: Blockeel, H., Kersting, K., Nijssen, S., Železný, F. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2013. Lecture Notes in Computer Science(), vol 8188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40988-2_4

Download citation

DOI: https://doi.org/10.1007/978-3-642-40988-2_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40987-5
Online ISBN: 978-3-642-40988-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics