Safe Learning with Real-Time Constraints: A Case Study

Metta, Giorgio; Natale, Lorenzo; Pathak, Shashank; Pulina, Luca; Tacchella, Armando

doi:10.1007/978-3-642-13022-9_14

Giorgio Metta^24,25,
Lorenzo Natale²⁵,
Shashank Pathak²⁴,
Luca Pulina²⁴ &
…
Armando Tacchella²⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6096))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

2126 Accesses

Abstract

Aim of this work is to study the problem of ensuring safety and effectiveness of a multi-agent robot control system with real-time constraints in the case of learning components usage. Our case study focuses on a robot playing the air hockey game against a human opponent, where the robot has to learn how to minimize opponent’s goals. This case study is paradigmatic since the robot must act in real-time, but, at the same time, it must learn and guarantee that the control system is safe throughout the process. We propose a solution using automata-theoretic formalisms and associated verification tools, showing experimentally that our approach can yield safety without heavily compromising effectiveness.

This research has received funding from the European Community’s Information and Communication Technologies Seventh Framework Programme [FP7/2007-2013] under grant agreement n. [215805], the CHRIS project.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Kramer, J., Scheutz, M.: Development environments for autonomous mobile robots: A survey. Autonomous Robots 22(2), 101–132 (2007)
Article Google Scholar
Bagnell, J.A., Schaal, S.: Special issue on Machine Learning in Robotics (Editorial). The International Journal of Robotics Research 27(2), 155–156 (2008)
Article Google Scholar
Clarke, E.M., Grumberg, O., Peled, D.A.: Model checking. Springer, Heidelberg (1999)
Google Scholar
Kern, C., Greenstreet, M.R.: Formal verification in hardware design: a survey. ACM Transactions on Design Automation of Electronic Systems (TODAES) 4(2), 123–193 (1999)
Article Google Scholar
Visser, W., Havelund, K., Brat, G., Park, S.J., Lerda, F.: Model checking programs. Automated Software Engineering 10(2), 203–232 (2003)
Article Google Scholar
Plaku, E., Kavraki, L.E., Vardi, M.Y.: Hybrid systems: From verification to falsification. In: Damm, W., Hermanns, H. (eds.) CAV 2007. LNCS, vol. 4590, pp. 463–476. Springer, Heidelberg (2007)
Chapter Google Scholar
Bentivegna, D.C., Atkeson, C.G., Cheng, G.: Learning tasks from observation and practice. Robotics and Autonomous Systems 47(2-3), 163–169 (2004)
Article Google Scholar
Alur, R., Courcoubetis, C., Henzinger, T.A., Ho, P.H.: Hybrid automata: An algorithmic approach to the specification and verification of hybrid systems. LNCS, pp. 209–229. Springer, Heidelberg (1993)
Google Scholar
Franzle, M., Herde, C., Teige, T., Ratschan, S., Schubert, T.: Efficient solving of large non-linear arithmetic constraint systems with complex boolean structure. Journal on Satisfiability, Boolean Modeling and Computation 1, 209–236 (2007)
Google Scholar
Metta, G., Fitzpatrick, P., Natale, L.: YARP: yet another robot platform. International Journal on Advanced Robotics Systems 3(1), 43–48 (2006)
Google Scholar
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.H.: The weka data mining software: An update. SIGKDD Explorations 11, 10–18 (2009)
Article Google Scholar
Shevade, S.K., Keerthi, S.S., Bhattacharyya, C., Murthy, K.R.K.: Improvements to the SMO algorithm for SVM regression. IEEE Transactions on Neural Networks 11(5), 1188–1193 (2000)
Article Google Scholar
Smith, D.J., Simpson, K.G.L.: Functional safety: a straightforward guide to applying IEC 61508 and related standards. Butterworth-Heinemann (2004)
Google Scholar
Pappas, G., Kress-Gazit, H. (eds.): ICRA Workshop on Formal Methods in Robotics and Automation (2009)
Google Scholar
Cervera, E., Garcia-Aracil, N., Martinez, E., Nomdedeu, L., del Pobil, A.P.: Safety for a robot arm moving amidst humans by using panoramic vision. In: IEEE International Conference on Robotics and Automation, ICRA 2008, pp. 2183–2188 (2008)
Google Scholar
Gordon, D.F.: Asimovian adaptive agents. Journal of Artificial Intelligence Research 13, 95–153 (2000)
MathSciNet MATH Google Scholar
Perkins, T.J., Barto, A.G.: Lyapunov design for safe reinforcement learning. The Journal of Machine Learning Research 3, 803–832 (2003)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

DIST, Università di Genova, Viale Causa, 13, 16145, Genova, Italy
Giorgio Metta, Shashank Pathak, Luca Pulina & Armando Tacchella
Italian Institute of Technology, Via Morego 30, 16163, Genova, Italy
Giorgio Metta & Lorenzo Natale

Authors

Giorgio Metta
View author publications
You can also search for this author in PubMed Google Scholar
Lorenzo Natale
View author publications
You can also search for this author in PubMed Google Scholar
Shashank Pathak
View author publications
You can also search for this author in PubMed Google Scholar
Luca Pulina
View author publications
You can also search for this author in PubMed Google Scholar
Armando Tacchella
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computing and Numerical Analysis, University of Cordoba, Campus Universitario de Rabanales, Einstein Building, 3rd floor, 14071, Cordoba, Spain
Nicolás García-Pedrajas
Dept. of Computer Science and Artificial Intelligence, ETS de Ingenierias Informática y de Telecomunicación, University of Granada, 18071, Granada, Spain
Francisco Herrera
School of Computing, University of the West of Scotland, PA1 2BE, Paisley, UK
Colin Fyfe
Dept. Computer Science and Artificial Intelligence, ETS de Ingenierias Informática y de Telecomunicación, University of Granada, 18071, Granada, Spain
José Manuel Benítez
Department of Computer Science, Texas State University-San Marcos, 601 University Drive, TX 78666-4616, San Marcos, USA
Moonis Ali

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Metta, G., Natale, L., Pathak, S., Pulina, L., Tacchella, A. (2010). Safe Learning with Real-Time Constraints: A Case Study. In: García-Pedrajas, N., Herrera, F., Fyfe, C., Benítez, J.M., Ali, M. (eds) Trends in Applied Intelligent Systems. IEA/AIE 2010. Lecture Notes in Computer Science(), vol 6096. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13022-9_14

Download citation

DOI: https://doi.org/10.1007/978-3-642-13022-9_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13021-2
Online ISBN: 978-3-642-13022-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics