Abstract
Learning by confirmation is a new learning approach, which combines two types of supervised learning strategies: reinforcement learning and learning by examples. In this paper, we show how this new strategy accelerates the learning process when some knowledge is introduced to the reinforcement algorithm. The learning proposal has been tested on a real-time device, a Lego Mindstorms NXT 2.0 robot that has been configured as an inverted pendulum. The methodology shows good performance and the results are quite promising.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alpaydin E (2004) Introduction to machine learning. The MIT Press, Cambridge, MA
Russel S, Norvig P (2004) Artificial intelligence: a modern approach. Prentice-Hall, Englewood Cliffs, NJ
Sutton S, Barto A (1998) Reinforcement learning: an introduction. The MIT Press, Cambridge, MA
Martín-H A, Santos M (2010) Aprendizaje por refuerzo. In: Aprendizaje automático, chapter 12, RA-MA, Madrid, Spain
Karamouzas I, Overmars MH (2008) Adding variation to path planning. Comp Anim Virtual Worlds 19: 283–293
Santos M, Martín-H JA, López V, Botella G (2012) Dyna-H: a heuristic planning reinforcement learning algorithm applied to role-playing-game strategy decision systems. Knowl-Based Syst 32:28–36
Garzés M, Kudenko D (2010) Online learning of shaping rewards in reinforcement learning. Neural Netw 23(4):541–550
Alvarez C, Santos M, López V (2010) Reinforcement learning vs. A* in a role playing game benchmark scenario. In: Ruan D, Li T, Xu Y, Chen G, Kerre E (eds) Computational intelligence: foundations and applications. World Scientific Proc. Series on Computer Engineering and Information Science Vol. 4. Computational Intelligente. Foundations and Applications. Proc. of the 9th Int. FLINS Conference, pp 644–650
Bertsekas DP (1995) Dynamic programming and optimal control. Athena Scientific, Belmont, Massachusetts
Anyway. http://robotsquare.com/2012/03/13/tutorial-segway-with-nxt-g/
Berthilsson S, Danmark A, Hammarqvist U, Nygren H, Savin V (2009) Embedded Control Systems LegoWay http://www.it.uu.se/edu/course/homepage/styrsystem/vt09/Nyheter/Grupper/g5_Final_Report.pdf
Astrom KJ, Hagglund T (2005) Advanced PID control. Research Triangle Park, NC: ISA–The Instrumentation, Systems, and Automation Society
Kelly JF (2007) Lego mindstorms 2.0 NXT-G programming guide. Apress, Berkeley, CA
Watkins C, Dayan P (1992) Q-learning. Mach Learn 8:279–292
Acknowledgments
This work has been partially supported by the Spanish project DPI2009-14552-C02-01.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Carpio, A., Santos, M., Martín, J.A. (2014). A First Approach of a New Learning Strategy: Learning by Confirmation. In: Sun, F., Li, T., Li, H. (eds) Knowledge Engineering and Management. Advances in Intelligent Systems and Computing, vol 214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37832-4_37
Download citation
DOI: https://doi.org/10.1007/978-3-642-37832-4_37
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37831-7
Online ISBN: 978-3-642-37832-4
eBook Packages: EngineeringEngineering (R0)