A First Approach of a New Learning Strategy: Learning by Confirmation

Carpio, Alejandro; Santos, Matilde; Martín, José Antonio

doi:10.1007/978-3-642-37832-4_37

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 214)

Abstract

Learning by confirmation is a new learning approach that combines two types of supervised learning strategies: reinforcement learning and learning by examples. In this paper, we show how this strategy accelerates the learning process when prior knowledge is introduced into the reinforcement learning algorithm. The proposal has been tested on a real-time device, a Lego Mindstorms NXT 2.0 robot configured as an inverted pendulum. The methodology performs well on this task, and the results are promising.
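
The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea of introducing example-based knowledge into a reinforcement learner, not the authors' method. It assumes tabular Q-learning on a hypothetical five-state chain; the environment, the `init_q` seeding scheme, the `bonus` value, and all parameter names are illustrative assumptions.

```python
import random

N_STATES = 5      # hypothetical toy chain: states 0..4, goal at state 4
ACTIONS = (0, 1)  # 0 = step left, 1 = step right


def step(state, action):
    """Toy environment (assumption): reward 1.0 on reaching the last state."""
    nxt = max(0, state - 1) if action == 0 else min(N_STATES - 1, state + 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


def init_q(examples=(), bonus=0.5):
    """Zero Q-table, optionally biased toward example (state, action) pairs."""
    q = [[0.0 for _ in ACTIONS] for _ in range(N_STATES)]
    for s, a in examples:
        q[s][a] = bonus  # illustrative way of injecting prior knowledge
    return q


def q_learning(q, episodes=50, alpha=0.5, gamma=0.9, epsilon=0.1,
               seed=0, max_steps=100):
    """Standard epsilon-greedy tabular Q-learning; returns steps per episode."""
    rng = random.Random(seed)
    steps_per_episode = []
    for _ in range(episodes):
        s, steps, done = 0, 0, False
        while not done and steps < max_steps:
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)  # explore
            else:
                best = max(q[s])         # exploit, breaking ties at random
                a = rng.choice([act for act in ACTIONS if q[s][act] == best])
            s2, r, done = step(s, a)
            target = r + (0.0 if done else gamma * max(q[s2]))
            q[s][a] += alpha * (target - q[s][a])
            s, steps = s2, steps + 1
        steps_per_episode.append(steps)
    return steps_per_episode


# Examples that always say "move right", standing in for demonstrations.
examples = [(s, 1) for s in range(N_STATES - 1)]
plain = q_learning(init_q())
seeded = q_learning(init_q(examples))
print("first-episode steps, plain :", plain[0])
print("first-episode steps, seeded:", seeded[0])
```

Comparing `plain` and `seeded` episode by episode gives a rough picture of how much the injected examples shorten early learning in this toy setting; it says nothing about the inverted-pendulum results reported in the chapter.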

Acknowledgments

This work has been partially supported by the Spanish project DPI2009-14552-C02-01.

Author information

Authors and Affiliations

Computer Architecture and Systems Engineering, Facultad de Informática, Universidad Complutense de Madrid, 28040, Madrid, Spain
Alejandro Carpio, Matilde Santos & José Antonio Martín

Corresponding author

Correspondence to Alejandro Carpio.

Editor information

Editors and Affiliations

Department of Computer Science and Technology, Tsinghua University, Beijing, China, People's Republic
Fuchun Sun
School of Information Science and Technology, Southwest Jiaotong University, Chengdu, China, People's Republic
Tianrui Li
Department of Computer Science and Techn, Tsinghua University, Beijing, China, People's Republic
Hongbo Li

About this paper

Cite this paper

Carpio, A., Santos, M., Martín, J.A. (2014). A First Approach of a New Learning Strategy: Learning by Confirmation. In: Sun, F., Li, T., Li, H. (eds) Knowledge Engineering and Management. Advances in Intelligent Systems and Computing, vol 214. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37832-4_37

DOI: https://doi.org/10.1007/978-3-642-37832-4_37
Published: 24 July 2013
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37831-7
Online ISBN: 978-3-642-37832-4
eBook Packages: Engineering, Engineering (R0)
