Learning by Experience from Others — Social Learning and Imitation in Animals and Robots

Riedmiller, Martin; Merke, Artur

doi:10.1007/978-3-662-05594-6_17

Martin Riedmiller &
Artur Merke

264 Accesses
2 Citations

Abstract

A challenging current research direction is the design of intelligent software systems — ‘agents’ — that are able to autonomously solve certain tasks within their environment. Application areas of software agents can be found in robotics, as for example agents that control robots to rescue people in dangerous environments, and also in virtual worlds as electronic markets, where intelligent agents have to compete against other market participants, that pursue their own goals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Andou, T. (1998) Refinement of soccer agent’s position using reinforcement learning. In Kitano H., editor, RoboCup-97: Robot Soccer World Cup I,Springer Verlag.
Google Scholar
Burkhard, H.-D., Hannebauer, M. and Wendler, J. (1998) Belief-desire-intention deliberation in artificial soccer. AI Magazine 19 (3), 87–93.
Google Scholar
Barto, A. G., Sutton, R. S. and Watkins, C. J. C. H. (1989) Learning and sequential decision making. Technical Report COINS TR 89–95, Department of Computer and Information Science, University of Massachusetts, Amherst, September 1989.
Google Scholar
Bertsekas, D. P. and Tsitsiklis, J. N. (1989) Neuro Dynamic Programming. Athena Scientific, Belmont, Massachusetts.
Google Scholar
Bertsekas, D. P. and Tsitsiklis, J. N. (1996) Neuro Dynamic Programming. Athena Scientific, Belmont, Massachusetts.
Google Scholar
Bertsekas, D. P. and Tsitsiklis, J. N. (1996) Neuro-dynamic programming. Optimization and neural computation series, 3. Athena Scientific.
Google Scholar
Claus, C. and Boutilier, C. (1999) The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems. In IJCAI.
Google Scholar
Dorer, K. (1999) Behavior networks for continuous domains using situation-dependent motivations. In Proceedings of IJCAI ’99, Stockholm, Sweden, 1233–1238.
Google Scholar
Filar, J. and Vrieze, K. (1997) Competitive Markov decision processes. Springer Verlag.
Google Scholar
Lauer, M. and Riedmiller, M. (2000) An algorithm for distributed reinforcement learning in cooperative multi-agent systems. In Proceedings of International Conference on Machine Learning, ICML ’00, Stanford, CA, 535–542.
Google Scholar
Luke, S. (1998) Genetic programming produced competitive soccer softbot teams for robocup97. In Proceedings of the Third Annual Genetic Programming Conference (GP98) San Francisco, CA, 204–222.
Google Scholar
Merke, A. (1999) Reinforcement Lernen in Multiagentensystemen. Master’s thesis, Universität Karlsruhe.
Google Scholar
Puterman, M. L. (1994) Markov decision processes: discrete stochastic dynamic programming. Wiley series in probability and mathematical statistics: Applied probability and statistics. Wiley.
Book Google Scholar
Riedmiller, M. (2000) Concepts and facilities of a neural reinforcement learning control architecture for technical process control. Journal of Neural Computing and Application 8, 323–338.
Article Google Scholar
Riedmiller, M., Merke, A., Meier, D., Hoffmann, A., Sinner, A., Thate, O., Kill, O. and Ehrmann, R. (2000) Karlsruhe brainstormers–a reinforcement learning way to robotic soccer. In Jennings, A., and Stone, P.,editors, RoboCup-2000: Robot Soccer World Cup IV, LNCS. Springer Verlag.
Google Scholar
Stolzenburg, F., Obst, O., Murray, J. and Bremer, B. (1999) Spatial agents implemented in a logical expressible language. In Veloso M. M., editor, Proceedings of the 3rd International Workshop on RoboCup in Conjunction with 16th Joint International Conference on Artificial Intelligence, Stockholm, IJCAI press, 205–210.
Google Scholar
Stone, P., Sutton, R. and Singh, S. (2000) Reinforcement learning for 3 vs. 2 keepaway. In Stone, P., Balch, T. and Kreatzschmarr, K. editors, RoboCup-00: Robot Soccer World Cup IV. Springer Verlag.
Google Scholar
Stone, P. and Veloso, M. (1998) A layered approach to learning client behaviours in the robocup soccer server. Applied Artificial Intelligence 12, 165–188.
Article Google Scholar
Stone, P. and Veloso, M. (1998) Team-partitioned, opaque-transition reinforcement learning. In Asada, M. and Kitano, H. editors, RoboCup-98: Robot Soccer World Cup II,Springer Verlag.
Google Scholar
Sutton, R. S. and Barto, A. G. (1998) Reinforcement Learning. MIT Press, Cambridge, MA.
Google Scholar
Sutton, R. S., Precup, D. and Singh S. (1999) Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence to appear.
Google Scholar
Watkins, C. J. (1989) Learning from Delayed Rewards. Phd thesis, Cambridge University.
Google Scholar
Watkins, C. J. C. H. and Dean, P. (1992) Technical Note: Q-Learning. Machine Leaning 8, 279–292.
MATH Google Scholar
Woolridge, M. (1999) Intelligent agents. In Weiss, G. editor, Multi Agent Systems. MIT Press
Google Scholar

Download references

Authors

Martin Riedmiller
View author publications
You can also search for this author in PubMed Google Scholar
Artur Merke
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institut für Theoretische Physik, Universität Heidelberg, Philosophenweg 19, 69120, Heidelberg, Germany
Reimer Kühn & Ion-Olimpiu Stamatescu &
Neurobiologie, Freie Universität Berlin, Königin-Luise-Str. 28/30, 14195, Berlin, Germany
Randolf Menzel
Institut für Logik, Komplexität und Deduktionssysteme, Universität Karlsruhe, 76128, Karlsruhe, Germany
Wolfram Menzel
FESt, Schmeilweg 5, 69118, Heidelberg, Germany
Ulrich Ratsch & Ion-Olimpiu Stamatescu &
FB Informatik, University of Kaiserslautern, 67653, Kaiserslautern, Germany
Michael M. Richter

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Riedmiller, M., Merke, A. (2003). Learning by Experience from Others — Social Learning and Imitation in Animals and Robots. In: Kühn, R., Menzel, R., Menzel, W., Ratsch, U., Richter, M.M., Stamatescu, IO. (eds) Adaptivity and Learning. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-05594-6_17

Download citation

DOI: https://doi.org/10.1007/978-3-662-05594-6_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-05510-2
Online ISBN: 978-3-662-05594-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics