Internal Models and Anticipations in Adaptive Learning Systems

Butz, Martin V.; Sigaud, Olivier; Gérard, Pierre

doi:10.1007/978-3-540-45002-3_6

Martin V. Butz^10,11,
Olivier Sigaud⁹ &
Pierre Gérard⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2684))

935 Accesses
35 Citations

Abstract

The explicit investigation of anticipations in relation to adaptive behavior is a recent approach. This chapter first provides psychological background that motivates and inspires the study of anticipations in the adaptive behavior field. Next, a basic framework for the study of anticipations in adaptive behavior is suggested. Different anticipatory mechanisms are identified and characterized. First fundamental distinctions are drawn between implicit anticipatory behavior, payoff anticipatory behavior, sensory anticipatory behavior, and state anticipatory behavior. A case study allows further insights into the drawn distinctions. Many future research direction are suggested.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Arbib, M.: The mirror system, imitation, and the evolution of language. In: Dautenhahn, K., Nehaniv, C.L. (eds.) Imitation in animals and artifacts. MIT Press, Cambridge (2002)
Google Scholar
Baluja, S., Pomerleau, D.A.: Using the representation in a neural network’s hidden layer for task-specific focus on attention. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 133–141 (1995)
Google Scholar
Baluja, S., Pomerleau, D.A.: Expectation-based selective attention for visual monitoring and control of a robot vehicle. Robotics and Autonomous Systems 22, 329–344 (1997)
Article Google Scholar
Barto, A.G., Mahadevan, S.: Recent advances in hierarchical reinforcement learning. Discrete event systems (2003) (to appear)
Google Scholar
Bellman, R.E.: Dynamic programming. Princeton University Press, Princeton (1957)
Google Scholar
Booker, L., Goldberg, D.E., Holland, J.H.: Classifier systems and genetic algorithms. Artificial Intelligence 40, 235–282 (1989)
Article Google Scholar
Brooks, R.A.: Intelligence without reason. In: Proceedings of the 12th International Joint Conference on Artificial Intelligence, pp. 569–595 (1991)
Google Scholar
Butz, M.V.: Anticipatory learning classifier systems. Kluwer Academic Publishers, Boston (2002)
Google Scholar
Cassandra, A.R., Kaelbling, L.P., Littman, M.L.: Acting optimally in partially observable stochastic domains. In: Proceedings of the Twelfth National Conference on AI, pp. 1023–1028 (1994)
Google Scholar
Davidsson, P.: Learning by linear anticipation in multi-agent systems. In: Weiss, G. (ed.) Distributed artificial intelligence meets machine learning, pp. 62–72. Springer, Heidelberg (1997)
Google Scholar
Drescher, G.L.: Made-up minds, a constructivist approach to artificial intelligence. MIT Press, Cambridge (1991)
MATH Google Scholar
Elman, J.L.: Finding structure in time. Cognitive Science 14, 179–211 (1990)
Article Google Scholar
Fikes, R.E., Nilsson, N.J.: STRIPS: A new approach to the application of theorem proving to problem solving. Artificial Intelligence 2, 189–208 (1971)
Article MATH Google Scholar
Gallese, V.: The ’shared manifold’ hypothesis: From mirror neurons to empathy. Journal of Consciousness Studies: Between Ourselves - Second-Person Issues in the Study of Consciousness 8, 33–50 (2001)
Google Scholar
Gallese, V., Goldman, A.: Mirror neurons and the simulation theory of mindreading. Trends in Cognitive Sciences 2, 493–501 (1998)
Article Google Scholar
Gérard, P., Meyer, J.A., Sigaud, O.: Combining latent learning and dynamic programming in MACS. European Journal of Operational Research (2003) (submitted)
Google Scholar
Gérard, P., Stolzmann, W., Sigaud, O.: YACS: A new learning classifier system with anticipation. Soft Computing 6, 216–228 (2002)
MATH Google Scholar
Gérard, P., Sigaud, O.: Adding a generalization mechanism to YACS. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), pp. 951–957 (2001)
Google Scholar
Gérard, P., Sigaud, O.: YACS: Combining dynamic programming with generalization in classifier systems. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2000. LNCS (LNAI), vol. 1996, pp. 52–69. Springer, Heidelberg (2001)
Chapter Google Scholar
Goldberg, D.E.: Genetic algorithms in search, optimization and machine learning. Addison-Wesley, Reading (1989)
MATH Google Scholar
Herbart, J.: Psychologie als Wissenschaft neu gegründet auf Erfahrung, Metaphysik und Mathematik. Zweiter, analytischer Teil. August Wilhem Unzer, Königsberg, Germany (1825)
Google Scholar
Hoffmann, J., Sebald, A., Stöcker, C.: Irrelevant response effects improve serial learning in serial reaction time tasks. Journal of Experimental Psychology: Learning, Memory, and Cognition 27, 470–482 (2001)
Article Google Scholar
Holland, J.H.: Adaptation in natural and artificial systems. The University of Michigan Press, Ann Arbor (1975)
Google Scholar
Holland, J.H., Holyoak, K.J., Nisbett, R.E., Thagard, P.R.: Induction. MIT Press, Cambridge (1986)
Google Scholar
Holland, J.H., Reitman, J.S.: Cognitive systems based on adaptive algorithms. Pattern Directed Inference Systems 7, 125–149 (1978)
Google Scholar
Holland, J.H.: Properties of the bucket brigade algorithm. In: Proceedings of an International Conference on Genetic Algorithms and their Applications, pp. 1–7 (1985)
Google Scholar
Kaelbing, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Google Scholar
Koch, C., Ullmann, S.: Shifts in selective attention: Towards the underlying neural circuitry. Human Neurobiology 4, 219–227 (1985)
Google Scholar
Kunde, W.: Response-effect compatibility in manual choice reaction tasks. Journal of Experimental Psychology: Human Perception and Performance 27, 387– 394 (2001)
Google Scholar
Kuvayev, L., Sutton, R.S.: Model-based reinforcement learning with an approximate, learned model. In: Proceedings of the ninth yale workshop on adaptive and learning systems, New Haven, CT, pp. 101–105 (1996)
Google Scholar
LaBerge, D.: Attentional processing, the brain’s art of mindfulness. Harvard University Press, Cambridge (1995)
Google Scholar
Lanzi, P.L.: An analysis of generalization in the XCS classifier system. Evolutionary Computation 7, 125–149 (1999)
Google Scholar
Lanzi, P.L.: Learning classifier systems from a reinforcement learning perspective. Soft Computing 6, 162–170 (2002)
Google Scholar
Mack, A., Rock, I.: Inattentinal blindness. MIT Press, Cambridge
Google Scholar
Moore, A.W., Atkeson, C.: Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning 13, 103–130 (1993)
Google Scholar
Newell, A., Simon, H.A., Shaw, J.C.: Elements of a theory of human problem solving. Psychological Review 65, 151–166 (1958)
Google Scholar
Pashler, H., Johnston, J.C., Ruthruff, E.: Attention and performance. Annual Review of Psychology 52, 629–651 (2001)
Google Scholar
Pashler, H.E.: The psychology of attention. MIT Press, Cambridge (1998)
Google Scholar
Pavlov, I.P.: Conditioned reflexes. Oxford, London (1927)
Google Scholar
Peng, J., Williams, R.J.: Efficient learning and planning within the dyna framework. Adaptive Behavior 1, 437–454 (1993)
Google Scholar
Rizzolatti, G., Fadiga, L., Gallese, V., Fogassi, L.: Premotor cortex and the recognition of motor actions. Cognitive Brain Research 3, 131–141 (1996)
Google Scholar
Rummery, G.A., Niranjan, M.: On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG/TR 166, Engineering Department, Cambridge University (1994)
Google Scholar
Schubotz, R.I., von Cramon, D.Y.: Functional organization of the lateral premotor cortex. fMRI reveals different regions activated by anticipation of object properties, location and speed. Cognitive Brain Research 11, 97–112 (2001)
Google Scholar
Simons, D.J., Chabris, C.F.: Gorillas in our midst: Sustained inattentional blindness for dynamic events. Perception 28, 1059–1074 (1999)
Google Scholar
Sjölander, S.: Some cognitive break-throughs in the evolution of cognition and consciousness, and their impact on the biology language. Evolution and Cognition 1, 3–11 (1995)
Google Scholar
Skinner, B.F.: The behavior of organisms. Appleton-Century Crofts, Inc., New-York (1938)
Google Scholar
Skinner, B.F.: Beyond freedom and dignity. Bantam/Vintage, New York (1971)
Google Scholar
Stock, A., Hoffmann, J.: Intentional fixation of behavioral learning or how R-E learning blocks S-R learning. European Journal of Cognitive Psychology (2002) (in press)
Google Scholar
Stolzmann, W.: Antizipative Classifier Systems [Anticipatory classifier systems]. Shaker Verlag, Aachen (1997)
Google Scholar
Stolzmann, W.: Anticipatory classifier systems. Genetic Programming 1998. In: Proceedings of the Third Annual Conference, pp. 658–664 (1998)
Google Scholar
Stolzmann, W., Butz, M.V., Hoffmann, J., Goldberg, D.E.: First cognitive capabilities in the anticipatory classifier system. In: From Animals to Animats 6: Proceedings of the Sixth International Conference on Simulation of Adaptive Behavior, pp. 287–296 (2000)
Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement learning: An introduction. MIT Press, Cambridge (1998)
Google Scholar
Sutton, R.: Reinforcement learning architectures for animats. In: From animals to animats: Proceedings of the First International Conference on Simulation of Adaptative Behavior (1991)
Google Scholar
Sutton, R., Precup, D., Singh, S.: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112, 181–211 (1999)
Google Scholar
Tani, J.: Model-based learning for mobile robot navigation from the dynamical system perspective. IEEE Transactions on System, Man and Cybernetics 26, 421–436 (1996)
Google Scholar
Tani, J.: An interpretation of the ”self” from the dynamical systems perspective: A constructivist approach. Journal of Consciousness Studies 5, 516–542 (1998)
Google Scholar
Tani, J.: Learning to perceive the world as articulated: An approach for hierarchical learning in sensory-motor systems. Neural Networks 12, 1131–1141 (1999)
Google Scholar
Thistlethwaite, D.: A critical review of latent learning and related experiments. Psychological Bulletin 48, 97–129 (1951)
Google Scholar
Thompson, E.: Empathy and consciousness. Journal of Consciousness Studies: Between Ourselves - Second-Person Issues in the Study of Consciousness 8, 1–32 (2001)
Google Scholar
Thorndike, E.L.: Animal intelligence: Experimental studies. Macmillan, New York (1911)
Google Scholar
Tolman, E.C.: Purposive behavior in animals and men. Appletown, New York (1932)
Google Scholar
Tolman, E.C.: The determiners of behavior at a choice point. Psychological Review 45, 1–41 (1938)
Google Scholar
Tolman, E.C.: Cognitive maps in rats and men. Psychological Review 55, 189–208 (1948)
Google Scholar
Tolman, E.C.: Principles of purposive behavior. In: Koch, S. (ed.) Psychology: A study of science, pp. 92–157. McGraw-Hill, New York (1959)
Google Scholar
Watkins, C.J.: Learning with delayed rewards. PhD thesis, Psychology Department, University of Cambridge, England (1989)
Google Scholar
Wilson, S.W.: ZCS, a zeroth level classifier system. Evolutionary Computation 2, 1–18 (1994)
Google Scholar
Wilson, S.W.: Classifier fitness based on accuracy. Evolutionary Computation 3, 149–175 (1995)
Google Scholar
Wilson, S.W.: Mining oblique data with XCS. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2000. LNCS (LNAI), vol. 1996, p. 158. Springer, Heidelberg (2001)
Chapter Google Scholar
Wilson, S.W.: Knowledge growth in an artificial animal. In: Grefenstette, J.J. (ed.) Proceedings of an international conference on genetic algorithms and their applications, pp. 16–23. Carnegie-Mellon University, Pittsburgh (1985)
Google Scholar
Witkowski, C.M.: Schemes for learning and behaviour: A new expectancy model. PhD thesis, Department of Computer Science, University of London, England (1997)
Google Scholar
Wolpert, D.H.: The lack of a priori distinctions between learning algorithms. Neural Computation 8, 1341–1390 (1995)
Google Scholar

Download references

Author information

Authors and Affiliations

AnimatLab-LIP6, 8, rue du capitaine Scott, 75015, Paris, France
Olivier Sigaud & Pierre Gérard
Department of Cognitive Psychology, University of Würzburg, Germany
Martin V. Butz
Illinois Genetic Algorithms Laboratory (IlliGAL), University of Illinois at Urbana-Champaign, IL, USA
Martin V. Butz

Authors

Martin V. Butz
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Sigaud
View author publications
You can also search for this author in PubMed Google Scholar
Pierre Gérard
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Psychology, University of Würzburg, Röntgenring 11, 97070, Würzburg, Germany
Martin V. Butz
Animat Lab, University Paris VI, 104 Av du Président Kennedy, 75016, Paris, France
Olivier Sigaud
ADAge, LIPN, Univ. de Paris-Nord, 93 430, Villetaneuse, France
Pierre Gérard

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Butz, M.V., Sigaud, O., Gérard, P. (2003). Internal Models and Anticipations in Adaptive Learning Systems. In: Butz, M.V., Sigaud, O., Gérard, P. (eds) Anticipatory Behavior in Adaptive Learning Systems. Lecture Notes in Computer Science(), vol 2684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45002-3_6

Download citation

DOI: https://doi.org/10.1007/978-3-540-45002-3_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40429-3
Online ISBN: 978-3-540-45002-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics