Abstract
The explicit investigation of anticipations in relation to adaptive behavior is a recent approach. This chapter first provides psychological background that motivates and inspires the study of anticipations in the adaptive behavior field. Next, a basic framework for the study of anticipations in adaptive behavior is suggested. Different anticipatory mechanisms are identified and characterized. First fundamental distinctions are drawn between implicit anticipatory behavior, payoff anticipatory behavior, sensory anticipatory behavior, and state anticipatory behavior. A case study allows further insights into the drawn distinctions. Many future research direction are suggested.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arbib, M.: The mirror system, imitation, and the evolution of language. In: Dautenhahn, K., Nehaniv, C.L. (eds.) Imitation in animals and artifacts. MIT Press, Cambridge (2002)
Baluja, S., Pomerleau, D.A.: Using the representation in a neural network’s hidden layer for task-specific focus on attention. In: Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pp. 133–141 (1995)
Baluja, S., Pomerleau, D.A.: Expectation-based selective attention for visual monitoring and control of a robot vehicle. Robotics and Autonomous Systems 22, 329–344 (1997)
Barto, A.G., Mahadevan, S.: Recent advances in hierarchical reinforcement learning. Discrete event systems (2003) (to appear)
Bellman, R.E.: Dynamic programming. Princeton University Press, Princeton (1957)
Booker, L., Goldberg, D.E., Holland, J.H.: Classifier systems and genetic algorithms. Artificial Intelligence 40, 235–282 (1989)
Brooks, R.A.: Intelligence without reason. In: Proceedings of the 12th International Joint Conference on Artificial Intelligence, pp. 569–595 (1991)
Butz, M.V.: Anticipatory learning classifier systems. Kluwer Academic Publishers, Boston (2002)
Cassandra, A.R., Kaelbling, L.P., Littman, M.L.: Acting optimally in partially observable stochastic domains. In: Proceedings of the Twelfth National Conference on AI, pp. 1023–1028 (1994)
Davidsson, P.: Learning by linear anticipation in multi-agent systems. In: Weiss, G. (ed.) Distributed artificial intelligence meets machine learning, pp. 62–72. Springer, Heidelberg (1997)
Drescher, G.L.: Made-up minds, a constructivist approach to artificial intelligence. MIT Press, Cambridge (1991)
Elman, J.L.: Finding structure in time. Cognitive Science 14, 179–211 (1990)
Fikes, R.E., Nilsson, N.J.: STRIPS: A new approach to the application of theorem proving to problem solving. Artificial Intelligence 2, 189–208 (1971)
Gallese, V.: The ’shared manifold’ hypothesis: From mirror neurons to empathy. Journal of Consciousness Studies: Between Ourselves - Second-Person Issues in the Study of Consciousness 8, 33–50 (2001)
Gallese, V., Goldman, A.: Mirror neurons and the simulation theory of mindreading. Trends in Cognitive Sciences 2, 493–501 (1998)
Gérard, P., Meyer, J.A., Sigaud, O.: Combining latent learning and dynamic programming in MACS. European Journal of Operational Research (2003) (submitted)
Gérard, P., Stolzmann, W., Sigaud, O.: YACS: A new learning classifier system with anticipation. Soft Computing 6, 216–228 (2002)
Gérard, P., Sigaud, O.: Adding a generalization mechanism to YACS. In: Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001), pp. 951–957 (2001)
Gérard, P., Sigaud, O.: YACS: Combining dynamic programming with generalization in classifier systems. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2000. LNCS (LNAI), vol. 1996, pp. 52–69. Springer, Heidelberg (2001)
Goldberg, D.E.: Genetic algorithms in search, optimization and machine learning. Addison-Wesley, Reading (1989)
Herbart, J.: Psychologie als Wissenschaft neu gegründet auf Erfahrung, Metaphysik und Mathematik. Zweiter, analytischer Teil. August Wilhem Unzer, Königsberg, Germany (1825)
Hoffmann, J., Sebald, A., Stöcker, C.: Irrelevant response effects improve serial learning in serial reaction time tasks. Journal of Experimental Psychology: Learning, Memory, and Cognition 27, 470–482 (2001)
Holland, J.H.: Adaptation in natural and artificial systems. The University of Michigan Press, Ann Arbor (1975)
Holland, J.H., Holyoak, K.J., Nisbett, R.E., Thagard, P.R.: Induction. MIT Press, Cambridge (1986)
Holland, J.H., Reitman, J.S.: Cognitive systems based on adaptive algorithms. Pattern Directed Inference Systems 7, 125–149 (1978)
Holland, J.H.: Properties of the bucket brigade algorithm. In: Proceedings of an International Conference on Genetic Algorithms and their Applications, pp. 1–7 (1985)
Kaelbing, L.P., Littman, M.L., Moore, A.W.: Reinforcement learning: A survey. Journal of Artificial Intelligence Research 4, 237–285 (1996)
Koch, C., Ullmann, S.: Shifts in selective attention: Towards the underlying neural circuitry. Human Neurobiology 4, 219–227 (1985)
Kunde, W.: Response-effect compatibility in manual choice reaction tasks. Journal of Experimental Psychology: Human Perception and Performance 27, 387– 394 (2001)
Kuvayev, L., Sutton, R.S.: Model-based reinforcement learning with an approximate, learned model. In: Proceedings of the ninth yale workshop on adaptive and learning systems, New Haven, CT, pp. 101–105 (1996)
LaBerge, D.: Attentional processing, the brain’s art of mindfulness. Harvard University Press, Cambridge (1995)
Lanzi, P.L.: An analysis of generalization in the XCS classifier system. Evolutionary Computation 7, 125–149 (1999)
Lanzi, P.L.: Learning classifier systems from a reinforcement learning perspective. Soft Computing 6, 162–170 (2002)
Mack, A., Rock, I.: Inattentinal blindness. MIT Press, Cambridge
Moore, A.W., Atkeson, C.: Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning 13, 103–130 (1993)
Newell, A., Simon, H.A., Shaw, J.C.: Elements of a theory of human problem solving. Psychological Review 65, 151–166 (1958)
Pashler, H., Johnston, J.C., Ruthruff, E.: Attention and performance. Annual Review of Psychology 52, 629–651 (2001)
Pashler, H.E.: The psychology of attention. MIT Press, Cambridge (1998)
Pavlov, I.P.: Conditioned reflexes. Oxford, London (1927)
Peng, J., Williams, R.J.: Efficient learning and planning within the dyna framework. Adaptive Behavior 1, 437–454 (1993)
Rizzolatti, G., Fadiga, L., Gallese, V., Fogassi, L.: Premotor cortex and the recognition of motor actions. Cognitive Brain Research 3, 131–141 (1996)
Rummery, G.A., Niranjan, M.: On-line Q-learning using connectionist systems. Technical Report CUED/F-INFENG/TR 166, Engineering Department, Cambridge University (1994)
Schubotz, R.I., von Cramon, D.Y.: Functional organization of the lateral premotor cortex. fMRI reveals different regions activated by anticipation of object properties, location and speed. Cognitive Brain Research 11, 97–112 (2001)
Simons, D.J., Chabris, C.F.: Gorillas in our midst: Sustained inattentional blindness for dynamic events. Perception 28, 1059–1074 (1999)
Sjölander, S.: Some cognitive break-throughs in the evolution of cognition and consciousness, and their impact on the biology language. Evolution and Cognition 1, 3–11 (1995)
Skinner, B.F.: The behavior of organisms. Appleton-Century Crofts, Inc., New-York (1938)
Skinner, B.F.: Beyond freedom and dignity. Bantam/Vintage, New York (1971)
Stock, A., Hoffmann, J.: Intentional fixation of behavioral learning or how R-E learning blocks S-R learning. European Journal of Cognitive Psychology (2002) (in press)
Stolzmann, W.: Antizipative Classifier Systems [Anticipatory classifier systems]. Shaker Verlag, Aachen (1997)
Stolzmann, W.: Anticipatory classifier systems. Genetic Programming 1998. In: Proceedings of the Third Annual Conference, pp. 658–664 (1998)
Stolzmann, W., Butz, M.V., Hoffmann, J., Goldberg, D.E.: First cognitive capabilities in the anticipatory classifier system. In: From Animals to Animats 6: Proceedings of the Sixth International Conference on Simulation of Adaptive Behavior, pp. 287–296 (2000)
Sutton, R.S., Barto, A.G.: Reinforcement learning: An introduction. MIT Press, Cambridge (1998)
Sutton, R.: Reinforcement learning architectures for animats. In: From animals to animats: Proceedings of the First International Conference on Simulation of Adaptative Behavior (1991)
Sutton, R., Precup, D., Singh, S.: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112, 181–211 (1999)
Tani, J.: Model-based learning for mobile robot navigation from the dynamical system perspective. IEEE Transactions on System, Man and Cybernetics 26, 421–436 (1996)
Tani, J.: An interpretation of the ”self” from the dynamical systems perspective: A constructivist approach. Journal of Consciousness Studies 5, 516–542 (1998)
Tani, J.: Learning to perceive the world as articulated: An approach for hierarchical learning in sensory-motor systems. Neural Networks 12, 1131–1141 (1999)
Thistlethwaite, D.: A critical review of latent learning and related experiments. Psychological Bulletin 48, 97–129 (1951)
Thompson, E.: Empathy and consciousness. Journal of Consciousness Studies: Between Ourselves - Second-Person Issues in the Study of Consciousness 8, 1–32 (2001)
Thorndike, E.L.: Animal intelligence: Experimental studies. Macmillan, New York (1911)
Tolman, E.C.: Purposive behavior in animals and men. Appletown, New York (1932)
Tolman, E.C.: The determiners of behavior at a choice point. Psychological Review 45, 1–41 (1938)
Tolman, E.C.: Cognitive maps in rats and men. Psychological Review 55, 189–208 (1948)
Tolman, E.C.: Principles of purposive behavior. In: Koch, S. (ed.) Psychology: A study of science, pp. 92–157. McGraw-Hill, New York (1959)
Watkins, C.J.: Learning with delayed rewards. PhD thesis, Psychology Department, University of Cambridge, England (1989)
Wilson, S.W.: ZCS, a zeroth level classifier system. Evolutionary Computation 2, 1–18 (1994)
Wilson, S.W.: Classifier fitness based on accuracy. Evolutionary Computation 3, 149–175 (1995)
Wilson, S.W.: Mining oblique data with XCS. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2000. LNCS (LNAI), vol. 1996, p. 158. Springer, Heidelberg (2001)
Wilson, S.W.: Knowledge growth in an artificial animal. In: Grefenstette, J.J. (ed.) Proceedings of an international conference on genetic algorithms and their applications, pp. 16–23. Carnegie-Mellon University, Pittsburgh (1985)
Witkowski, C.M.: Schemes for learning and behaviour: A new expectancy model. PhD thesis, Department of Computer Science, University of London, England (1997)
Wolpert, D.H.: The lack of a priori distinctions between learning algorithms. Neural Computation 8, 1341–1390 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Butz, M.V., Sigaud, O., Gérard, P. (2003). Internal Models and Anticipations in Adaptive Learning Systems. In: Butz, M.V., Sigaud, O., Gérard, P. (eds) Anticipatory Behavior in Adaptive Learning Systems. Lecture Notes in Computer Science(), vol 2684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45002-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-540-45002-3_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40429-3
Online ISBN: 978-3-540-45002-3
eBook Packages: Springer Book Archive