Abstract
This paper discusses the emergence of sensorimotor coordination for ESCHeR, a 4DOF redundant foveated rob ot-head, by interaction with its environment. A feedback-error-learning(FEL)-based distributed control provides the system with explorative abilities with reflexes constraining the learning space. A Kohonen network, trained at run-time, categorizes the sensorimotor patterns obtained over ESCHeR's interaction with its environment, enables the reinforcement of frequently executed actions, thus stabilizing the learning activity over time. We explain how the development of ESCHeR's visual abilities (namely gaze fixation and saccadic motion), from a context-free reflex-based control process to a context-dependent, pattern-based sensorimotor coordination can be related to the Piagetian ‘stage theory’.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Bajcsy, R., & Campos, M. (1992). Active and Exploratory Perception. CVGIP: Image Understanding, 56(1), pp. 31–40.
Brown, C. (1988). The Rochester Robot (Technical Report TR-257). University of Rochester, Rochester, USA.
Chow, M., & Teeter, J. (1994). An analysis of Weight Decay as a Methodology of Reducing Three-Layer Feedforward Artificial Neural Networks for Classification Problems”. IEEE International Conference on Neural Networks, pp. 600–605.
Coombs, D. (1992). Real-Time Gaze Holding in Binocular Robot Vision. Doctoral dissertation (also available as TR-415), Department of Computer Science, University of Rochester, Rochester, USA.
Gomi, H., & Kawato, M. (1993). Neural Network Control for a Closed-loop System Using Feedback-Error-Learning. Neural Networks, 6, pp. 933–946.
Jordan, M.I. (1990). Motor Learning and the Degree of Freedom Problem”. In Jeannerod, M. (ed) Attention and Performance, vol. XIII, pp. 796–836.
Jordan, M.I. (1992). Computational Aspects of Motor Control and Motor Learning (Technical Report TR-9206). Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences, USA.
Jordan, M.I., & Rosenbaum, D.A. (1989). Action. In M.I. Posner (Ed.), Foundations of Cognitive Science. Cambridge, MA: MIT Press.
Kawato, M., Furukawa, K., & Suzuki, R. (1987). A Hierarchical Neural-Network Model for Control and Learning of Voluntary Movement. Biological Cybernetics, 57, pp. 169–185.
Kawato, M., & Gomi, H. (1992). A Computational Model of Four Regions of the Cerebellum Based on Feedback-error Learning. Biological Cybernetics, 68, pp. 95–103.
Kohonen, T., Hynninen, J., Kangas, J. & Laaksonen, J. (1995). SOM-PAK: the Self-Organizing Map Program Package (Technical Report April 7). Helsinki University of Technology, Laboratory of Computer and Information Science, Rakentajanaukio 2 C, SF-02150 Espoo, Finland.
Kraaijveld, M.A., Mao, J., & Jain, A.K. (1992). A Non-linear Projection Method Based on Kohonen's Topology Preserving Maps. International Conference on Pattern Recognition, Los Alamitos, CA, pp. 41–45.
Kuniyoshi, Y., Kita, N., Sugimoto, K., Nakamura, S., & Suehiro, T. (1995). A Foveated Wide Angle Lens for Active Vision. IEEE International Conference on Robotics and Automation, Japan, pp. 2982–2985.
Kuniyoshi, Y., Kita, N., Rougeaux, S., & Suehiro, T. (1995). Active Stereo Vision System with Foveated Wide Angle Lenses. In S.Z. Li, D.P. Mital, E.K. Teoh, H. Wang (eds) Recent Developments in Computer Vision, Lecture Notes in Computer Science 1035, Springer-Verlag, pp. 191–200.
Kuniyoshi, Y. (1994). The Science of Imitation — Towards Physically and Socially Grounded Intelligence —. RWC Technical Report, TR-94001.
Lucas, B., & Kanade, T. (1981). An Iterative Image Registration TechniqueWith an Application to Stereo Vision. Proc. DARPA Image Understanding Workshop, pp. 121–130.
Meltzoff, A.N., & Moore, M.K. (1989). Imitation in Newborn Infants: Exploring the Range of Gestures Imitated and the Underlying Mechanisms. Developmental Psychology, vol. 25, no. 6, pp. 954–962.
Murray, D.W., Bradshaw, K. J., McLauchlan, P.F., Reid, I.D., & Sharkey, P.M. (1995). Driving Saccade to Pursuit using Image Motion. International Journal of Computer Vision.
Nordlund P., & Uhlin, T. (1995). Closing the Loop: Detection and Pursuit of a Moving Object by a Moving Observer (Technical Report CVAP–175–95–7–173). Computational Vision and Active Perception Laboratory, Royal Institute of Technology, S-100 44 Stockholm, Sweden.
Piaget, J. (1962). Play, Dreams and Imitation in Childhood. New York: W. W. Norton.
Rougeaux, S., & Kuniyoshi, Y. (1997). Velocity and Disparity Cues for Robust Real-Time Binocular Tracking. IEEE Proc. Computer Vision and Pattern Recognition, Puerto-Rico, pp. 1–6.
Rougeaux, S., Kita, N., Kuniyoshi, S., Sakane, S., & Chavand, F. (1994). Binocular Tracking Based on Virtual Horopters. IEEE Proc. International Conference on Intelligent Robots and Systems, Munich, Germany, pp. 2052–2057.
Rumelhart, D.E., Hinton, G.E., & Williams, R.J. (1986). Learning Representations by Back-propagating Errors. Nature, vol. 323, pp. 533–536.
Sandini, G., & Tagliaso, V. (1980). An Anthropomorphic Retina-like Structure for Scene Analysis. Computer Graphics and Image Processing, 14(3), pp. 365–372.
Smagt, P., & Krose, B.J.A. (1991). A Real-time Learning Neural Robot Controller. International Conference on Neural Networks, Espoo, Finland, pp. 351–356.
Thelen, E., & Smith, L. (1994). A Dynamic Systems Approach to the Development of Cognition and Action, Cambridge, Mass.: MIT Press, Bradford Books.
Rights and permissions
About this article
Cite this article
Berthouze, L., Kuniyoshi, Y. Emergence and Categorization of Coordinated Visual Behavior Through Embodied Interaction. Machine Learning 31, 187–200 (1998). https://doi.org/10.1023/A:1007453010407
Issue Date:
DOI: https://doi.org/10.1023/A:1007453010407