Visual surveillance monitoring and watching

  • Richard Howarth
  • Hilary Buxton
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1065)


This paper describes the development of computational understanding for surveillance of moving objects and their interactions in real world situations. Understanding the activity of moving objects starts by tracking objects in an image sequence, but this is just the beginning. The objective of this work is to go further and form conceptual descriptions that capture the dynamic interactions of objects in a meaningful way. The computational approach uses results from the VIEWS project. The issues concerned with extending computational vision to address high-level vision are described in the context of a surveillance system. In this paper we describe two systems: a passive architecture based on “event reasoning” which is the identification of behavioural primitives, their selection and composition; and an active architecture based on “task-level control” which is the guidance of the system to comply with a given surveillance task.


Conceptual Description Visual Surveillance Scene Object Situate Approach Level Control System 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Philip E. Agre and David Chapman. Pengi: An implementation of a theory of activity. In Sixth AAAI Conference, pages 268–272. AAAI Press, 1987.Google Scholar
  2. 2.
    Dana H. Ballard. Animate vision. Artificial Intelligence, 48:57–86, 1991.Google Scholar
  3. 3.
    Hilary Buxton and others. VIEWS: Visual Inspection and Evaluation of Wide-area Scenes. IJCAI-91 Videotape Program, Morgan Kaufmann, 1991.Google Scholar
  4. 4.
    Hilary Buxton and Shaogang Gong. Visual Surveillance in a Dynamic and Uncertain World. Artificial Intelligence, 78:371–405, 1995.Google Scholar
  5. 5.
    David Chapman. Vision, Instruction and Action. The MIT Press, 1991.Google Scholar
  6. 6.
    Anthony N. Clark. Pattern recognition of noisy sequences of behavioural events using functional combinators. The Computer Journal, 37(5):385–398, 1994.Google Scholar
  7. 7.
    David R. Corrall, Anthony N. Clark, and A. Graham Hill. Airside ground movements surveillance. In NATO AGARD Symposium on Machine Intelligence in Air Traffic Management, pages 29:1–29:13, 1993.Google Scholar
  8. 8.
    David R. Corrall and A. Graham Hill. Visual surveillance. GEC Review, 8(1):15–27, 1992.Google Scholar
  9. 9.
    Li Du, Geoffery D. Sullivan, and Keith B. Baker. Quantitative analysis of the viewpoint consistency constraint in model-based vision. In Fourth ICCV, pages 632–639. IEEE Press, 1993.Google Scholar
  10. 10.
    Margaret M. Fleck. Representing space for practical reasoning. Image and Vision Computing, 6(2):75–86, 1986.Google Scholar
  11. 11.
    Margaret M. Fleck. Boundaries and Topological Algorithms. PhD thesis, MIT AI Lab., 1988. AI-TR 1065.Google Scholar
  12. 12.
    R.A. Frost. Constructing programs as executable attribute grammars. The Computer Journal, 35(4):376–387, 1992.Google Scholar
  13. 13.
    James J. Gibson. The Ecological Approach to Visual Perception. Houghton Mifflin Company, 1979.Google Scholar
  14. 14.
    Shaogang G. Gong and Hilary Buxton. On the expectations of moving objects. In Tenth ECAI Conference, pages 781–784, 1992.Google Scholar
  15. 15.
    Ian Horswill. Visual routines and visual search: a real-time implementation and an automata-theoretic analysis. In Fourteenth IJCAI Conference, pages 56–62. Morgan Kaufmann, 1995.Google Scholar
  16. 16.
    Richard J. Howarth. Spatial Representation, Reasoning and Control for a Surveillance System. PhD thesis, QMW, University of London, 1994.Google Scholar
  17. 17.
    Richard J. Howarth. Interpreting a dynamic and uncertain world: high-level vision. Artificial Intelligence Review, 9(1):37–63, 1995.Google Scholar
  18. 18.
    Richard J. Howarth and Hilary Buxton. An analogical representation of space and time. Image and Vision Computing, 10(7):467–478, 1992.Google Scholar
  19. 19.
    Richard J. Howarth and Hilary Buxton. Selective attention in dynamic vision. In Thirteenth IJCAI Conference, pages 1579–1584. Morgan Kaufmann, 1993.Google Scholar
  20. 20.
    Edwin Hutchins. Understanding micronesian navigation. In G. Dedre and A.L. Stevens Mental Models, pages 191–225. Lawrence Erlbaum Associates, 1983.Google Scholar
  21. 21.
    Simon King, Sophie Motet, Jérôme Thoméré, and FranÇois Arlabosse. A visual surveillance system for incident detection. In AAAI workshop on AI in Intelligent Vehicle Highway Systems, pages 30–36. AAAI Press, 1994.Google Scholar
  22. 22.
    Stephen M. Kosslyn, Rex A. Flynn, Jonathan B. Amsterdam, and Gretchen Wang. Components of high-level vision: a cognitive neuroscience analysis and accounts of neurological syndromes. Cognition, 34:203–277, 1990.Google Scholar
  23. 23.
    Hans-Hellmut Nagel. From image sequences towards conceptual descriptions. Image and Vision Computing, 6(2):59–74, May 1988.Google Scholar
  24. 24.
    Bernd Neumann. Natural language descriptions of time-varying scenes. In David L. Waltz, editor, Semantic Structures: Advances in Natural Language Processing, pages 167–206. Lawrence Erlbaum Associates, 1989.Google Scholar
  25. 25.
    Nils J. Nilsson. Teleo-reactive programs for agent control. Journal of Artificial Intelligence Research, 1:139–158, 1994.Google Scholar
  26. 26.
    Raymond D. Rimey and Christopher M. Brown. Control of selective perception using Bayes nets and dicision theory. International Journal of Computer Vision, 12(2/3):173–207, April 1994.Google Scholar
  27. 27.
    Roger C. Schank and Robert P. Abelson. Scripts, Plans, Goals and Understanding. Lawrence Erlbaum Associates, 1977.Google Scholar
  28. 28.
    Andrew F. Toal and Hilary Buxton. Spatio-temporal reasoning within a traffic surveillance system. In G. Sandini, editor, Computer Vision-ECCV'92, pages 884–892. Springer-Verlag, 1992.Google Scholar
  29. 29.
    John K. Tsotsos. Toward a computational model of attention. In T. Papathomas, C. Chubb, A. Gorea, and E. Kowler, editors, Early Vision and Beyond, pages 207–218. The MIT Press, 1995.Google Scholar
  30. 30.
    Shimon Ullman. Visual routines. In Steven Pinker, editor, Visual Cognition, pages 97–159. The MIT Press, 1985.Google Scholar
  31. 31.
    Anthony D. Worrall, Geoffery D. Sullivan, and Keith B. Baker. Advances in model-based traffic vision. In British Machine Vision Conference 1993, pages 559–568. BMVA Press, 1993.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1996

Authors and Affiliations

  • Richard Howarth
    • 1
  • Hilary Buxton
    • 1
  1. 1.School of Cognitive and Computing SciencesUniversity of SussexFalmerUK

Personalised recommendations