A Probabilistic Sensor for the Perception and the Recognition of Activities

  • Olivier Chomat
  • Jérôme Martin
  • James L. Crowley
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1842)


This paper presents a new technique for the perception and recognition of activities using statistical descriptions of their spatio-temporal properties. A set of motion energy receptive fields is designed to sample the power spectrum of a moving texture. Their structure relates to the spatio-temporal energy models of Adelson and Bergen, in which measures of local visual motion information are extracted by comparing the outputs of a triad of Gabor energy filters. The probability density function required for Bayes' rule is then estimated for each class of activity by computing multi-dimensional histograms of the outputs of the set of receptive fields. Activities are perceived by applying Bayes' rule: the result at each instant is a map of the conditional probabilities that each pixel belongs to each of the activities in the training set. Since activities are perceived over a short integration time, a temporal analysis of the outputs is performed using Hidden Markov Models.
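The histogram and Bayes' rule steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`quantize`, `train_histograms`, `posterior`), the bin count, the assumed response range of [0, 1], the uniform priors, and the toy response vectors are all assumptions made for illustration. Each vector stands for the outputs of the receptive fields at one pixel.

```python
from collections import Counter


def quantize(vector, lo=0.0, hi=1.0, bins=8):
    """Map each component of a response vector to a histogram bin index.

    The range [lo, hi] and the bin count are illustrative assumptions.
    """
    cell = []
    for v in vector:
        idx = int((v - lo) / (hi - lo) * bins)
        cell.append(min(max(idx, 0), bins - 1))  # clamp out-of-range values
    return tuple(cell)


def train_histograms(samples_by_class, bins=8):
    """Build one sparse multi-dimensional histogram per activity class."""
    hists = {}
    for label, samples in samples_by_class.items():
        counts = Counter(quantize(s, bins=bins) for s in samples)
        hists[label] = (counts, len(samples))
    return hists


def posterior(vector, hists, priors=None, bins=8):
    """Bayes' rule: p(a_k | w) = p(w | a_k) p(a_k) / sum_j p(w | a_j) p(a_j)."""
    cell = quantize(vector, bins=bins)
    labels = list(hists)
    if priors is None:
        # Assumption: uniform priors over the trained activities.
        priors = {k: 1.0 / len(labels) for k in labels}
    joint = {}
    for k in labels:
        counts, total = hists[k]
        likelihood = counts[cell] / total if total else 0.0
        joint[k] = likelihood * priors[k]
    evidence = sum(joint.values())
    if evidence == 0.0:
        return dict(priors)  # cell never seen in training: fall back to priors
    return {k: joint[k] / evidence for k in labels}


# Toy training data (hypothetical response vectors, not measured values).
training = {
    "walking": [(0.8, 0.1, 0.1), (0.9, 0.2, 0.1), (0.85, 0.15, 0.05)],
    "standing": [(0.1, 0.1, 0.1), (0.05, 0.1, 0.0), (0.1, 0.05, 0.1)],
}
hists = train_histograms(training, bins=4)
print(posterior((0.82, 0.12, 0.08), hists, bins=4))
# prints {'walking': 1.0, 'standing': 0.0}
```

Evaluating `posterior` at every pixel of a frame would give a probability map of the kind the abstract describes. The temporal analysis with Hidden Markov Models is not covered by this sketch.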

The approach is validated with experiments on the perception and recognition of the activities of people walking in visual surveillance scenarios. The work is in progress, and preliminary results are encouraging: recognition is robust to variations in illumination conditions, to partial occlusions, and to changes in texture. The approach is shown to constitute a powerful early vision tool for human behavior analysis in smart environments.


Hidden Markov Model, Recognition Rate, Activity Recognition, Probabilistic Sensor, Signal Decomposition
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. [AB85]
    E.H. Adelson and J.R. Bergen. Spatio-temporal energy models for the perception of motion. Journal of the Optical Society of America A, 2(2):284–299, 1985.
  2. [AB91]
    E.H. Adelson and J.R. Bergen. The plenoptic function and the elements of early vision. In M. Landy and J.A. Movshon, editors, Computational Models of Visual Processing. MIT Press, Cambridge, 1991.
  3. [BCG98]
    C. Biernacki, G. Celeux, and G. Govaert. Assessing a mixture model for clustering with integrated classification likelihood. Technical report, October 1998.
  4. [BD96]
    A. Bobick and J. Davis. An appearance-based representation of action. In Proc. Int. Conference on Pattern Recognition, pages 307–312, 1996.
  5. [BH95]
    A.M. Baumberg and D.C. Hogg. Learning spatiotemporal models from training examples. Technical Report 95.9, School of Computer Studies, Division of Artificial Intelligence, University of Leeds, March 1995.
  6. [BJ96]
    M.J. Black and A.D. Jepson. Eigentracking: Robust matching and tracking of articulated objects using a view-based representation. In European Conference on Computer Vision, pages 329–342, 1996.
  7. [CC98]
    V. Colin de Verdière and J.L. Crowley. Visual recognition using local appearance. In European Conference on Computer Vision, pages 640–654, 1998.
  8. [CTL93]
    T.F. Cootes, C.J. Taylor, A. Lanitis, D.H. Cooper, and J. Graham. Building and using flexible models incorporating gray-level information. In International Conference on Computer Vision, pages 242–246, May 1993.
  9. [Hee88]
    D.J. Heeger. Optical flow using spatio-temporal filters. International Journal of Computer Vision, pages 279–302, 1988.
  10. [Kv92]
    J.J. Koenderink and A.J. van Doorn. Generic neighborhood operators. Pattern Analysis and Machine Intelligence, 14(6):597–605, June 1992.
  11. [Lin98]
    T. Lindeberg. Feature detection with automatic scale selection. International Journal of Computer Vision, 30(2):79–116, 1998.
  12. [MN95]
    H. Murase and S.K. Nayar. Visual learning and recognition of 3-D objects from appearance. International Journal of Computer Vision, 14:5–24, 1995.
  13. [Sch97]
    B. Schiele. Object Recognition Using Multidimensional Receptive Field Histograms. PhD thesis, Institut National Polytechnique de Grenoble, July 1997.
  14. [SPH98]
    A. Spinei, D. Pellerin, and J. Herault. Spatio-temporal energy-based method for velocity estimation. Signal Processing, 65:347–362, 1998.
  15. [WADP96]
    C. Wren, A. Azarbayejani, T. Darrell, and A. Pentland. Pfinder: Real-time tracking of the human body. In International Conference on Automatic Face and Gesture Recognition, pages 51–56, 1996.
  16. [YB98]
    Y. Yacoob and M.J. Black. Parameterized modeling and recognition of activities. In International Conference on Computer Vision, 1998.

Copyright information

© Springer-Verlag Berlin Heidelberg 2000

Authors and Affiliations

  • Olivier Chomat
    • 1
  • Jérôme Martin
    • 1
  • James L. Crowley
    • 1
  1. Project PRIMA, Lab GRAVIR - IMAG, INRIA Rhône-Alpes, Montbonnot, France
