Skip to main content

Video Content Representation as Salient Regions of Activity

  • Conference paper
Image and Video Retrieval (CIVR 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3115))

Included in the following conference series:

Abstract

In this paper, we present a generic and robust representation of video shots content expressed in terms of salient regions of activity. The proposed approach is based on salient points of the image space, thus minimizing the computational effort. Salient points are extracted from each frame. Their trajectories are computed between successive frames and the global motion model is estimated. Moving salient points are selected from which salient regions are estimated using an adaptive Mean-Shift process, based on the statistical properties of the point neighborhoods. The salient regions are then matched along the stream, using the salient points trajectories. The information carried by the proposed salient regions of activity is evaluated and we show that such a representation of the content forms suitable input for video content interpretation algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Baumberg, A.: Reliable feature matching across widely separated views. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 774–781 (2000)

    Google Scholar 

  2. Comaniciu, D., Ramesh, V., Meer, P.: Real-time tracking of non-rigid objects using mean shift. pp. 142–151 (2000)

    Google Scholar 

  3. Fisher, M.A., Bolles, R.C.: Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 381–395 (1981)

    Google Scholar 

  4. Gouet, V., Boujemaa, N.: On the robustness of color points of interest for image retrieval. In: IEEE International Conference on Image Processing ICIP 2002 (2002)

    Google Scholar 

  5. Harris, C., Stephens, M.: A combined corner and edge detector. In: 4th Alvey Vision Conference, pp. 189–192 (1988)

    Google Scholar 

  6. Lee, J.-S., Sun, Y.-N., Chen, C.-H.: Multiscale corner detection by using wavelet transform. IEEE Transactions on Image Processing (1995)

    Google Scholar 

  7. Lindeberg, T.: Feature detection with automatic scale selection. International Journal of Computer Vision 30(2), 77–116 (1998)

    Google Scholar 

  8. Lowe, D.G.: Object recognition from local scale-invariant features. In: Proc. of the International Conference on Computer Vision ICCV, Corfu, pp. 1150–1157 (1999)

    Google Scholar 

  9. Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: 8th Internationnal Conference on Computer Vision, pp. 525–531 (2001)

    Google Scholar 

  10. Mokhtarian, F., Suomela, R.: Robust image corner detection through curvature scale space. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI 20(12), 1376–1381 (1998)

    Article  Google Scholar 

  11. Schmid, C., Mohr, R.: Local grayvalue invariants for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI 19(5) (1997)

    Google Scholar 

  12. Smith, S.M., Brady, J.M.: SUSAN – A new approach to low level image processing. Technical Report TR95SMS1c, Chertsey, Surrey, UK (1995)

    Google Scholar 

  13. Stiller, C., Konrad, J.: Estimating motion in image sequences: A tutorial on modeling and computation of 2D motion. IEEE Signal Process 16, 70–91 (1999)

    Article  Google Scholar 

  14. Tian, Q., Sebe, N., Lew, M.S., Loupias, E., Huang, T.S.: Image retrieval using wavelet-based salient points. Journal of Electronic Imaging, Special Issue on Storage and Retrieval of Digital Media, 835–849 (2001)

    Google Scholar 

  15. Torr, P.H.S., Zisserman, A.: Feature based methods for structure and motion estimation. In: Workshop on Vision Algorithms, pp. 278–294 (1999)

    Google Scholar 

  16. Zelnik-Manor, L., Irani, M.: Event-based analysis of video. In: IEEE Conference on Computer Vision and Pattern Recognition (2001)

    Google Scholar 

  17. Zhang, Z., Deriche, R., Faugeras, O.D., Luong, Q.-T.: A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry. Artificial Intelligence 78(1-2), 87–119 (1995)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Moënne-Loccoz, N., Bruno, E., Marchand-Maillet, S. (2004). Video Content Representation as Salient Regions of Activity. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27814-6_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22539-3

  • Online ISBN: 978-3-540-27814-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics