Abstract
In this paper, we present a generic and robust representation of video shots content expressed in terms of salient regions of activity. The proposed approach is based on salient points of the image space, thus minimizing the computational effort. Salient points are extracted from each frame. Their trajectories are computed between successive frames and the global motion model is estimated. Moving salient points are selected from which salient regions are estimated using an adaptive Mean-Shift process, based on the statistical properties of the point neighborhoods. The salient regions are then matched along the stream, using the salient points trajectories. The information carried by the proposed salient regions of activity is evaluated and we show that such a representation of the content forms suitable input for video content interpretation algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Baumberg, A.: Reliable feature matching across widely separated views. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 774–781 (2000)
Comaniciu, D., Ramesh, V., Meer, P.: Real-time tracking of non-rigid objects using mean shift. pp. 142–151 (2000)
Fisher, M.A., Bolles, R.C.: Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography. Communications of the ACM, 381–395 (1981)
Gouet, V., Boujemaa, N.: On the robustness of color points of interest for image retrieval. In: IEEE International Conference on Image Processing ICIP 2002 (2002)
Harris, C., Stephens, M.: A combined corner and edge detector. In: 4th Alvey Vision Conference, pp. 189–192 (1988)
Lee, J.-S., Sun, Y.-N., Chen, C.-H.: Multiscale corner detection by using wavelet transform. IEEE Transactions on Image Processing (1995)
Lindeberg, T.: Feature detection with automatic scale selection. International Journal of Computer Vision 30(2), 77–116 (1998)
Lowe, D.G.: Object recognition from local scale-invariant features. In: Proc. of the International Conference on Computer Vision ICCV, Corfu, pp. 1150–1157 (1999)
Mikolajczyk, K., Schmid, C.: Indexing based on scale invariant interest points. In: 8th Internationnal Conference on Computer Vision, pp. 525–531 (2001)
Mokhtarian, F., Suomela, R.: Robust image corner detection through curvature scale space. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI 20(12), 1376–1381 (1998)
Schmid, C., Mohr, R.: Local grayvalue invariants for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI 19(5) (1997)
Smith, S.M., Brady, J.M.: SUSAN – A new approach to low level image processing. Technical Report TR95SMS1c, Chertsey, Surrey, UK (1995)
Stiller, C., Konrad, J.: Estimating motion in image sequences: A tutorial on modeling and computation of 2D motion. IEEE Signal Process 16, 70–91 (1999)
Tian, Q., Sebe, N., Lew, M.S., Loupias, E., Huang, T.S.: Image retrieval using wavelet-based salient points. Journal of Electronic Imaging, Special Issue on Storage and Retrieval of Digital Media, 835–849 (2001)
Torr, P.H.S., Zisserman, A.: Feature based methods for structure and motion estimation. In: Workshop on Vision Algorithms, pp. 278–294 (1999)
Zelnik-Manor, L., Irani, M.: Event-based analysis of video. In: IEEE Conference on Computer Vision and Pattern Recognition (2001)
Zhang, Z., Deriche, R., Faugeras, O.D., Luong, Q.-T.: A robust technique for matching two uncalibrated images through the recovery of the unknown epipolar geometry. Artificial Intelligence 78(1-2), 87–119 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Moënne-Loccoz, N., Bruno, E., Marchand-Maillet, S. (2004). Video Content Representation as Salient Regions of Activity. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_46
Download citation
DOI: https://doi.org/10.1007/978-3-540-27814-6_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22539-3
Online ISBN: 978-3-540-27814-6
eBook Packages: Springer Book Archive