Abstract
In order to provide more efficient content-based functionalities for video applications, it is necessary to extract meaningful video regions from scenes as perceptual-oriented representation of video content. We present a generalpurpose content representation framework for video sequences that employs fuzzy information granulation to capture human perception subjectivity. In particular, the main purpose is to extract spatial-temporal salient grain that is fundamental element for content representation of video sequences. Since perceptual saliency for visual information is a subjective concept, a class-related fuzzy information granulation is constructed for each feature of homogenous regions, mapping original feature space to concepts space. To detect spatial salient regions, segmented homogenous regions are classified according to their prominent importance. After salient region detection, a region tracking mechanism is proposed based on region temporal consistency analysis. The tracking results are sequences of coherent salient regions, called spatial-temporal salient grain. Salient grain can be used to obtain meaningful perceptual-oriented unit in a high-level content description scheme. The experimental results using different types of video sequences are presented to demonstrate the efficiency and accuracy of our proposed algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
S. Antani, R. Kasturi and R. Jain, “A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video”, Pattern Recognition Vol.35 pp. 945-965, 2002.
Y. Li, S. Narayanan and C.-C. Jay Kuo, “Content-based Movie Anaysis and Indexing Based on AudioVisual Cues”, IEEE Transactions on Circuits and Systems for Video Technology, Vol.14(8), pp. 1073-1085, 2004.
H.J.Zhang, J.Wu, D.Zhong and S.Smoliar, “An integrated system for content-based video retrieval and browsing”, Pattern Recognition, Vol.30(4) pp.643-658, 1997.
D. Gatica-Perez, C.Gu, and M.-T. Sun, “Semantic video object extraction using four-band watershed and partition lattice operators”, IEEE Transactions on Circuits and Systems for Video Technology, Vol.11, pp. 603-618, 2001.
H. Xu, A. A. Younis and M. R. Kabuka,” Automatic Moving Object Extraction for Content-based Applications”, IEEE Transactions on Circuits and Systems for Video Technology, Vol.14(6), pp. 796-812, 2004.
A.S. Ogale, C. Fermuller and Y.Aloimonosh, “Motion segmentation using occlusions”, IEEE Transactions On Pattern Analysis and Machine Intelligence, Vol.27(6), pp. 988-992, 2005.
J. Senders, “Distribution of Attention in Static and Dynamic Scenes”, Proceedings SPIE 3026, pp. 186-194, 1997.
A. Yarbus, Eye Movements and Vision. Plenum Press, NewYork NY, (1967).
L. Itti, C. Koch and E. Niebur, “A Model of Saliency-Based Visual Attention for Rapid Scene Analysis”, IEEE Transactions On Pattern Analysis and Machine Intelligence, Vol.20(11), pp. 1254-1259, 1998.
T. Lindeberg, “Detecting salient blob-like image structures and their scales with a scale-space primal sketch: a method for focus-of-attention”, International Journal of Computer Vision, Vol.11, pp.283-318, 1993.
Y.F. Ma and H.J. Zhang, “A Model of Motion Attention for Video Skimming”, Proceedings of ICIP (2002) pp. 22-25, 2002.
L. Zadeh, “A Note on Web Intelligence, World Knowledge and Fuzzy Logic”, Data & Knowledge Engineering, Vol.50, pp. 291-304, 2004.
J. G. Shanahan, “Soft Computing for Knowledge Discovery: Introducing Cartesian Granule Features”, Kluwer Academic Publishers. (2000)
D. Comaniciu and P. Meer, “Mean Shift: A Robust Approach toward Feature Space Analysis”, IEEE Trans. Pattern Analysis Machine Intelligence, Vol.24(5), pp. 603-619, 2002.
D. Bordwell and K. Thompson, “Film Art: An Introduction”, McGraw-Hill Higher Education, 2001.
W. Pedrycz and A.V. Vasilakos, “Linguistic Models and Linguistic Modeling”, IEEE Trans. Systems, Man, and Cybernetics, Vol.29(6), pp. 745-757, 1999.
W. Pedrycz and S. Gacek, “Temporal granulation and its application to signal analysis”, Information Sciences, Vol.143, pp. 47-71, 2002.
W. Pedrycz and F. Gomide, “An Introduction to Fuzzy Sets. Analysis and Design”, Cambridge, MA:MIT Press, 1998.
W. Pedrycz and G. Vukovich, “On Elicitation of Membership Functions”, IEEE Transactions On Systems, Man, and Cybernetics - Part A: Systems and Humans, Vol.32(6), pp. 761-767, 2002.
L. Congyan and X. De, “Perception-Oriented Prominent Region Detection in Video Sequences”, Informatica: International Journal of Computing and Informatics (to appear in Vol.29(3) 2005).
J.F. Baldwin, J. Lawry and T.P. Martin, “A Mass Assignment Method for Prototype Induction”, International Journal of Intelligent Systems, Vol. 14(10), pp. 1041-1070, 1999.
S. Hongeng, R. Nevatia and F. Bremond, “Video-based event recognition: activity representation and probabilistic recognition methods”, Computer Vision and Image Understanding Vol.96, pp. 129-162, 2004.
R. von Mises, “Mathematical Theory of Probability and Statistics”, Academic Press, New York, 1964.
B.C. Ko, S.Y. Kwak and H. Byun, “SVM-based Salient Regions Extraction Method for Image Retrieval”, Proceedings of the 17th International Conference on Pattern Recognition (ICPR’04), 2004.
J. Zacks, B. Tversky and G. Iyer, “Perceiving, Remembering, and Communicating Structure in Events”, Journal of Experimental Psychology: General 130(1), pp. 29-58, 2001.
Y.-L. Kang, J.-H. Lim, Q. Tian, M.S. Kankanhalli and C.-S. Xu, “Visual Keywords Labeling in Soccer Video”, Proceedings of IEEE ICPR, pp. 850-853, 2004.
P. Xu, et al, “Algorithms and System for Segmentation and Structure Analysis in Soccer Video”, Proceedings of IEEE ICME, pp. 928-931, 2001.
V. Navalpakkam and L. Itti, “Modeling the influence of task on attention”, Vision Research, Vol.45(2), pp. 205-231, 2005.
L. A. Zadeh, “From Computing with Numbers to Computing with Words-From Manipulation of Measurements to Manipulation of Perceptions”, IEEE Trans. On Circuits and Systems-I: Fundamental Theory and Applications, Vol.45(1), pp. 105-119, 1999.
L. A. Zadeh, “Information granulation lies at the center of human reasoning and concept formation”, Abstract of BISC Seminar, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this chapter
Cite this chapter
Lang, C., Xu, D., Yu, J. (2007). The Use of Fuzzy Information Granular for Content Representation of Video Sequences. In: Nachtegael, M., Van der Weken, D., Kerre, E.E., Philips, W. (eds) Soft Computing in Image Processing. Studies in Fuzziness and Soft Computing, vol 210. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-38233-1_10
Download citation
DOI: https://doi.org/10.1007/978-3-540-38233-1_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-38232-4
Online ISBN: 978-3-540-38233-1
eBook Packages: EngineeringEngineering (R0)