Segmentation Techniques for Video Sequences in the Domain of MPEG-Compressed Data

Koprinska, Irena; Carrato, Sergio

doi:10.1007/0-306-48718-7_3

Abstract

Video segmentation into shots is the first step in content-based analysis of digital video. This chapter provides a comprehensive taxonomy and critical survey of the existing techniques for video segmentation that operate on MPEG video streams. Their performance, relative merits, and limitations are discussed and contrasted. The gradual development of the techniques and their similarities with the video segmentation methods operating on uncompressed video are also considered.

Author information

Authors and Affiliations

School of Information Technologies, University of Sydney, Sydney, NSW 2006, Australia
Irena Koprinska
Dept. of Electrical Engineering and Computer Science, University of Trieste, v. Valerio, 10, 34100, Trieste, Italy
Sergio Carrato

Editor information

Editors and Affiliations

University of Ljubljana, Slovenia
Jurij F. Tasič
University of Bordeaux I, France
Mohamed Najim
University of Neuchâtel, Switzerland
Michael Ansorge

About this chapter

Cite this chapter

Koprinska, I., Carrato, S. (2003). Segmentation Techniques for Video Sequences in the Domain of MPEG-Compressed Data. In: Tasič, J.F., Najim, M., Ansorge, M. (eds) Intelligent Integrated Media Communication Techniques. Springer, Boston, MA. https://doi.org/10.1007/0-306-48718-7_3

DOI: https://doi.org/10.1007/0-306-48718-7_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-7552-0
Online ISBN: 978-0-306-48718-7
eBook Packages: Springer Book Archive
