Abstract
Video segmentation into shots is the first step in content-based analysis of digital video. This chapter provides a comprehensive taxonomy and critical survey of the existing techniques for video segmentation operating on MPEG video stream. Their performance, relative merits and limitations are discussed and contrasted. The gradual development of the techniques and their similarities with the video segmentation methods operating on uncompressed video are also considered.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
H. Okamoto, M. Nakamura, Y. Hatanaka, and S. Yamazaki, “A consumer digital VCR for advanced television,” IEEE Transactions on Consumer Electronics, vol. 39, pp. 199–204, 1993.
“SMASH project.” http://www.extra.research.philips.com/euprojects/smash/, Feb. 2003.
J. Y. Chen, C. Taskiran, E. Delp, and C. A. Bouman, “ViBE: A new paradigm for video database browsing and search,” in IEEE Workshop on Content-Based Access of Image and Video Libraries, (Santa Barbara, USA), pp. 96–100, 1998.
H. J. Zhang, J. Wu, D. Zhong, and S. Smoliar, “An integrated system for content-based video retrieval and browsing,” Pattern Recognition, vol. 30, no. 4, pp. 643–658, 1997.
S. F. Chang, W. Chen, H. J. Meng, H. Sundaram, and D. Zhong, “VideoQ: An automated content based video search system using visual cues,” in ACM Multimedia Conf., (Seattle, USA), pp. 313–324, 1997.
W. Niblack, X. Zhu, J. L. Hafner, T. Breuer, D. B. Ponceleon, D. Petkovic, M. D. Flickner, E. Upfal, S. I. Nin, S. Sull, B. E. Dom, B. L. Yeo, S. Srinivasan, D. Zivkovic, and M. Penner, “Updates to the QBIC system,” in IS&T/SPIE Conf. Storage and Retrieval for Image and Video Databases VI, vol. 3312, pp. 150–161, 1997.
M. Smith and T. Kanade, “Video skimming and characterization through the combination of image and language understanding,” in Proc. of the 1998 IEEE International Workshop on Content-Based Access of Image and Video Databases (ICCV’98), (Bombay, India), pp. 61–70, 1998.
D. Zhong, H. Zhang, and S. F. Chang, “Clustering methods for video browsing and annotation,” in IS&T/SPIE Storage and Retrieval for Still Image and Video databases IV, vol. 2670, pp. 239–246, 1996.
M. Yeung and B. L. Yeo, “Time-constrained clustering for segmentation of video into story units,” in Proceedings of the 13th International Conference on Pattern Recognition, vol. 3, (Los Alamitos, USA), pp. 375–380, IEEE Comput. Soc. Press, 1996.
M. Yeung and B.-L. Yeo, “Video visualization for compact presentation and fast browsing of pictorial content,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 7, no. 5, pp. 771–785, 1997.
A. Hanjalic and R. L. Lagendijk, “Automated high-level movie segmentation for advanced video-retrieval systems,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 4, pp. 580–588, 1999.
Q. Huang, Z. Liu, and A. Rosenberg, “Automated semantic structure reconstruction and representation generation for broadcast news,” in IS&T/SPIE Conference on Storage and Retrieval for Image and Video databases VII, vol. 3656, pp. 50–62, 1999.
G. Davenport, T. A. Smith, and N. Pincever, “Cinematic primitives for multimedia,” IEEE Transactions on Computer Graphics Applications, vol. 11, no. 4, pp. 67–74, 1991.
A. Hampapur, R. Jain, and T. E. Weymouth, “Production model based digital video segmentation,” Multimedia Tools and Applications, vol. 1, no. 1, pp. 9–46, 1995.
R. Zabih, J. Miler, and K. Mai, “A feature-based algorithm for detecting and classifying production effects,” Multimedia Systems, vol. 7, no. 2, pp. 119–128, 1999.
H. J. Zhang, C. Y. Low, and S. W. Smoliar, “Video parsing and browsing using compressed data,” Multimedia Tools and Applications, vol. 1, pp. 89–111, 1995.
G. Ahanger and T. D. C. Little, “A survey of technologies for parsing and indexing digital video,” Journal of Visual Communication and Image Representation, vol. 7, no. 1, pp. 28–43, 1996.
F. Idris and S. Panchanathan, “Review of image and video indexing techniques,” Journal of Visual Communication and Image Representation, vol. 8, no. 2, pp. 146–166, 1997.
R. M. Ford, C. Robson, D. Temple, and M. Gerlach, “Metrics for shot boundary detection in digital video sequences,” Multimedia Systems, vol. 8, pp. 37–46, 2000.
A. Dailianas, R. B. Allen, and P. England, “Comparisons of automatic video segmentation algorithms,” in Integration Issues in Large Commercial Media Delivery Systems, no. 2615, pp. 2–16, 1995.
T. Kikukawa and S. Kawafuchi, “Development of an automatic summary editing system for the audio-visual resources,” Transactions on Electronics and Information, pp. 204–212, 1992.
A. Nagasaka and Y. Tanaka, Visual Database Systems II, ch. Automatic video indexing and full-video search for object appearances, pp. 113–127. Elsevier, 1995.
H. J. Zhang, A. Kankanhalli, and S. W. Smoliar, “Automatic partitioning of full-motion video,” Multimedia Systems, vol. 1, no. 1, pp. 10–28, 1993.
R. Kasturi and R. Jain, Computer Vision: Principles, ch. Dynamic vision, pp. 469–480. Washington DC, USA: IEEE Computer Society Press, 1991.
B. Shahraray, “Scene change detection and content-based sampling of video sequences,” in Digital Video Compression: Algorithms and Technologies (R. J. Safranek and A. A. Rodriquez, eds.), vol. 2419, pp. 2–13, Feb. 1995.
W. Xiong, J. C. M. Lee, and M. C. Ip, “Net comparison: a fast and effective method for classifying image sequences,” in Storage and Retrieval for Image and Video Databases III, vol. 2420, (San Jose, USA), pp. 318–328, 1995.
W. Xiong and J. C. M. Lee, “Efficient scene change detection and camera motion annotation for video classification,” Computer Vision and Image Understanding, vol. 71, no. 2, pp. 166–181, 1998.
M. J. Swain, “Interactive indexing into image databases,” in SPIE Conf. Storage and Retrieval in Image and Video Databases, pp. 173–187, 1993.
Y. Tonomura, “Video handling based on structured information for hypermedia systems,” in ACM Int. Conf. on Multimedia Information Systems, (Singapore), pp. 333–344, 1991.
U. Gargi, S. Oswald, D. Kosiba, S. Devadiga, and R. Kasturi, “Evaluation of video sequence indexing and hierarchical video indexing,” in SPIE Conf. Storage and Retrieval in Image and Video Databases, pp. 1522–1530, 1995.
J. S. Boreczky and L. A. Rowe, “Comparison of video shot boundary detection techniques,” in IS&T/SPIE Intern. Symposium Electronic Imaging: Storage and Retrieval for Image and Video Databases, (San Jose, USA), pp. 170–179, 1996.
D. Swanberg, C. F. Shu, and R. Jain, “Knowledge guided parsing in video databases,” in Proc. of the Int. Conf. on Storage and Retrieval for Image and Video Databases, (San Jose, USA), pp. 13–24, 1993.
C. M. Lee and D. M. C. Ip, “A robust approach for camera break detection in color video sequences,” in IAPR Workshop Machine Vision Appl., (Kawasaki, Japan), pp. 502–505, 1994.
B. Gunsel, A. M. Ferman, and A. M. Tekalp, “Temporal video segmentation using unsupervised clustering and semantic object tracking,” Journal of Electronic Imaging, vol. 7, no. 3, pp. 592–604, 1998.
A. M. Ferman and A. M. Tekalp, “Efficient filtering and clustering for temporal video segmentation and visual summarization,” Journal of Visual Communication and Image Representation, vol. 9, no. 4, pp. 336–351, 1998.
P. Aigrain and P. Joly, “The automatic real-time analysis of film editing and transition effects and its applications,” Computers and Graphics, vol. 18, no. 1, pp. 93–103, 1994.
H. Yu, G. Bozdagi, and S. Harrington, “Feature-based hierarchical video segmentation,” in Int. Conf. on Image Processing (ICIP97), (Santa Barbara, USA), pp. 498–501, 1997.
J. S. Boreczky and L. D. Wilcox, “A hidden Markov model framework for video segmentation using audio and image features,” in Int. Conf. Acoustics, Speech, and Signal Proc. 6, (Seattle, USA), pp. 3741–3744, 1998.
ISO/EEC, “13818 Draft Int. Standard: Generic Coding of Moving Pictures and Associated Audio, Part 2: video.”
F. Arman, A. Hsu, and M.-Y. Chiu, “Image processing on compressed data for large video databases,” in First ACM Intern. Conference on Multimedia, pp. 267–272, 1993.
B. Yeo and B. Liu, “Rapid scene analysis on compressed video,” IEEE Transactions on Circuits & Systems for Video Technology, vol. 5, no. 6, pp. 533–544, 1995.
K. Shen and E. Delp, “A fast algorithm for video parsing using MPEG compressed sequences,“ in Intern. Conf. Image Processing (ICIP’96), (Lausanne, Switzerland), pp. 69–72, 1996.
C. Taskiran and E. Delp, “Video scene change detection using the generalized sequence trace,” in IEEE Int. Conf. Acoustics, Speech & Signal Processing, (Seattle, USA), pp. 2961–2964, 1998.
N. V. Patel and I. K. Sethi, “Video shot detection and characterization for video databases,” Pattern Recognition, vol. 30, pp. 583–592, 1997.
I. K. Sethi and N. V. Patel, “A statistical approach to scene change detection,” in IS&T/SPIE Conf. Storage and Retrieval for Image and Video Databases III, vol. 2420, (San Jose, CA, USA), pp. 2–11, 1995.
J. Meng, Y. Juan, and S. F. Chang, “Scene change detection in a MPEG compressed video sequence,” in IS&T/SPIE Int. Symp. Electronic Imaging, vol. 2417, (San Jose, CA, USA), pp. 14–25, 1995.
I. Koprinska and S. Carrato, “Detecting and classifying video shot boundaries in MPEG compressed sequences,” in IX Eur. Sig. Proc. Conf. (EUSIPCO), (Rhodes, Greece), pp. 1729–1732, 1998.
T. Kohonen, “The self-organizing map,” Proc. of the IEEE, vol. 78, no. 9, pp. 1464–1480, 1990.
J. Feng, K. T. Lo, and H. Mehrpour, “Scene change detection algorithm for MPEG video sequence,” in Int. Conf. Image Processing (ICIP96), (Lausanne, Switzerland), pp. 821–824, 1996.
U. Gargi, R. Kasturi, and S. Antani, “Performance characterization and comparison of video indexing algorithms,” in Conf. Computer Vision and Pattern Recognition (CVPR), (Santa Barbara CA, USA), pp. 559–565, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Kluwer Academic Publishers
About this chapter
Cite this chapter
Koprinska, I., Carrato, S. (2003). Segmentation Techniques for Video Sequences in the Domain of MPEG-Compressed Data. In: Tasič, J.F., Najim, M., Ansorge, M. (eds) Intelligent Integrated Media Communication Techniques. Springer, Boston, MA. https://doi.org/10.1007/0-306-48718-7_3
Download citation
DOI: https://doi.org/10.1007/0-306-48718-7_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-7552-0
Online ISBN: 978-0-306-48718-7
eBook Packages: Springer Book Archive