Abstract
The MPEG-7 standard is a step towards standardizing the description of multimedia content so that quick and efficient identification of relevant content can be facilitated, together with efficient management of information. The description definition language (DDL) is a schema language to represent valid MPEG-7 descriptors and description schemes. MPEG-7 instances are XML documents that conform to a particular MPEG-7 schema, as expressed in the DDL and that describe audiovisual content. In this paper, we pick one of the visual descriptors related to motion in a video sequence, viz., motion trajectory. It describes the displacements of objects in time, where an object is defined as a spatiotemporal region or set of spatiotemporal regions. We present a method of automatically extracting trajectories from video sequences and generating an XML document that conforms to the MPEG-7 schema. We use sports videos in particular, because the trajectories are very random and the robustness of our algorithm can be demonstrated.
Similar content being viewed by others
References
B. Bascle and R. Deriche, “Region tracking through image sequences,” in Proceedings of 5th International Conference on Computer Vision, Cambridge, MA, 1995.
A. Cavallaro, O. Steiger, and T. Ebrahimi, “Multiple video object tracking in complex scenes,” in Proceedings of ACM Multimedia 2002, Dec. 2002, pp. 523–532.
Y. Chen, Y. Rui, and T.S. Huang, “JPDAF based HMM for real-time contour tracking,” in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Kauai, Hawaii, 2001, Vol. 1, pp. 543–550.
D. Comaniciu, V. Ramesh, and P. Meer, “Real-time tracking of nonrigid objects using mean shift,” in Proc. IEEE Conf. on Computer Vision and Pattern Recognition, Hilton Head, SC, 2000, Vol. 2, pp. 142–149.
D. Comaniciu, V. Ramesh, and P. Meer, “Kernel-based object tracking,” IEEE Transaction on Pattern Analysis Machine Intelligence, Vol. 25, No. 5, pp. 564–575, 2003.
A. Ekin and A. Tekalp, “A framework for analysis and tracking of soccer video,” in Visual Communication and Image Processing (VCIP), San Jose, CA, Jan 2002.
A. Gueziec, “Tracking pitches for broadcast television,” IEEE Computer, Vol. 35, No. 3, pp. 38–43, 2002.
S. Intille and A. Bobick, “Recognizing planned, multi-person action,” Computer Vision and Image Understanding, Vol. 81, No. 3, pp. 414–445, 2001.
ISO/IEC 15938-8:2002. Multimedia content description interface-part 8: Extraction and use of MPEG-7 descriptors, 2002.
B. Manjunath, P. Salembier, and T. Sikora, Introduction to MPEG-7: Multimedia Content Descriptioninterface, Chichester: New York, New York: Wiley, NY, USA, 2002.
R. Milanese, F. Deguillaume, and A. Jacot-Descombes, “Video segmentation and camera motion characterization using compressed data,” Multimedia Storage and Archiving Systems II(SPIE Proceedings), 1997, p. 3229.
N. Patel and I. Sethi, “Video shot detection and characterization for video databases,” Pattern Recognition, Vol. 30, pp. 607–625, 1997.
G. Pingali, Y. Jean, and I. Carlbom, “Real time tracking for enhanced tennis broadcasts,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1998, pp. 260–265.
C. Rafael and E. Richard, Digital Image Processing 2nd Edition, Addison-Wesley, 1992.
G. Robertson, J. Machinlay, and S. Card, “Cone trees: Animated 3d visualization of hierachical information,” in Proceedings of ACM SIGCHI conference on Human Factors in Computing System’91, 1991.
V. Tovinkere and R. Qian, “Detection of semantic events in soccer games: Towards a complete solution,” in IEEE International Conference on Multimedia and Expo(ICME), Aug. 2001.
P. Xu, L. Xie, S.-F. Chang, A. Divakaran, A. Vetro, and H. Sun, “Algorithms and system for segmentation and structure analysis in soccer video,” in IEEE Conference on Multimedia and Expo(ICME), Tokyo, Aug. 2001.
Author information
Authors and Affiliations
Corresponding author
Additional information
Haoran Yi received the B.S. degree in Electrical and Information Engineering from Huazhong University of Science and Technology, Wuhan, PR. China in 2002. He is working for his Ph.D. degree in the School of Computer Engineering at Nanyang Technological University, Singapore now. His research interests include content-based video analysis and representation, image understanding and other issues on image and video technology.
Deepu Rajan received the BE degree in Electronics and Communication Engineering from Birla Institute of Technology, Ranchi, the MS degree in Electrical Engineering from Clemson University and the Ph.D. degree from Indian Institute of Technology, Bombay. From April 1992 until May 2002, he was a lecturer in the Department of Electronics at Cochin University of Science and Technology, India. Since June 2002, he has been Assistant Professor in the School of Computer Engineering at Nanyang Technological University, Singapore. His research interests include image and video processing, computer vision and multimedia signal processing.
Liang-Tien Chia received the B.Sc. and Ph.D degrees from Loughborough University, in 1990 and 1994, res-pectively. He is the director for the Centre of Multimedia and Network Technology and also an Associate Professor in the Division of Computer Communications, School of Computer Engineering, Nanyang Technological University, Singapore. His research interests include image/video processing, image/video coding, multimedia adaptation/transmission. He has published over 50 research papers.
Rights and permissions
About this article
Cite this article
Yi, H., Rajan, D. & Chia, LT. Automatic Generation of MPEG-7 Compliant XML Document for Motion Trajectory Descriptor in Sports Video. Multimed Tools Appl 26, 191–206 (2005). https://doi.org/10.1007/s11042-005-0450-8
Issue Date:
DOI: https://doi.org/10.1007/s11042-005-0450-8