Abstract
Automatic annotation of semantic events allows effective retrieval of video content. In this work, we present solutions for highlight detection in sports videos. The proposed approach exploits the typical structure of a wide class of sports videos, namely those of sports played in delimited venues with playfields of well-known geometry, such as soccer, basketball, swimming, and track and field disciplines. For these sports, we present a modeling scheme, generally applicable across the class, that is based on a limited set of visual cues and on finite state machines that encode the temporal evolution of highlights. The visual cues encode position and speed information for the camera and for the objects and athletes present in the scene, and are estimated automatically from the video stream. Algorithms for model checking and for visual cue estimation are discussed, as well as applications of the representation to different sports domains.
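The idea of a finite state machine driven by visual cues can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the `Cues` fields, the state names, the zone labels, and the numeric thresholds are all hypothetical. Each transition is guarded by a predicate on the current frame's cues, and a highlight is reported when the machine reaches the accepting state.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List, Tuple


@dataclass(frozen=True)
class Cues:
    """Per-frame visual cues (hypothetical fields, for illustration only)."""
    camera_speed: float  # magnitude of camera motion, arbitrary units
    zone: str            # playfield zone in view, e.g. "midfield", "goal_area"


Guard = Callable[[Cues], bool]

# Hypothetical model of an attack-like highlight: fast camera motion that
# reaches the goal area and then settles. Thresholds are made up.
TRANSITIONS: Dict[str, List[Tuple[Guard, str]]] = {
    "idle": [
        (lambda c: c.camera_speed > 5.0, "fast_pan"),
    ],
    "fast_pan": [
        (lambda c: c.zone == "goal_area", "near_goal"),
        (lambda c: c.camera_speed <= 5.0, "idle"),
    ],
    "near_goal": [
        (lambda c: c.camera_speed < 1.0, "accepted"),
        (lambda c: c.zone != "goal_area", "idle"),
    ],
    "accepted": [],
}


def detect_highlight(cues: Iterable[Cues],
                     start: str = "idle",
                     accept: str = "accepted") -> bool:
    """Run the machine over a cue sequence; True if the accept state is reached."""
    state = start
    for c in cues:
        # Take the first transition whose guard holds; otherwise stay put.
        for guard, target in TRANSITIONS[state]:
            if guard(c):
                state = target
                break
        if state == accept:
            return True
    return False


attack = [
    Cues(2.0, "midfield"),
    Cues(8.0, "midfield"),
    Cues(7.0, "goal_area"),
    Cues(0.5, "goal_area"),
]
aborted = [
    Cues(8.0, "midfield"),
    Cues(2.0, "midfield"),
    Cues(0.5, "goal_area"),
]
```

Under these assumed guards, `detect_highlight(attack)` reaches the accepting state, while `detect_highlight(aborted)` falls back to `idle` when the camera slows before the goal area comes into view. A different highlight or sport would, in this sketch, only need a different transition table over the same cue fields.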
Cite this article
Bertini, M., Bimbo, A.D. & Nunziati, W. Common Visual Cues for Sports Highlights Modeling. Multimed Tools Appl 27, 215–218 (2005). https://doi.org/10.1007/s11042-005-2575-1