Common Visual Cues for Sports Highlights Modeling

Multimedia Tools and Applications

Abstract

Automatic annotation of semantic events allows effective retrieval of video content. In this work, we present solutions for highlights detection in sports videos. The proposed approach exploits the typical structure of a wide class of sports videos: those of sports played in delimited venues with playfields of well-known geometry, such as soccer, basketball, swimming, and track and field disciplines. For these sports, we present a modeling scheme, generally applicable across the class, that is based on a limited set of visual cues and on finite state machines that encode the temporal evolution of highlights. The visual cues encode position and speed information coming from the camera and from the objects and athletes present in the scene, and are estimated automatically from the video stream. We discuss algorithms for model checking and for visual cue estimation, as well as applications of the representation to different sports domains.
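
To make the idea of a finite state machine driven by visual cues concrete, the following is a minimal Python sketch. It is an illustration only, not the model described in the article: the cue names (`camera_speed`, `zone`, `players_in_zone`), the states, and all thresholds are hypothetical assumptions chosen for the example.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple


@dataclass(frozen=True)
class Cues:
    """Per-frame visual cues (hypothetical names and units)."""
    camera_speed: float    # normalised camera pan speed in [0, 1]
    zone: str              # playfield zone currently framed
    players_in_zone: int   # athletes detected in the framed zone


Guard = Callable[[Cues], bool]

# state -> ordered list of (guard, next_state).
# The first guard that holds fires; if none holds, the state is kept.
# States and thresholds are illustrative assumptions.
TRANSITIONS: Dict[str, List[Tuple[Guard, str]]] = {
    "idle": [
        (lambda c: c.camera_speed > 0.6 and c.zone == "midfield", "fast_pan"),
    ],
    "fast_pan": [
        (lambda c: c.zone == "goal_area" and c.players_in_zone >= 3, "highlight"),
        (lambda c: c.camera_speed < 0.2, "idle"),
    ],
    "highlight": [],  # accepting state, no outgoing transitions
}


def detect(frames: List[Cues], start: str = "idle", accept: str = "highlight") -> bool:
    """Return True if the cue sequence drives the machine into the accepting state."""
    state = start
    for cues in frames:
        for guard, next_state in TRANSITIONS[state]:
            if guard(cues):
                state = next_state
                break
        if state == accept:
            return True
    return False


attack = [
    Cues(0.1, "midfield", 2),   # quiet play, machine stays idle
    Cues(0.8, "midfield", 2),   # fast pan begins
    Cues(0.7, "goal_area", 4),  # camera reaches a crowded goal area
]
print(detect(attack))
```

Encoding each highlight as its own transition table keeps the cue extraction shared across sports, while only the table changes from one highlight or discipline to another.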

Cite this article

Bertini, M., Bimbo, A.D. & Nunziati, W. Common Visual Cues for Sports Highlights Modeling. Multimed Tools Appl 27, 215–218 (2005). https://doi.org/10.1007/s11042-005-2575-1

  • DOI: https://doi.org/10.1007/s11042-005-2575-1
