Multimedia Tools and Applications

, Volume 27, Issue 2, pp 215–218 | Cite as

Common Visual Cues for Sports Highlights Modeling

  • M. Bertini
  • A. Del Bimbo
  • W. Nunziati


Automatic annotation of semantic events allows effective retrieval of video content. In this work, we present solutions for highlights detection in sports videos. The proposed approach exploits the typical structure of a wide class of sports videos, namely those related to sports which are played in delimited venues with playfields of well known geometry, like soccer, basketball, swimming, track and field disciplines, and so on. For these sports, a modeling scheme based on a limited set of visual cues and on finite state machines that encode the temporal evolution of highlights is presented, that is of general applicability to this class of sports. Visual cues encode position and speed information coming from the camera and from the object/athletes that are present in the scene, and are estimated automatically from the video stream. Algorithms for model checking and for visual cues estimation are discussed, as well as applications of the representation to different sport domains.


semantic annotation highlights detection sport videos visual cues 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    E. Andre, G. Herzog, and T. Rist, “On the simultaneous interpretation of real-world image sequences and their natural language description: The system SOCCER,” in Proc. 8th European Conference on Artificial Intelligence (ECAI’88), 1988, pp. 449–454.Google Scholar
  2. 2.
    J. Assfalg, M. Bertini, A. Del Bimbo, W. Nunziati, and P. Pala, “Soccer highlights detection and recognition using HMMs,” in Proc. of Int’l Conf. on Multimedia and Expo (ICME2002), Switzerland, 2002.Google Scholar
  3. 3.
    G. Baldi, C. Colombo, and A. Del Bimbo, “A compact and retrieval-oriented video representation using mosaics,” in Proc. 3rd International Conference on Visual Information Systems VISual99, LNCS 1999, Springer: Amsterdam, The Netherlands, June 1999, pp. 171–178.Google Scholar
  4. 4.
    Y. Bengio, “Markovian models for sequential data,” Neural Comp. Surveys, Vol. 2, pp. 129–162, 1998.Google Scholar
  5. 5.
    M. Brand, N. Oliver, and A. Pentland, “Coupled hidden Markov models for complex action recognition,” in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, San Juan, Puerto Rico, 1997.Google Scholar
  6. 6.
    A. Ekin, A. Murat Tekalp, and R. Mehrotra, “Automatic soccer video analysis and summarization,” IEEE Transactions on Image Processing, Vol. 12, No. 7, pp. 796–807, 2003.CrossRefGoogle Scholar
  7. 7.
    S.S. Intille and A.F. Bobick, “Recognizing planned, multi-person action,” Computer Vision and Image Understanding (1077–3142), Vol. 81, No. 3, pp. 414–445, 2001.CrossRefGoogle Scholar
  8. 8.
    M.I. Jordan, Learning in Graphical Models, MIT Press, Cambrigde, 1999.Google Scholar
  9. 9.
    R. Leonardi and P. Migliorati, “Semantic indexing of multimedia documents,” IEEE MultiMedia, Vol. 9, No. 2, 2002, pp. 44–51.CrossRefGoogle Scholar
  10. 10.
    M. Mottaleb and G. Ravitz, “Detection of plays and breaks in football games using audiovisual features and HMM,” in Proc. of Ninth Int’l Conf. on Distributed Multimedia Systems, Sept. 2003, pp. 154–160.Google Scholar
  11. 11.
    V. Pavlovic, R. Sharma, and T. Huang, “Visual interpretation of hand gestures for human-computer interaction: A review,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, No. 7, 1997.Google Scholar
  12. 12.
    L.R. Rabiner, “A tutorial on HMM and selected applications in speech recognition,” in Proc. IEEE, Vol. 77, No. 2, pp. 257–286, 1989.CrossRefGoogle Scholar
  13. 13.
    S. Russell and P. Norvig, Artificial Intelligence: A Modern Approach, Prentice Hall, Englewood Cliffs, NJ, 1995.Google Scholar
  14. 14.
    G. Sudhir, J.C.M. Lee, and A.K. Jain, “Automatic classification of tennis video for high-level content-based retrieval,” in Proc. Int’l Workshop on Content-Based Access of Image and Video Databases CAIVD’98, 1998, pp. 81–90.Google Scholar
  15. 15.
    V. Tovinkere and R.J. Qian, “Detecting semantic events in soccer games: Towards a complete solution,” in Proc. Int’l Conf. on Multimedia and Expo ICME 2001, 2001, pp. 1040–1043.Google Scholar
  16. 16.
    L. Xie, P. Xu, S.-F. Chang, A. Divakaran, and H. Sun, “Structure analysis of soccer video with domain knowledge and hidden Markov models,” in Proc. IEEE Int’l Conference on Acoustics, Speech, and Signal Processing (ICASSP’02), May 2002, pp. 4096–4099.Google Scholar
  17. 17.
    W. Zhou, A. Vellaikal, and C.C.J. Kuo, “Rule-based video classification system for basketball video indexing,” in Proc. ACM Multimedia 2000 Workshop, 2000, pp. 213–216.Google Scholar

Copyright information

© Springer Science + Business Media, Inc. 2005

Authors and Affiliations

  • M. Bertini
  • A. Del Bimbo
  • W. Nunziati

There are no affiliations available

Personalised recommendations