Structuring Soccer Video Based on Audio Classification and Segmentation Using Hidden Markov Model

Chen, Jianyun; Li, Yunhao; Lao, Songyang; Wu, Lingda; Bai, Liang

doi:10.1007/978-3-540-27814-6_15

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3115))

Included in the following conference series:

International Conference on Image and Video Retrieval

Abstract

This paper presents a novel scheme for indexing and segmenting video by analyzing its audio track with Hidden Markov Models (HMMs), and applies this analysis to structuring soccer video. Based on the attributes of soccer video, we define three audio classes: Game-audio, Advertisement-audio and Studio-audio. For each audio class, an HMM is built using the clip-based 26-coefficient feature stream as the observation symbols. The Maximum Likelihood method is then applied to classify test data using the trained models. Because the audio type is highly unlikely to change abruptly, we also apply smoothing rules in the final segmentation of an audio sequence. Experimental results indicate that our framework can produce satisfactory results.
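
The classification and smoothing steps described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes discrete HMMs over a toy alphabet of three symbols (the paper's 26-coefficient features would first have to be quantized or modeled with continuous densities), the model parameters in `MODELS` are invented for illustration, and the single smoothing rule shown (relabel an isolated clip whose two neighbors agree) is a hypothetical stand-in for the paper's rules, which the abstract does not specify.

```python
import math

def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | model) for a discrete HMM."""
    n = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    log_p = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the transition matrix, then weight by emission.
            alpha = [sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs[t]]
                     for j in range(n)]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")  # sequence is impossible under this model
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
        log_p += math.log(scale)
    return log_p

def classify(obs, models):
    """Maximum-likelihood decision: pick the class whose HMM scores obs highest."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))

def smooth(labels):
    """Hypothetical rule: an isolated clip between two agreeing neighbors is relabeled."""
    out = list(labels)
    for k in range(1, len(out) - 1):
        if out[k - 1] == out[k + 1] != out[k]:
            out[k] = out[k - 1]
    return out

# Invented toy parameters (start, transition, emission); NOT taken from the paper.
START = [0.5, 0.5]
TRANS = [[0.9, 0.1], [0.1, 0.9]]
MODELS = {
    "Game-audio": (START, TRANS, [[0.8, 0.1, 0.1], [0.7, 0.2, 0.1]]),
    "Advertisement-audio": (START, TRANS, [[0.1, 0.8, 0.1], [0.2, 0.7, 0.1]]),
    "Studio-audio": (START, TRANS, [[0.1, 0.1, 0.8], [0.1, 0.2, 0.7]]),
}

if __name__ == "__main__":
    clips = [[0, 0, 1, 0], [1, 1, 1, 2], [0, 0, 0, 0], [2, 2, 1, 2]]
    labels = [classify(c, MODELS) for c in clips]
    print(labels)
    print(smooth(labels))
```

In this sketch each clip is scored independently against all three class models, and smoothing runs afterwards over the resulting label sequence, so a single misclassified clip inside a long run of one class does not create a spurious segment boundary.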

Author information

Authors and Affiliations

Multimedia Research & Development Center, National University of Defense and Technology, ChangSha, 410073, P.R.China
Jianyun Chen, Yunhao Li, Songyang Lao, Lingda Wu & Liang Bai

Editor information

Editors and Affiliations

School of Computing, Mathematical and Information Sciences, University of Brighton, UK
Peter Enser
Informatics and Telematics Institute, Centre for Research and Technology-Hellas, 57001, Thessaloniki, Greece
Yiannis Kompatsiaris
Centre for Digital Video Processing, Adaptive Information Cluster, Dublin City University, Ireland
Noel E. O’Connor
Dublin City University, Dublin, Ireland
Alan F. Smeaton
ISLA lab, Informatics Institute, University of Amsterdam, The Netherlands
Arnold W. M. Smeulders

About this paper

Cite this paper

Chen, J., Li, Y., Lao, S., Wu, L., Bai, L. (2004). Structuring Soccer Video Based on Audio Classification and Segmentation Using Hidden Markov Model. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_15

DOI: https://doi.org/10.1007/978-3-540-27814-6_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22539-3
Online ISBN: 978-3-540-27814-6
eBook Packages: Springer Book Archive
