Abstract
This paper presents a novel scheme for indexing and segmenting video by analyzing the audio track with Hidden Markov Models (HMMs). The analysis is then applied to structuring soccer video. Based on the attributes of soccer video, we define three audio classes: Game-audio, Advertisement-audio, and Studio-audio. For each audio class, an HMM is built using the clip-based 26-coefficient feature stream as the observation symbols. The Maximum Likelihood method is then applied to classify test data using the trained models. Because the audio type is highly unlikely to change abruptly, we also apply smoothing rules in the final segmentation of an audio sequence. Experimental results indicate that the framework produces satisfactory results.
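The classification and smoothing steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each clip's 26-coefficient feature vectors have already been quantized into discrete symbols, uses hypothetical toy parameters in `MODELS`, and uses one simple smoothing rule (relabel a single clip that disagrees with two matching neighbours), since the abstract does not spell out the actual rules.

```python
import math

def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | discrete HMM)."""
    n = len(start)
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    log_p = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate through the transitions, then weight by the emission.
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][obs[t]]
                for j in range(n)
            ]
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")
        log_p += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
    return log_p

def classify(obs, models):
    """Maximum likelihood decision: pick the class whose HMM scores highest."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))

def smooth(labels):
    """Assumed rule: an isolated label between two equal neighbours is relabeled."""
    out = list(labels)
    for k in range(1, len(out) - 1):
        if out[k - 1] == out[k + 1] != out[k]:
            out[k] = out[k - 1]
    return out

# Hypothetical toy models: (start, transition, emission) with 2 states, 3 symbols.
TRANS = [[0.9, 0.1], [0.1, 0.9]]
MODELS = {
    "game": ([0.5, 0.5], TRANS, [[0.8, 0.1, 0.1], [0.7, 0.2, 0.1]]),
    "advertisement": ([0.5, 0.5], TRANS, [[0.1, 0.8, 0.1], [0.2, 0.7, 0.1]]),
    "studio": ([0.5, 0.5], TRANS, [[0.1, 0.1, 0.8], [0.1, 0.2, 0.7]]),
}

# Each inner list is one clip's (hypothetical) quantized symbol sequence.
clips = [[0, 0, 0, 0], [1, 1, 0, 1], [0, 0, 1, 0], [2, 2, 2, 2]]
raw_labels = [classify(clip, MODELS) for clip in clips]
segments = smooth(raw_labels)
```

Here the second clip is first classified as advertisement, then relabeled as game because both of its neighbours are game clips. A real system would more likely use continuous-density HMMs trained on the feature vectors directly, with smoothing rules tuned to the broadcast material.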
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, J., Li, Y., Lao, S., Wu, L., Bai, L. (2004). Structuring Soccer Video Based on Audio Classification and Segmentation Using Hidden Markov Model. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_15
DOI: https://doi.org/10.1007/978-3-540-27814-6_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22539-3
Online ISBN: 978-3-540-27814-6
eBook Packages: Springer Book Archive