Summarizing Video: Content, Features, and HMM Topologies

Yaşaroğlu, Yağiz; Alatan, A. Aydın

doi:10.1007/978-3-540-39798-4_15

Yağiz Yaşaroğlu^6,7 &
A. Aydın Alatan^6,7

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2849))

Included in the following conference series:

International Workshop on Visual Content Processing and Representation

396 Accesses
2 Citations

Abstract

An algorithm is proposed for automatic summarization of multimedia content by segmenting digital video into semantic scenes using HMMs. Various multi-modal low-level features are extracted to determine state transitions in HMMs for summarization. Advantage of using different model topologies and observation sets in order to segment different content types is emphasized and verified by simulations. Performance of the proposed algorithm is also compared with a deterministic scene segmentation method. A better performance is observed due to the flexibility of HMMs in modeling different content types.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Rui, Y., Huang, T.S., Mehrotra, S.: Constructing Table-of-Content for Videos. Multimedia Systems, Special section on Video Libraries 7, 359–368 (1999)
Google Scholar
Chen, L., Özsu, T.: Rule-Based Scene Extraction from Video. In: Proc. of ICIP 2002, vol. 2, pp. 737–740 (2002)
Google Scholar
Xie, L., Chang, S.-F., Divakaran, A., Sun, H.: Structure Analysis of Soccer Video With Hidden Markov Models. In: Proceedings of ICASSP 2002, vol. 4, pp. 1096–1099 (2002)
Google Scholar
Chang, P., Han, M., Gong, Y.: Extract Highlights from Baseball Game Video With Hidden Markov Models. In: Proc. of ICIP 2002, vol. 1, pp. 609–612 (2002)
Google Scholar
Liu, T., Kender, J.R.: A HMM Approach to the Structure of Documentaries. In: CBAIVL 2000, pp. 111–115 (2000)
Google Scholar
Alatan, A.A., Akansu, A.N., Wolf, W.: Multi-Modal Dialogue Scene Detection using Hidden Markov Models for Content-based Multimedia Indexing. Int. Journal on Multimedia Tools and Applications. Kluwer Ac. (2001)
Google Scholar
Jasinschi, R.S., et al.: Video Scouting: An Architecture and System for the Integration of Multimedia Information. In: Proc. of ICASSP 2001, vol. 3, pp. 1405–1408 (2001)
Google Scholar
Rabiner, L.R., Juang, B.-H.: Fundamentals of Speech Recognition. Prentice Hall, Englewood (1993)
Google Scholar
Wolf, W.: Hidden Markov Model Parsing of Video Programs. In: Proc. of ICASSP 1997, pp. 2609–2611 (1997)
Google Scholar
Chu, S.M., Huang, T.S.: Audio-Visual Speech Modeling Using Coupled Hidden Markov Models. In: Proceedings of ICIP 2002, vol. 2, pp. 2009–2012 (2002)
Google Scholar
Yang, M.-H., Kreigman, D.J., Ahuja, N.: Detecting Faces in Images. IEEE Trans. On PAMI 24, 34–58 (2002)
Google Scholar
Saraceno, C., Leonardi, R.: Identification of Story Units in Audio-Visual Sequences by Joint Audio and Video Processing. In: Proc. of ICIP 1998, pp. 363–367 (1998)
Google Scholar
Peker, K.A., Divakaran, A., Papathomas, T.V.: Automatic Measurement of Intensity of Motion Activity of Video Segments. In: SPIE Conference on Storage and Retrieval for Media Databases, vol. 4315, pp. 341–351 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical and Electronics Engineering, M.E.T.U.,
Yağiz Yaşaroğlu & A. Aydın Alatan
Balgat, TÜBITAK BILTEN, 06531, Ankara, Turkey
Yağiz Yaşaroğlu & A. Aydın Alatan

Authors

Yağiz Yaşaroğlu
View author publications
You can also search for this author in PubMed Google Scholar
A. Aydın Alatan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Grupo de Tratamiento de Imágenes, Universidad Politécnica de Madrid, 28040, Madrid, Spain
Narciso García & Luis Salgado &
Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049, Madrid, Spain
José M. Martínez

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yaşaroğlu, Y., Alatan, A.A. (2003). Summarizing Video: Content, Features, and HMM Topologies. In: García, N., Salgado, L., Martínez, J.M. (eds) Visual Content Processing and Representation. VLBV 2003. Lecture Notes in Computer Science, vol 2849. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39798-4_15

Download citation

DOI: https://doi.org/10.1007/978-3-540-39798-4_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20081-9
Online ISBN: 978-3-540-39798-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics