A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video

Márquez, Daniel; Bescós, Jesús

doi:10.1007/978-3-540-77051-0_9

Daniel Márquez¹ &
Jesús Bescós¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4816))

Included in the following conference series:

International Conference on Semantic and Digital Media Technologies

541 Accesses
3 Citations

Abstract

We here describe a method for caption extraction that totally works in the MPEG compressed domain. As opposed to other compressed domain methods; it does not need to refine their results in the pixel domain. It consists of two phases: first, a selection of candidate frames with captions, based on a rigorous statistical design of an AC coefficients mask; second, an extraction of caption boxes from the pre-selected set of candidate frames. Caption extraction relies on a model-based approach to obtaining the caption mask, robust enough to avoid the use of any subsequent refinement.

Work partially supported by the European Commission under its 6^th Framework Programme (FP6-027685 - MESH Project) and by Spanish Institutions under projects TIN2004-07860-C02-01 and S-0505-TIC-0223.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lim, Y.K., Choi, S.H., Lee, S.W.: Text Extraction in MPEG Compressed Video for Content-based Indexing. In: Proc. ICPR (2000)
Google Scholar
Crandall, D., Kasturi, R.: Robust Detection of Stylized Text-Events on Digital Video. In: Proc. 6th Int. Conf. on Document Analysis and Recognition (2001)
Google Scholar
Zhong, Y., Zhang, H., Jain, A.K.: Automatic Caption Localization in Compressed Video. IEEE Transactions on PAMI (2000)
Google Scholar
Chen, D.Y., Hsiao, M.H., Suh-Yin, L.: Automatic Closed Caption Detection and Filtering in MPEG Vídeos for Vídeo Structuring. Journal of Information Science and Engineering 22(5) (2006)
Google Scholar
Zhang, Y., Chua, T.: Detection of Text Caption in Compressed Domain Vídeo. In: Proc. ACM Workshop on Multimedia (2000)
Google Scholar
Chun, S., Kim, H., Kim, J.R., Oh, S., Sull, S.: Fast Text Caption Localization on Vídeo Using Visual Rythm. In: Proc. 5th Intl. Conf. on Recent Advances in Visual Information Systems (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Grupo de Tratamiento de Imágenes, Escuela Politécnica Superior, Universidad Autónoma de Madrid, E-28049 Madrid, Spain
Daniel Márquez & Jesús Bescós

Authors

Daniel Márquez
View author publications
You can also search for this author in PubMed Google Scholar
Jesús Bescós
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Bianca Falcidieno Michela Spagnuolo Yannis Avrithis Ioannis Kompatsiaris Paul Buitelaar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Márquez, D., Bescós, J. (2007). A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video. In: Falcidieno, B., Spagnuolo, M., Avrithis, Y., Kompatsiaris, I., Buitelaar, P. (eds) Semantic Multimedia. SAMT 2007. Lecture Notes in Computer Science, vol 4816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77051-0_9

Download citation

DOI: https://doi.org/10.1007/978-3-540-77051-0_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77033-6
Online ISBN: 978-3-540-77051-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics