Abstract
We describe a caption extraction method that works entirely in the MPEG compressed domain. Unlike other compressed-domain methods, it does not need to refine its results in the pixel domain. It consists of two phases: first, the selection of candidate frames containing captions, based on a rigorous statistical design of an AC coefficient mask; second, the extraction of caption boxes from the pre-selected set of candidate frames. Caption extraction relies on a model-based approach to obtaining the caption mask that is robust enough to avoid any subsequent refinement.
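The two phases can be illustrated with a minimal sketch. It is not the authors' design: the abstract does not give the mask, the thresholds, or the caption model, so the mask weights, `energy_thresh`, `min_fraction`, and all function names below are hypothetical, and a plain bounding box stands in for the model-based caption mask. The input is assumed to be an array of already-decoded 8x8 DCT blocks for one frame.

```python
import numpy as np


def make_ac_mask():
    """Hypothetical 8x8 weight mask over DCT coefficients.

    Assumption: text produces strong low/mid-frequency horizontal and
    vertical AC terms. The paper derives its mask statistically; these
    weights are placeholders for illustration only.
    """
    mask = np.zeros((8, 8))
    mask[0, 1:5] = 1.0  # horizontal-frequency AC coefficients
    mask[1:5, 0] = 1.0  # vertical-frequency AC coefficients
    return mask         # mask[0, 0] (the DC term) stays 0


def block_text_energy(dct_blocks, mask):
    """Masked AC energy per block.

    dct_blocks has shape (rows, cols, 8, 8); the result has shape (rows, cols).
    """
    return np.sum(np.abs(dct_blocks) * mask, axis=(2, 3))


def is_candidate_frame(dct_blocks, mask, energy_thresh=100.0, min_fraction=0.05):
    """Phase 1 (sketch): flag high-energy blocks and test their share of the frame."""
    flagged = block_text_energy(dct_blocks, mask) > energy_thresh
    return bool(flagged.mean() >= min_fraction), flagged


def caption_box(flagged):
    """Phase 2 (stand-in): bounding box of flagged blocks, in block units.

    Returns (top, left, bottom, right), or None if no block is flagged.
    """
    rows, cols = np.nonzero(flagged)
    if rows.size == 0:
        return None
    return (int(rows.min()), int(cols.min()), int(rows.max()), int(cols.max()))


# Synthetic frame of 6 x 10 blocks with a caption-like strip in block row 4.
frame = np.zeros((6, 10, 8, 8))
frame[4, 2:7, 0, 1] = 200.0
is_candidate, flagged = is_candidate_frame(frame, make_ac_mask())
box = caption_box(flagged) if is_candidate else None
```

Everything here operates on DCT coefficients only, which mirrors the claim that no pixel-domain refinement is needed; a real implementation would read the coefficients from the MPEG bitstream rather than from a synthetic array.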
Work partially supported by the European Commission under its 6th Framework Programme (FP6-027685 - MESH Project) and by Spanish Institutions under projects TIN2004-07860-C02-01 and S-0505-TIC-0223.
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Márquez, D., Bescós, J. (2007). A Model-Based Iterative Method for Caption Extraction in Compressed MPEG Video. In: Falcidieno, B., Spagnuolo, M., Avrithis, Y., Kompatsiaris, I., Buitelaar, P. (eds) Semantic Multimedia. SAMT 2007. Lecture Notes in Computer Science, vol 4816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77051-0_9
DOI: https://doi.org/10.1007/978-3-540-77051-0_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77033-6
Online ISBN: 978-3-540-77051-0
eBook Packages: Computer Science, Computer Science (R0)