A New Video Images Text Localization Approach Based on a Fast Hough Transform

Bouaziz, Bassem; Mahdi, Walid; Ben Hamadou, Abdelmajid

doi:10.1007/11867586_39

Bassem Bouaziz¹⁸,
Walid Mahdi¹⁸ &
Abdelmajid Ben Hamadou¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4141))

Included in the following conference series:

International Conference Image Analysis and Recognition

1470 Accesses
2 Citations

Abstract

The amount of digital video data is increasing over the world. It highlights the need for efficient algorithms that can retrieve this data by content. The full use of this media is currently limited by the opaque nature of the video which prevents content-based access. To facilitate video indexing and browsing, it is essential to allow non-linear access, especially for long programs. This can be achieved by identifying semantic description captured automatically from video story structure. Among these descriptions, text within video frames is considered as rich features that enable a good way for browsing video. In this paper we propose a fast Hough transformation based approach for automatic video frames text localization. Experimental results, that we drove on a large sample of video images issued from portions of news broadcasts, sports program, advertisements and movies, shows that our method is very efficient, capable to locate text regions with different character sizes, directions and styles even in case of complex image background.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bouaziz, B., Mahdi, W., Ardabilian, M., Ben Hamadou, A.: A new approach for texture features extraction : application for text localization in video images. In: Proceedings of the IEEE International Conference on Multimedia and Exposition, ICME 2006, Ontario, Canada (July 2006)
Google Scholar
Aigrain, P., Jolyet, P., Longueville, V.: Representation based user interfaces for the Audiovisual Library of Year 2000. In: Proceedings of IS&T/SPIE Conference on Multimedia Computing and Networking, pp. 35–45 (1995)
Google Scholar
Ardebilian, M., Tu, X.W., Chen, L.: Improvement of Shot Detection Methods Based-on Dynamic Threshold Selection. In: Proc. SPIE: Multimedia Strage and Archiving Systems II, Dallas, USA (1997)
Google Scholar
Jung, K., Han, H.: Hybrid approach to efficient text extraction in complex color images. Pattern Recognition Letters 25(6), 679–699 (2004)
Article Google Scholar
Swain, M., Ballard, D.: Color Indexing. International Journal of Computer Vision 7(1), 11–32 (1991)
Article Google Scholar
Mahdi, W., Ardebilian, M., Chen, L.: Automatic Video Content Parsing Based on Exterior and Interior Shots Classification. In: The seventh Iternational conference on Advanced Computer Systems, ACS 2000, Szczecin, Poland, October 23-25, pp. 571–578 (2000), ISBN 83-87352-24-7
Google Scholar
Mahdi, W., Ardebilian, M., Chen, L.: Automatic Video Scene Segmentation based on Spatial-temporal Clues and Rhythm. International Journal of Networking and Information Sytsems 3(1), 27–51 (2000), http://www.hermes-journals.com
Google Scholar
Lyu, Song, J., Cai, M.: A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Transactions on Circuits and Systems for Video Technology 15(2), 243–255 (2005)
Article Google Scholar
Marc, D.: Media stream: Representing Video for Retrieval and Repurposing. In: ACM Multimedia proceeding, Sans Fransisco, CA, USA, October 15-20, pp. 478–479 (1994)
Google Scholar
Jain, A.K., Yu, B.: Automatic Text Location In Images and Videos Frames. Pattern Recognation 31(12), 2055–2076 (1998)
Article Google Scholar
Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognation 28(10), 1523–1535 (1995)
Article Google Scholar
Li, H., Doermann, D., Kia, O.: Automatic Text Detection and Tracking in Digital Video. IEEE Trans, Image Processing 9(1) (January 2000)
Google Scholar
Zhong, Y., Zhang, H., Jain, A.K.: Automatic Caption Extraction of Digital Video. In: Proc. ICIP 1999, Kobe (1999)
Google Scholar
Sato, T., Kanade, T.: Video OCR: Indexing digital news livrairies by recognation of superimposed caption. In: ICCV Workshop on Image and Video retrieval (1998)
Google Scholar
Wu, V., Manmatha, R., Riseman, E.M.: Finding text in images. In: Proc. of the 2nd Intl. Conf. on Digital Libraries, Philadalphia, PA, pp. 1–10 (July 1997)
Google Scholar
Sobottka, K., Bunke, H., Kronenberg, H.: Identification of Text on Colored Book and Journal Covers. In: Proc. of the 5th Intl. Conf. on Document Analysis and Recoginzation, pp. 57–62 (1999)
Google Scholar
Sato, T., Kanade, T., Hughes, E., Smith, M.: Video OCR for digital News Archives. In: IEEE International Workshop on Content-Based Access of Images and Video Databases, pp. 52–60 (January 1998)
Google Scholar
Qi, W., et al.: Integrating Visual, Audio and Text Analysis for news Video. In: 7th IEEE International Conference on Image Processing (ICIP 2000), Vancouver, British Columbia, Canada, September 10-13 (2000)
Google Scholar
Hao, Y., Zhang, Y., Zeng-guang, H., Min, T.: Automatic Text Detection In Video Frames Based on Bootstrap Artificial Neural Network And CED. Journal of WSCG 11(1) (2003), ISSN 1214-6972, WSG 2003, Plzen, Czech Republic. Copyright UNION Agency – Science Press (February 3-7, 2003)
Google Scholar
Wolf, C., Jolion, J.M., Chassaing, F.: Text Localization, Enhancement and Binarization in Multimedia Documents. In: Proceedings of the International Conference on Pattern Recognition (ICPR) 2002, Quebec City, Canada, August 11–15, vol. 4, pp. 1037–1040. IEEE Computer Society, Los Alamitos (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Higher Institute of Computer Science and Multimedia, MIRACL, Multimedia InfoRmation system and Advanced Computing Laboratory, Sfax, BP 1030, 69042, Tunisia
Bassem Bouaziz, Walid Mahdi & Abdelmajid Ben Hamadou

Authors

Bassem Bouaziz
View author publications
You can also search for this author in PubMed Google Scholar
Walid Mahdi
View author publications
You can also search for this author in PubMed Google Scholar
Abdelmajid Ben Hamadou
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departamento de Engenharia Electrotécnica e de Computadores da, Faculdade de Engenharia da, Universidade do Porto, Campus FEUP, 4200-465, Porto, Portugal
Aurélio Campilho
Electrical and Computer Engineering Department, University of Waterloo,
Mohamed S. Kamel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bouaziz, B., Mahdi, W., Ben Hamadou, A. (2006). A New Video Images Text Localization Approach Based on a Fast Hough Transform. In: Campilho, A., Kamel, M.S. (eds) Image Analysis and Recognition. ICIAR 2006. Lecture Notes in Computer Science, vol 4141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11867586_39

Download citation

DOI: https://doi.org/10.1007/11867586_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44891-4
Online ISBN: 978-3-540-44893-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics