Skip to main content

A New Video Images Text Localization Approach Based on a Fast Hough Transform

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4141))

Abstract

The amount of digital video data is increasing over the world. It highlights the need for efficient algorithms that can retrieve this data by content. The full use of this media is currently limited by the opaque nature of the video which prevents content-based access. To facilitate video indexing and browsing, it is essential to allow non-linear access, especially for long programs. This can be achieved by identifying semantic description captured automatically from video story structure. Among these descriptions, text within video frames is considered as rich features that enable a good way for browsing video. In this paper we propose a fast Hough transformation based approach for automatic video frames text localization. Experimental results, that we drove on a large sample of video images issued from portions of news broadcasts, sports program, advertisements and movies, shows that our method is very efficient, capable to locate text regions with different character sizes, directions and styles even in case of complex image background.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bouaziz, B., Mahdi, W., Ardabilian, M., Ben Hamadou, A.: A new approach for texture features extraction : application for text localization in video images. In: Proceedings of the IEEE International Conference on Multimedia and Exposition, ICME 2006, Ontario, Canada (July 2006)

    Google Scholar 

  2. Aigrain, P., Jolyet, P., Longueville, V.: Representation based user interfaces for the Audiovisual Library of Year 2000. In: Proceedings of IS&T/SPIE Conference on Multimedia Computing and Networking, pp. 35–45 (1995)

    Google Scholar 

  3. Ardebilian, M., Tu, X.W., Chen, L.: Improvement of Shot Detection Methods Based-on Dynamic Threshold Selection. In: Proc. SPIE: Multimedia Strage and Archiving Systems II, Dallas, USA (1997)

    Google Scholar 

  4. Jung, K., Han, H.: Hybrid approach to efficient text extraction in complex color images. Pattern Recognition Letters 25(6), 679–699 (2004)

    Article  Google Scholar 

  5. Swain, M., Ballard, D.: Color Indexing. International Journal of Computer Vision 7(1), 11–32 (1991)

    Article  Google Scholar 

  6. Mahdi, W., Ardebilian, M., Chen, L.: Automatic Video Content Parsing Based on Exterior and Interior Shots Classification. In: The seventh Iternational conference on Advanced Computer Systems, ACS 2000, Szczecin, Poland, October 23-25, pp. 571–578 (2000), ISBN 83-87352-24-7

    Google Scholar 

  7. Mahdi, W., Ardebilian, M., Chen, L.: Automatic Video Scene Segmentation based on Spatial-temporal Clues and Rhythm. International Journal of Networking and Information Sytsems 3(1), 27–51 (2000), http://www.hermes-journals.com

    Google Scholar 

  8. Lyu, Song, J., Cai, M.: A comprehensive method for multilingual video text detection, localization, and extraction. IEEE Transactions on Circuits and Systems for Video Technology 15(2), 243–255 (2005)

    Article  Google Scholar 

  9. Marc, D.: Media stream: Representing Video for Retrieval and Repurposing. In: ACM Multimedia proceeding, Sans Fransisco, CA, USA, October 15-20, pp. 478–479 (1994)

    Google Scholar 

  10. Jain, A.K., Yu, B.: Automatic Text Location In Images and Videos Frames. Pattern Recognation 31(12), 2055–2076 (1998)

    Article  Google Scholar 

  11. Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognation 28(10), 1523–1535 (1995)

    Article  Google Scholar 

  12. Li, H., Doermann, D., Kia, O.: Automatic Text Detection and Tracking in Digital Video. IEEE Trans, Image Processing 9(1) (January 2000)

    Google Scholar 

  13. Zhong, Y., Zhang, H., Jain, A.K.: Automatic Caption Extraction of Digital Video. In: Proc. ICIP 1999, Kobe (1999)

    Google Scholar 

  14. Sato, T., Kanade, T.: Video OCR: Indexing digital news livrairies by recognation of superimposed caption. In: ICCV Workshop on Image and Video retrieval (1998)

    Google Scholar 

  15. Wu, V., Manmatha, R., Riseman, E.M.: Finding text in images. In: Proc. of the 2nd Intl. Conf. on Digital Libraries, Philadalphia, PA, pp. 1–10 (July 1997)

    Google Scholar 

  16. Sobottka, K., Bunke, H., Kronenberg, H.: Identification of Text on Colored Book and Journal Covers. In: Proc. of the 5th Intl. Conf. on Document Analysis and Recoginzation, pp. 57–62 (1999)

    Google Scholar 

  17. Sato, T., Kanade, T., Hughes, E., Smith, M.: Video OCR for digital News Archives. In: IEEE International Workshop on Content-Based Access of Images and Video Databases, pp. 52–60 (January 1998)

    Google Scholar 

  18. Qi, W., et al.: Integrating Visual, Audio and Text Analysis for news Video. In: 7th IEEE International Conference on Image Processing (ICIP 2000), Vancouver, British Columbia, Canada, September 10-13 (2000)

    Google Scholar 

  19. Hao, Y., Zhang, Y., Zeng-guang, H., Min, T.: Automatic Text Detection In Video Frames Based on Bootstrap Artificial Neural Network And CED. Journal of WSCG 11(1) (2003), ISSN 1214-6972, WSG 2003, Plzen, Czech Republic. Copyright UNION Agency – Science Press (February 3-7, 2003)

    Google Scholar 

  20. Wolf, C., Jolion, J.M., Chassaing, F.: Text Localization, Enhancement and Binarization in Multimedia Documents. In: Proceedings of the International Conference on Pattern Recognition (ICPR) 2002, Quebec City, Canada, August 11–15, vol. 4, pp. 1037–1040. IEEE Computer Society, Los Alamitos (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bouaziz, B., Mahdi, W., Ben Hamadou, A. (2006). A New Video Images Text Localization Approach Based on a Fast Hough Transform. In: Campilho, A., Kamel, M.S. (eds) Image Analysis and Recognition. ICIAR 2006. Lecture Notes in Computer Science, vol 4141. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11867586_39

Download citation

  • DOI: https://doi.org/10.1007/11867586_39

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44891-4

  • Online ISBN: 978-3-540-44893-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics