Skip to main content

A Comprehensive Method for Arabic Video Text Detection, Localization, Extraction and Recognition

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6298))

Abstract

With the rapid growth of the number of TV channels, the internet and online information services, more and more information becomes available and accessible. The digitization enhances preservation of records and makes the access to documents easier. However, when the quantity of documents become important the digitalization is not enough to ensure an efficient access. Indeed, we need to extract semantic information to help users to find what we need quickly. The text included in video sequences is highly needed for indexing and searching system. However, this text is difficult to detect and recognize because of the variability of its size, low resolution characters and the complexity of the backgrounds. To resolve these shortcomings, we propose a two task system: As a first step, we extract the textual information from video sequences and second, we recognize this text. Our system is tested on a diverse database composed of several Arabic news broadcast. The obtained results are encouraging and prove the qualities of our approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mahdi, W., Chen, L., Fontaine, D.: Improving the Spatial-Temporal Clue Based Segmentation by the Use of Rhythm. In: Nikolaou, C., Stephanidis, C. (eds.) ECDL 1998. LNCS, vol. 1513, pp. 169–181. Springer, Heidelberg (1998)

    Google Scholar 

  2. Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern recognition 31(12), 2055–2076 (1998)

    Article  Google Scholar 

  3. Lee, C.M., Kankanhalli, A.: Automatic extraction of characters in complex scene images. International Journal of Pattern Recognition and Artificial Intelligence 9(1), 67–82 (1995)

    Article  Google Scholar 

  4. Lienhart, R., Stuber, F.: Automatic text recognition indigital videos. In: Proceedings of SPIE Image and Video Processing IV, vol. 2666, pp. 180–188 (1996)

    Google Scholar 

  5. Karray, H., Ellouze, M., Alimi, M.A.: Indexing Video Summaries for Quick Video Browsing. In: Computer Communications and Networks 2009, pp. 77–95 (2009)

    Google Scholar 

  6. Wu, V., Manmatha, R., Riseman, E.M.: TextFinder: anautomatic system to detect and recognize text in images. IEEE Trans. PAMI 21, 1224–1229 (1999)

    Google Scholar 

  7. Agnihotri, L., Dimitrova, N.: Text detection for video analysis. In: Workshop on Content Based Access to Image and Video libraries, Conjunction with CVPR, Colorado (1999)

    Google Scholar 

  8. Gao, X., Tang, X.: Automatic news video caption extraction and recognition. In: Proc. Of Intelligent Data Engineering and Automated Learning 2000, pp. 425–430 (2000)

    Google Scholar 

  9. Garcia, C., Apostolidis, X.: Text detection and segmentation in complex color images. In: Proc. of IEEE International Conf. on Acoustics, Speech, and Signal Processing, vol. 4, pp. 2326–2329 (2000)

    Google Scholar 

  10. Agnihotri., L., Dimitrova, N., Soletic, M.: Multi-layered Videotext Extraction Method. In: IEEE International Conference on Multimedia and Expo. (ICME), pp. 213–216 (2002)

    Google Scholar 

  11. Hua, S., Chen, X.-R., et al.: Automatic Location of Text in Video Frames. In: Intl Workshop on Multimedia Information Retrieval (MIR 2001), pp. 24–27 (2001)

    Google Scholar 

  12. Karray, H., Alimi, A.M.: Detection and Extraction of the Text in a video sequence. In: Proc. IEEE 12 International Conference on Electronics, Circuits and Systems 2005 ( ICECS 2005), vol. 2, pp. 474–478 (2005)

    Google Scholar 

  13. Kherallah, M., Karray, H., Ellouze, M., Alimi, A.M.: Toward an Interactive Device for Quick News Story Browsing. In: ICPR 2008, pp. 1–4 (2008)

    Google Scholar 

  14. Ben Halima, M., Karray, H., Alimi, A.M.: Arabic Text Recognition in Video Sequences. In: The 2010 International Conference on Informatics, Cybernetics, and Computer Applications (ICICCA 2010) (July 2010)

    Google Scholar 

  15. Shanbehzadeh, J., Pezashki, H., Sarrafzadeh, A.: Features Extraction from Farsi Hand Written Letters. In: Proceedings of Image and Vision Computing, New Zealand 2007, pp. 35–40 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Halima, M.B., Karray, H., Alimi, A.M. (2010). A Comprehensive Method for Arabic Video Text Detection, Localization, Extraction and Recognition. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6298. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15696-0_60

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15696-0_60

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15695-3

  • Online ISBN: 978-3-642-15696-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics