A Comprehensive Method for Arabic Video Text Detection, Localization, Extraction and Recognition

  • M. Ben Halima
  • H. Karray
  • A. M. Alimi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6298)


With the rapid growth of the number of TV channels, the internet and online information services, more and more information becomes available and accessible. The digitization enhances preservation of records and makes the access to documents easier. However, when the quantity of documents become important the digitalization is not enough to ensure an efficient access. Indeed, we need to extract semantic information to help users to find what we need quickly. The text included in video sequences is highly needed for indexing and searching system. However, this text is difficult to detect and recognize because of the variability of its size, low resolution characters and the complexity of the backgrounds. To resolve these shortcomings, we propose a two task system: As a first step, we extract the textual information from video sequences and second, we recognize this text. Our system is tested on a diverse database composed of several Arabic news broadcast. The obtained results are encouraging and prove the qualities of our approach.


Arabic VideoText Segmentation Extraction Recognition 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Mahdi, W., Chen, L., Fontaine, D.: Improving the Spatial-Temporal Clue Based Segmentation by the Use of Rhythm. In: Nikolaou, C., Stephanidis, C. (eds.) ECDL 1998. LNCS, vol. 1513, pp. 169–181. Springer, Heidelberg (1998)Google Scholar
  2. 2.
    Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern recognition 31(12), 2055–2076 (1998)CrossRefGoogle Scholar
  3. 3.
    Lee, C.M., Kankanhalli, A.: Automatic extraction of characters in complex scene images. International Journal of Pattern Recognition and Artificial Intelligence 9(1), 67–82 (1995)CrossRefGoogle Scholar
  4. 4.
    Lienhart, R., Stuber, F.: Automatic text recognition indigital videos. In: Proceedings of SPIE Image and Video Processing IV, vol. 2666, pp. 180–188 (1996)Google Scholar
  5. 5.
    Karray, H., Ellouze, M., Alimi, M.A.: Indexing Video Summaries for Quick Video Browsing. In: Computer Communications and Networks 2009, pp. 77–95 (2009)Google Scholar
  6. 6.
    Wu, V., Manmatha, R., Riseman, E.M.: TextFinder: anautomatic system to detect and recognize text in images. IEEE Trans. PAMI 21, 1224–1229 (1999)Google Scholar
  7. 7.
    Agnihotri, L., Dimitrova, N.: Text detection for video analysis. In: Workshop on Content Based Access to Image and Video libraries, Conjunction with CVPR, Colorado (1999)Google Scholar
  8. 8.
    Gao, X., Tang, X.: Automatic news video caption extraction and recognition. In: Proc. Of Intelligent Data Engineering and Automated Learning 2000, pp. 425–430 (2000)Google Scholar
  9. 9.
    Garcia, C., Apostolidis, X.: Text detection and segmentation in complex color images. In: Proc. of IEEE International Conf. on Acoustics, Speech, and Signal Processing, vol. 4, pp. 2326–2329 (2000)Google Scholar
  10. 10.
    Agnihotri., L., Dimitrova, N., Soletic, M.: Multi-layered Videotext Extraction Method. In: IEEE International Conference on Multimedia and Expo. (ICME), pp. 213–216 (2002)Google Scholar
  11. 11.
    Hua, S., Chen, X.-R., et al.: Automatic Location of Text in Video Frames. In: Intl Workshop on Multimedia Information Retrieval (MIR 2001), pp. 24–27 (2001)Google Scholar
  12. 12.
    Karray, H., Alimi, A.M.: Detection and Extraction of the Text in a video sequence. In: Proc. IEEE 12 International Conference on Electronics, Circuits and Systems 2005 ( ICECS 2005), vol. 2, pp. 474–478 (2005)Google Scholar
  13. 13.
    Kherallah, M., Karray, H., Ellouze, M., Alimi, A.M.: Toward an Interactive Device for Quick News Story Browsing. In: ICPR 2008, pp. 1–4 (2008)Google Scholar
  14. 14.
    Ben Halima, M., Karray, H., Alimi, A.M.: Arabic Text Recognition in Video Sequences. In: The 2010 International Conference on Informatics, Cybernetics, and Computer Applications (ICICCA 2010) (July 2010)Google Scholar
  15. 15.
    Shanbehzadeh, J., Pezashki, H., Sarrafzadeh, A.: Features Extraction from Farsi Hand Written Letters. In: Proceedings of Image and Vision Computing, New Zealand 2007, pp. 35–40 (2007)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • M. Ben Halima
    • 1
  • H. Karray
    • 1
  • A. M. Alimi
    • 1
  1. 1.REGIM: REsearch Group on Intelligent MachinesUniversity of Sfax, National School of Engineers (ENIS)SfaxTunisia

Personalised recommendations