A Comprehensive Method for Arabic Video Text Detection, Localization, Extraction and Recognition

Halima, M. Ben; Karray, H.; Alimi, A. M.

doi:10.1007/978-3-642-15696-0_60

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 6298)

Abstract

With the rapid growth in the number of TV channels, the internet, and online information services, more and more information is becoming available and accessible. Digitization improves the preservation of records and makes access to documents easier. However, when the quantity of documents becomes large, digitization alone is not enough to ensure efficient access. We need to extract semantic information to help users find what they need quickly. The text embedded in video sequences is highly valuable for indexing and search systems. However, this text is difficult to detect and recognize because of the variability of its size, the low resolution of the characters, and the complexity of the backgrounds. To address these difficulties, we propose a two-stage system: first, we extract the textual information from video sequences; second, we recognize this text. Our system is tested on a diverse database composed of several Arabic news broadcasts. The results obtained are encouraging and demonstrate the merits of our approach.

Author information

Authors and Affiliations

REGIM: REsearch Group on Intelligent Machines, University of Sfax, National School of Engineers (ENIS), BP 1173, Sfax, 3038, Tunisia
M. Ben Halima, H. Karray & A. M. Alimi

Editor information

Editors and Affiliations

School of Computer Science, University of Nottingham, Jubilee Campus, NG8 1BB, Nottingham, UK
Guoping Qiu
The Centre for Multimedia Signal Processing, The Hong Kong Polytechnic University, Hong Kong, China
Kin Man Lam
Faculty of System Design, Tokyo Metropolitan University, 6-6 Asahigaoka, Hino-city, Tokyo, 191-0065
Hitoshi Kiya
Shanghai Key Laboratory of Intelligent Information Processing, Department of Computer Science & Engineering, Fudan University, Shanghai, China
Xiang-Yang Xue
Department of Electrical Engineering, University of Southern California, 90089-2564, Los Angeles, CA
C.-C. Jay Kuo
LIACS Media Lab, Leiden University
Michael S. Lew

About this paper

Cite this paper

Halima, M.B., Karray, H., Alimi, A.M. (2010). A Comprehensive Method for Arabic Video Text Detection, Localization, Extraction and Recognition. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6298. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15696-0_60

DOI: https://doi.org/10.1007/978-3-642-15696-0_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15695-3
Online ISBN: 978-3-642-15696-0
eBook Packages: Computer Science, Computer Science (R0)