Skip to main content

CNN-BLSTM Model for Arabic Text Recognition in Unconstrained Captured Identity Documents

  • Conference paper
  • First Online:
Image Analysis and Processing - ICIAP 2023 Workshops (ICIAP 2023)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14365))

Included in the following conference series:

  • 162 Accesses

Abstract

Optical Character Recognition (OCR) for Arabic text (printed and handwritten) has been widely studied by researchers in the last two decades. Some commercial solutions have emerged with good recognition rates for printed text (on white or uniform backgrounds) or handwritten text with limited vocabulary. In addition to being naturally cursive, the Arabic language comes with additional challenges due to its calligraphy resulting in a variety of fonts and styles. In this work, recent advances in recurrent neural networks are explored for the recognition of Arabic text in identity documents captured in the wild. The unconstrained captures bring additional difficulties as the text has to be first localized before being able to recognize it. Various pre-processing steps are introduced to overcome the difficulties related to the Arabic text itself and also due to the capturing conditions. The presented approach outperforms existing solutions when evaluated using a private dataset and also using the recent MIDV2020 dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 89.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 119.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Bulatovich, B.K., et al. "MIDV-2020: A comprehensive benchmark dataset for identity document analysis". In: 46.2 (2022), pp. 252–270

    Google Scholar 

  2. Al-Hashim A.G., Mahmoud, S.A.: Benchmark database and GUI environment for printed Arabic text recognition research. In: WSEAS Trans. Inf. Sci. Appl. 7.4 (2010), pp. 587–597

    Google Scholar 

  3. Jaiem, F.K., et al. "Database for Arabic printed text recognition research". In: ICIAP. 2013, pp. 251–259

    Google Scholar 

  4. Maalej, R., Tagougui, N., Kherallah, M.:Online Arabic handwriting recognition with dropout applied in deep recurrent neural networks. In: 12th IAPR DAS. IEEE. 2016, pp. 417–421

    Google Scholar 

  5. El-Mahallawy, M. S. M.: A Large scale HMM-based omni front-written OCR system for cursive scripts. In: Ph.D. thesis, Cairo University. 2008

    Google Scholar 

  6. Mahmoud, S. A., et al.: "KHATT: an open Arabic offline handwritten text database". In: Pattern Recognition 47.3 (2014), pp. 1096–1112

    Google Scholar 

  7. Ngoc, M., Fabrizio, J., éraud, T.G.,: Saliency-based detection of identy documents captured by smartphones. In: 13th DAS. 2018, pp. 387–392

    Google Scholar 

  8. Ramdan, J., Omar, K., Faidzul. M.: A Novel method to detect segmentation points of Arabic words using peaks and neural network. In: IJASEIT 7.2 (2017), pp. 625–631

    Google Scholar 

  9. de Sá Soares, A., das Neves Junior, R. B., Bezerra, B.L.D.:BID Dataset: a challenge dataset for document processing tasks. In: 18th Conference on Graphics, Patterns and Images. SBC. 2020, pp. 143–146

    Google Scholar 

  10. Shi, B., Bai, X., Yao. C.: An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition. In: CoRR abs/1507.05717 (2015). arXiv: 1507.05717

  11. Shivakumara, P., et al.:"CNN-RNN based method for license plate recognition". In: CAAI Transactions on Intelligence Technology 3.3 (2018), pp. 169–175

    Google Scholar 

  12. Shu, Y., Xu, Y.: End-to-End Captcha Recognition Using Deep CNNRNN Network. In: IEEE 3rd IMCEC. IEEE. 2019, pp. 54–58

    Google Scholar 

  13. Slimane, F., et al.: A new Arabic printed text image database and evaluation protocols. In: 2009 10th International Conference on Document Analysis and Recognition. IEEE. 2009, pp. 946–950

    Google Scholar 

  14. Suvarnam, B., Ch, V.S.: Combination of CNN-GRU model to recognize characters of a license plate number without segmentation. In: 5th ICACCS. IEEE. 2019, pp. 317–322

    Google Scholar 

  15. Yousef, M., Hussain, K.F., Mohammed, U.S.: Accurate, Data-Efficient, unconstrained text recognition with convolutional neural networks. In: CoRR abs/1812.11894 (2018). arXiv: 1812.11894

  16. Yousfi, S.: Embedded Arabic text detection and recognition in videos. PhD thesis. Université de Lyon, 2016

    Google Scholar 

  17. Zeki, A., Zakaria, M., Liong, C.: The Use of Area-Voronoi diagram in separating Arabic text connected components. In: 3rd ACEA. 2007, pp. 251–288

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nabil Ghanmi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ghanmi, N., Belhakimi, A., Awal, AM. (2024). CNN-BLSTM Model for Arabic Text Recognition in Unconstrained Captured Identity Documents. In: Foresti, G.L., Fusiello, A., Hancock, E. (eds) Image Analysis and Processing - ICIAP 2023 Workshops. ICIAP 2023. Lecture Notes in Computer Science, vol 14365. Springer, Cham. https://doi.org/10.1007/978-3-031-51023-6_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-51023-6_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-51022-9

  • Online ISBN: 978-3-031-51023-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics