Skip to main content

A Robust Method for Text, Line, and Word Segmentation for Historical Arabic Manuscripts

  • Chapter
  • First Online:
Data Analytics for Cultural Heritage

Abstract

The segmentation of old documents is a crucial phase for reading and understanding the content of a document automatically. Also, the extraction of words and phrases in a document needs segmentation of each line and word. But, the variations of text lines directions throughout the same document and overlapping characters between two or more text lines, especially in Arabic manuscripts, are the problems that usually found in such documents. For that, this chapter proposes an approach for text segmentation as well as line and word for historical Arabic manuscripts. First, text segmentation is realized using an encoder-decoder deep model to segment the main text and side text in the image. The model has been trained on two Arabic manuscripts dataset including Bukhari and RASM2018 datasets. Then, the segmentation of lines using a smoothing approach followed by thresholding determined automatically according to the size of handwriting. Then, segmentation of words is provided using smoothed Chamfer distance which takes into consideration the handwriting characteristics. The evaluation of the proposed approach is reported on the QUWI Arabic database and very promising results are achieved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Prusty, A., Aitha, S., Trivedi, A., et al.: Indiscapes: instance segmentation networks for layout parsing of historical Indic manuscripts. arXiv preprint arXiv:1912.07025 (2019)

    Google Scholar 

  2. Bukhari, S.S., Breuel, T.M., Asi, A., El-Sana, J.: Layout analysis for Arabic historical document images using machine learning. In: Proceedings of the 2012 International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 639–644. IEEE, New York (2012)

    Google Scholar 

  3. Clausner, C., Antonacopoulos, A., Mcgregor, N., et al.: ICFHR 2018 Competition on Recognition of Historical Arabic Scientific Manuscripts–RASM2018. In: Proceedings of the 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 471–476. IEEE, New York (2018)

    Google Scholar 

  4. Adam, K., Al-Maadeed, S., Bouridane, A.: Based classification of Arabic scripts style in ancient Arabic manuscripts: preliminary results. In: Proceedings of the 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR), pp. 95–98. IEEE, New York (2017)

    Google Scholar 

  5. Barakat, B.K., El-Sana, J.: Binarization free layout analysis for Arabic historical documents using fully convolutional networks. In: Proceedings of the 2018 IEEE 2nd International Workshop on Arabic and Derived Script Analysis and Recognition (ASAR), pp. 151–155. IEEE, New York (2018)

    Google Scholar 

  6. Prusty, A., Aitha, S., Trivedi, A., et al.: Indiscapes: instance segmentation networks for layout parsing of historical Indic manuscripts (2019). arXiv preprint arXiv:1912.07025

    Google Scholar 

  7. Abuhaiba, I.S.I., Datta, S., Murray J.J.H.: Line extraction and stroke ordering of text pages. In: Proceedings of the Third International Conference on Document Analysis and Recognition, 1995, vol. 1. IEEE, New York (1995)

    Google Scholar 

  8. Adiguzel, H., Emre S., Pinar D.: A Hybrid for Line Segmentation in Handwritten Documents. In: Proceedings of the 2012 International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE, New York (2012)

    Google Scholar 

  9. Ouwayed, N., Belaïd, A. Auger, F.: General text line extraction approach based on locally orientation estimation. In: IS&T/SPIE Electronic Imaging. International Society for Optics and Photonics (2010)

    Google Scholar 

  10. Ghazouani, F., Maddouri, M., Maddouri Snoussi, S., El Abed, H., Volker, M.: Segmentation of handwritten and printed Arabic documents. In: Proceedings of the 2nd Workshop SIDOP, Hammamet, Tunisia (2012)

    Google Scholar 

  11. Khayyat, M., et al.: Arabic handwritten text line extraction by applying an adaptive mask to morphological dilation. In: Proceedings of the 2012 10th IAPR International Workshop on Document Analysis Systems (DAS). IEEE, New York (2012)

    Google Scholar 

  12. Kumar, J., et al.: Handwritten Arabic text line segmentation using affinity propagation. In: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems. ACM, New York (2010)

    Google Scholar 

  13. Kumar, J., et al.: Segmentation of handwritten textlines in presence of touching components. In: Proceedings of the 2011 International Conference on Document Analysis and Recognition (ICDAR). IEEE, New York (2011)

    Google Scholar 

  14. Manohar, V., et al.: Graph clustering-based ensemble method for handwritten text line segmentation. In: Proceedings of the 2011 International Conference on Document Analysis and Recognition (ICDAR). IEEE, New York (2011)

    Google Scholar 

  15. Öztop, E., et al.: Repulsive attractive network for baseline extraction on document images. Signal Process. 75(1), 1–10 (1999)

    Article  Google Scholar 

  16. Saabni, R., El-Sana, J.: Language-independent text lines extraction using seam carving. In: Proceedings of the 2011 International Conference on Document Analysis and Recognition (ICDAR). IEEE, New York (2011)

    Google Scholar 

  17. Shi, Z., Setlur, S., Govindaraju, V.: A steerable directional local profile technique for extraction of handwritten Arabic text lines. In: Proceedings of the 10th International Conference on Document Analysis and Recognition, 2009 (ICDAR’09). IEEE, New York (2009)

    Google Scholar 

  18. AlKhateeb, J.H., et al.: Component-based segmentation of words from handwritten Arabic text. Int. J. Comput. Syst. Sci. Eng. 2(5), 1428–1432 (2008)

    Google Scholar 

  19. Farooq, F., Govindaraju, V., Perrone, M.: Pre-processing methods for handwritten Arabic documents. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, 2005. IEEE, New York (2005)

    Google Scholar 

  20. Srihari, S., et al.: Spotting words in handwritten Arabic documents. In: Proceedings of the International Society for Optics and Photonics, Electronic Imaging 2006 (2006)

    Google Scholar 

  21. Brook, S., Al Aghbari, Z.: Holistic approach for classifying and retrieving personal Arabic handwritten documents. In: Proceedings of the 7th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases. World Scientific and Engineering Academy and Society (WSEAS) (2008)

    Google Scholar 

  22. Parvez, M.T., Mahmoud, S.A.: Arabic handwriting recognition using structural and syntactic pattern attributes. Pattern Recognit. 46(1), 141–154 (2013)

    Article  Google Scholar 

  23. Al-Maadeed, S., Issawi, F., Bouridan, A.: Word retrieval system for Ancient Arabic manuscripts. In Proceedings of the 2017 9th IEEE-GCC Conference and Exhibition (GCCCE). IEEE, New York (2017)

    Google Scholar 

  24. Jan-Mark Geusebroek, R. van den Boomgaard, A.W.M. Smeulders, A.: Dev, Color and scale: the spatial structure of color images. In: Proceeding of the 6th European Conference on Computer Vision, vol. 1, pp. 331–341 (2000)

    Google Scholar 

  25. Elharrouss, O., Moujahid, D., Tairi, H.: Motion detection based on the combining of the background subtraction and the structure–texture decomposition. Opt.-Int. J. Light Electron Opt. 126(24), 5992–5997 (2015)

    Article  Google Scholar 

  26. Elharrouss, O., Moujahid, D., Elkah, S., Tairi, H.: Moving object detection using a background modeling based on entropy theory and quad-tree decomposition. J. Electron. Imaging 25(6), 061615 (2016)

    Article  Google Scholar 

  27. Elharrouss, O., Abbad, A., Moujahid, D., Riffi, J., Tairi, H.: A block-based background model for moving object detection. ELCVIA: Electron. Lett. Comput. Vision Image Anal. 15(3), 0017–31 (2016)

    Article  Google Scholar 

  28. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, USA, pp. 3431–3440 (2015)

    Google Scholar 

  29. Badrinarayanan, V., Kendall, A., Cipolla, R.: SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)

    Article  Google Scholar 

  30. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition (2014). arXiv preprint arXiv:1409.1556

    Google Scholar 

  31. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on Machine Learning, Lille, France, pp. 448–456 (2015)

    Google Scholar 

  32. Bahaoui, Z., El Fadili, H., Zenkouar, K., et al.: Exact Zernike and pseudo-Zernike moments image reconstruction based on circular overlapping blocks and Chamfer distance. Signal, Image and Video Processing, 11(7), 1313–1320 (2017)

    Article  Google Scholar 

  33. Handwritten Arabic Proximity Datasets. Language and Media Processing Laboratory. http://lampsrv02.umiacs.umd.edu/projdb/project.php

  34. Al Maadeed, S., et al.: QUWI: an Arabic and English handwriting dataset for offline writer identification. In: Proceedings of the 2012 International Conference on Frontiers in Handwriting Recognition (ICFHR). IEEE, New York (2012)

    Google Scholar 

Download references

Acknowledgements

This work was made possible by NPRP grant #10-0205-170346 from the Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Elharrouss, O., Al-Maadeed, S., Alja’am, J.M., Hassaine, A. (2021). A Robust Method for Text, Line, and Word Segmentation for Historical Arabic Manuscripts. In: Belhi, A., Bouras, A., Al-Ali, A.K., Sadka, A.H. (eds) Data Analytics for Cultural Heritage. Springer, Cham. https://doi.org/10.1007/978-3-030-66777-1_7

Download citation

Publish with us

Policies and ethics