
Effective technique for the recognition of offline Arabic handwritten words using hidden Markov models

  • Original Paper
  • International Journal on Document Analysis and Recognition (IJDAR)

Abstract

In this paper, we present a novel segmentation-free Arabic handwriting recognition system based on hidden Markov models (HMMs). Two main contributions are introduced: a new technique for dividing the image into nonuniform horizontal segments for feature extraction, and a new technique for handling character skew by fusing multiple HMMs. In addition, two enhancements are introduced: an improved pre-processing method and feature extraction using the concavity space. The proposed system first pre-processes the input image by setting the stroke thickness of the input word to three pixels and fixing the spacing between the different parts of the word. The input image is then divided into a constant number of nonuniform horizontal segments, whose boundaries depend on the distribution of the foreground pixels. A set of robust features representing the gradient of the foreground pixels is extracted using sliding windows. The input image is also decomposed into several images representing its vertical, horizontal, left-diagonal and right-diagonal edges, and a second set of robust features representing the densities of the foreground pixels in these edge images is extracted using sliding windows. The proposed system builds character HMMs and learns word HMMs using embedded training. Besides the vertical sliding window, two slanted sliding windows are used to extract the features, giving three different HMMs: one for the vertical window and two for the slanted windows. A fusion scheme combines the three HMMs. The results are very promising: the proposed system outperforms the other Arabic handwriting recognition systems reported in the literature.
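The abstract does not say how the nonuniform horizontal segments are computed, so the following is only a minimal sketch of one plausible reading. It assumes an equal-mass rule: each boundary is placed at the first row where the cumulative foreground count reaches k/n of the total. This rule, the function names `nonuniform_band_bounds` and `sliding_window_features`, and the toy image are all illustrative assumptions, not the authors' method. The second function computes a plain foreground density per band for each vertical sliding window, as a simple stand-in for the gradient and edge-density features the abstract mentions.

```python
from typing import List

Image = List[List[int]]  # binary image: 1 = foreground (ink), 0 = background


def nonuniform_band_bounds(image: Image, n_bands: int) -> List[int]:
    """Return n_bands + 1 row indices that split the image into horizontal bands.

    Assumed rule (not from the paper): the k-th cut is placed just below the
    first row where the cumulative ink count reaches k / n_bands of the total.
    """
    height = len(image)
    row_counts = [sum(row) for row in image]
    total = sum(row_counts)
    if total == 0:
        # No ink at all: fall back to uniform bands.
        return [k * height // n_bands for k in range(n_bands + 1)]

    bounds = [0]
    cumulative = 0
    k = 1
    for r, count in enumerate(row_counts):
        cumulative += count
        # A dense row may satisfy several quantiles at once (empty bands).
        while k < n_bands and cumulative * n_bands >= k * total:
            bounds.append(r + 1)
            k += 1
    bounds.append(height)
    return bounds


def sliding_window_features(image: Image, bounds: List[int],
                            width: int, shift: int) -> List[List[float]]:
    """Slide a vertical window left to right; one ink density per band."""
    n_cols = len(image[0])
    features = []
    for left in range(0, n_cols - width + 1, shift):
        vector = []
        for top, bottom in zip(bounds, bounds[1:]):
            area = (bottom - top) * width
            ink = sum(sum(image[r][left:left + width])
                      for r in range(top, bottom))
            vector.append(ink / area if area else 0.0)
        features.append(vector)
    return features


# Toy 6x4 "word" image whose ink is concentrated in rows 2 to 4.
word = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [1, 1, 1, 1],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]

bounds = nonuniform_band_bounds(word, n_bands=2)
print(bounds)  # [0, 4, 6]; a uniform split would be [0, 3, 6]
print(sliding_window_features(word, bounds, width=2, shift=2))
# [[0.5, 0.25], [0.5, 0.25]]
```

Under this assumed rule the boundaries follow the ink instead of the image height, so bands are narrow where strokes are dense and wide where the image is mostly background. Each per-window vector would then serve as one observation in the HMM's left-to-right sequence.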




Corresponding author

Correspondence to Hany Ahmed.


Cite this article

Azeem, S.A., Ahmed, H. Effective technique for the recognition of offline Arabic handwritten words using hidden Markov models. IJDAR 16, 399–412 (2013). https://doi.org/10.1007/s10032-013-0201-8


  • DOI: https://doi.org/10.1007/s10032-013-0201-8
