Arabic Handwriting Recognition Using Bernoulli HMMs

  • Chapter
Guide to OCR for Arabic Scripts

Abstract

Hidden Markov models (HMMs) are now widely used for off-line handwriting recognition in many languages and, in particular, in Arabic. As in speech recognition, they are usually built from shared, embedded HMMs at the symbol level, in which state-conditional probability density functions are modeled with Gaussian mixtures. In contrast to speech recognition, however, it is unclear which kinds of features should be used, and very different feature sets are in use today. As one alternative, we have recently proposed simply using columns of raw, binary image pixels, which are fed directly into embedded Bernoulli (mixture) HMMs, that is, embedded HMMs in which the emission probabilities are modeled with Bernoulli mixtures. The idea is to bypass a separate feature extraction step, so that no discriminative information is filtered out before recognition; feature extraction is, in some sense, integrated into the recognition model itself. In this chapter, we review this idea along with some extensions that are currently providing state-of-the-art results on Arabic handwritten word recognition.
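
To make the emission model concrete, the following is a minimal sketch of how a Bernoulli mixture could score a single column of binary pixels. It is an illustration under stated assumptions, not the chapter's implementation: the function name `bernoulli_mixture_prob`, the parameter names, and the toy values are all hypothetical, and pixels within a component are assumed independent.

```python
import math

def bernoulli_mixture_prob(column, priors, prototypes):
    """Probability of one binary pixel column under a Bernoulli mixture.

    column:     list of 0/1 pixel values (one image column)
    priors:     mixture coefficients, one per component, summing to 1
    prototypes: per-component lists of pixel-on probabilities in (0, 1)
    All names here are hypothetical and chosen for illustration only.
    """
    total = 0.0
    for prior, proto in zip(priors, prototypes):
        # Each component is a product of independent Bernoulli terms,
        # accumulated in the log domain to limit underflow on tall columns.
        log_p = 0.0
        for x, p in zip(column, proto):
            log_p += math.log(p) if x == 1 else math.log(1.0 - p)
        total += prior * math.exp(log_p)
    return total

# Toy example: a 3-pixel column scored by a 2-component mixture.
column = [1, 0, 1]
priors = [0.5, 0.5]
prototypes = [[0.9, 0.1, 0.9], [0.1, 0.9, 0.1]]
prob = bernoulli_mixture_prob(column, priors, prototypes)
```

In an embedded HMM, a function of this kind would play the role of the state-conditional emission probability, with one set of priors and prototypes per state.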

Author information

Corresponding author

Correspondence to Ihab Alkhoury.

Copyright information

© 2012 Springer-Verlag London

About this chapter

Cite this chapter

Alkhoury, I., Giménez, A., Juan, A. (2012). Arabic Handwriting Recognition Using Bernoulli HMMs. In: Märgner, V., El Abed, H. (eds) Guide to OCR for Arabic Scripts. Springer, London. https://doi.org/10.1007/978-1-4471-4072-6_10

  • DOI: https://doi.org/10.1007/978-1-4471-4072-6_10

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-4071-9

  • Online ISBN: 978-1-4471-4072-6

  • eBook Packages: Computer Science, Computer Science (R0)
