Skip to main content

A Multi-stage Approach to Arabic Document Analysis

  • Chapter
Guide to OCR for Arabic Scripts

Abstract

We approach the analysis of electronic documents as a multi-stage process, which we implement via a multi-filter document processing framework that provides (a) flexibility for research prototyping, (b) efficiency for development, and (c) reliability for deployment. In the context of this framework, we present our multi-stage solutions to multi-engine Arabic OCR (MEMOE) and Arabic handwriting recognition (AHWR). We also describe our adaptive pre-OCR document image cleanup system called ImageRefiner. Experimental results are reported for all mentioned systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. AbdulKader, A.: A two tier Arabic handwriting recognition based on conditional joining rules. In: Proceedings of the Summit on Arabic and Chinese Handwriting, pp. 121–128 (2006)

    Google Scholar 

  2. Al-Ani, A., Deriche, M.: A new technique for combining multiple classifiers using the Dempster–Shafer theory of evidence. J. Artif. Intell. Res. 17, 333–361 (2002)

    MathSciNet  MATH  Google Scholar 

  3. Alma’adeed, S., Higgins, C., Elliman, D.: Off-line recognition of handwritten Arabic words using multiple Hidden Markov Models. In: Twenty-third SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, vol. 17, pp. 75–79 (2004)

    Google Scholar 

  4. Borovikov, E., Zavorin, I., Turner, M.: A filter based post-OCR accuracy boost system. In: Proceedings of the 1st ACM Workshop on Hardcopy Document Processing (2004)

    Google Scholar 

  5. Cannon, M., Hochberg, J., Kelly, P.: Quality assessment and restoration of typewritten document images. Int. J. Doc. Anal. Recognit. 2(2–3), 80–89 (1999)

    Google Scholar 

  6. Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. Wiley-Interscience, New York (1973)

    MATH  Google Scholar 

  7. El Abed, H., Märgner, V.: Comparison of combination methods of Arabic handwritten word recognizers. In: 5th International Multi-Conference on Systems, Signals and Devices, pp. 1–6 (2008)

    Chapter  Google Scholar 

  8. Essoukri, N., Amara, B., Bouslama, F.: Classification of Arabic script using multiple sources of information: State of the art and perspectives. Int. J. Doc. Anal. Recognit. 5(4), 195–212 (2003)

    Article  Google Scholar 

  9. Farah, N., Souici, L., Sellami, M.: Classifiers combination and syntax analysis for Arabic literal amount recognition. Eng. Appl. Artif. Intell. 19(1), 29–39 (2006)

    Article  Google Scholar 

  10. Govindaraju, V.: Paradigms in handwriting recognition. In: Proceedings of the Summit on Arabic and Chinese Handwriting, pp. 171–175 (2006)

    Google Scholar 

  11. Ho, T.K.: Multiple classifier combination: Lessons and next steps. In: Bunke, H., Kandel, A. (eds.) Hybrid Methods in Pattern Recognition, pp. 171–198. World Scientific, Singapore (2002)

    Chapter  Google Scholar 

  12. http://www.ifnenit.com

  13. Jaeger, S.: Informational classifier fusion. In: Proceedings of the International Conference on Pattern Recognition, vol. I, pp. 216–219 (2004)

    Google Scholar 

  14. Klein, S.T., Kopel, M.: A voting system for automatic OCR correction. In: Proceedings of the SIGIR 2002 Workshop on Information Retrieval and OCR: From Converting Content to Grasping Meaning, University of Tampere, August 2002

    Google Scholar 

  15. Kolak, O., Resnik, P., Byrne, W.: A generative probabilistic OCR model for NLP applications. In: Proceedings of HLT-NAACL, May 2003

    Google Scholar 

  16. Kuncheva, L.: Combining Pattern Classifiers. Methods and Algorithms. Wiley, New York (2004)

    Book  MATH  Google Scholar 

  17. Lee, D.-S.: A theory of classifier combination: the neural network approach. Ph.D. thesis, State University of New York at Buffalo (1995)

    Google Scholar 

  18. Lin, X.: Reliable OCR solution for digital content re-mastering. In: Proceedings of SPIE Conference on Document Recognition and Retrieval IX (2002)

    Google Scholar 

  19. Lin, X.: DRR research beyond COTS OCR software: A survey. Technical Report HPL-2004-167, Imaging Systems Laboratory, HP Laboratories Palo Alto, CA (2004)

    Google Scholar 

  20. Lorigo, L.M.: Arabic handwriting recognition and application to ancient documents. In: Proceedings of the Summit on Arabic and Chinese Handwriting, pp. 111–120 (2006)

    Google Scholar 

  21. Lorigo, L.M., Govindaraju, V.: Off-line Arabic handwriting recognition: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 28(5), 712–724 (2006)

    Article  Google Scholar 

  22. Märgner, V., Pechwitz, M., El-Abed, H.: ICDAR 2005 Arabic handwriting recognition competition. In: Proceedings of the International Conference on Document Analysis and Recognition, Seoul, Korea, pp. 70–74 (2005)

    Google Scholar 

  23. McNamara, J., Casey, D., Smith, R., Bradburn, D.: A classifier for evaluating the effects of image processing on character recognition. In: Image Algebra and Morphological Image Processing, pp. 109–120 (1993)

    Google Scholar 

  24. Nartker, T.A., Rice, S.V., Lumos, S.E.: Software tools and test data for research and testing of page-reading OCR systems. In: SPIE Conference Document Recognition and Retrieval, San Jose, CA, January 2005, vol. 5676, pp. 37–47 (2005)

    Google Scholar 

  25. Niblack, W.: An Introduction to Image Processing. Prentice-Hall, Englewood Cliffs (1996)

    Google Scholar 

  26. Nomoto, T.: Predictive models of performance in multi-engine machine translation. In: MT Summit IX, pp. 269–276 (2003)

    Google Scholar 

  27. Otsu, N.: A threshold selection method for gray level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979)

    Article  Google Scholar 

  28. Parker, J.R.: Algorithms for Image Processing and Computer Vision. Wiley, New York (1996)

    Google Scholar 

  29. Pechwitz, M., Snoussi Maddouri, S., Märgner, V., Ellouze, N., Amiri, H.: IFN/ENIT-database of handwritten Arabic words. In: Proceedings of CIFED, pp. 129–136 (2002)

    Google Scholar 

  30. Sezgin, M., Sankur, B.: Survey over image thresholding techniques and quantitative performance evaluation. J. Electron. Imaging 13, 146–165 (2004)

    Article  Google Scholar 

  31. Snoussi Maddouri, S., Amiri, H., Belaïd, A., Choisy, Ch.: Combination of local and global vision modeling for Arabic handwritten words recognition. In: 8th IWFHR, pp. 128–135 (2002)

    Google Scholar 

  32. Summers, K.: Document image improvement for OCR as a classification problem. In: Document Recognition and Retrieval X, vol. 5010 (2003)

    Google Scholar 

  33. Tong, X., Evans, D.A.: A statistical approach to automatic OCR error correction in context. In: 4th Workshop on Very Large Corpora, Copenhagen, Denmark, August 1996, pp. 88–100 (1996)

    Google Scholar 

  34. Yang, Y., Summers, K., Turner, M.: A text image enhancement system based on segmentation and classification methods. In: Proceedings of the 1st ACM Workshop on Hardcopy Document Processing (2004)

    Google Scholar 

  35. Zavorin, I., Borovikov, E.: Data collection and annotation for Arabic document analysis. In: Märgner, V., El Abed, H. (eds.) Guide to OCR for Arabic Scripts. Springer, Berlin (2012)

    Google Scholar 

  36. Zavorin, I., Borovikov, E., Turner, M.: Initial results in offline Arabic handwriting recognition using large-scale geometric features. In: Proceedings of the Symposium on Document Image Understanding Technology, December 2005

    Google Scholar 

  37. Zavorin, I., Borovikov, E., Turner, M., Hernandez, L.: Adaptive pre-OCR cleanup of grayscale document images. In: Proceedings of SPIE/IS&T Electronic Imaging Conference on Document Recognition & Retrieval XIII. SPIE, Bellingham (2006)

    Google Scholar 

  38. Zavorin, I., Borovikov, E., Borovikov, A., Hernandez, L., Summers, K., Turner, M.: A multi-evidence, multi-engine OCR system. In: Proceedings of SPIE/IS&T Electronic Imaging Conference on Document Recognition & Retrieval XIV (2007)

    Google Scholar 

  39. Zavorin, I., Borovikov, E., Davis, E., Borovikov, A., Summers, K.: Combining different classification approaches to improve off-line Arabic handwritten word recognition. In: Proceedings of SPIE/IS&T Electronic Imaging Conference on Document Recognition & Retrieval XV (2008)

    Google Scholar 

  40. Zuo, Y., Zong, C.: Multi-engine based Chinese-to-English translation system. In: The INTERSPEECH/ICSLP-2004 Satellite Workshop: International Workshop on Spoken Language Translation (2004)

    Google Scholar 

Download references

Acknowledgements

We would like to express our thanks to David Doermann (UMCP) for providing a nice GUI tool for Arabic handwriting ground truth preparation, Sargur N. Srihari (CEDAR) for providing an efficient solution for document image segmentation, Volker Märgner (TU Braunschweig) for providing access to the IfN database and for evaluating our Arabic handwriting recognizer, Leila Saidi (CACI) for helping with Arabic documents ground truth creation, Ericson Davis (CACI) for providing technical solutions to document image processing, Anna Borovikov (CACI) for coordinating Arabic corpora creation and for mathematical advice, Yaguang Yang (CACI) for providing custom document image transform and for software coding, Kristen Summers (CACI) for providing technical expertise and leadership in the area of document understanding in general and in Arabic OCR in particular, Mark Turner (CACI) for administrative project oversight, and Luis Hernandez (ARL) for access to various Arabic datasets and GOTS software.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Eugene Borovikov .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag London

About this chapter

Cite this chapter

Borovikov, E., Zavorin, I. (2012). A Multi-stage Approach to Arabic Document Analysis. In: Märgner, V., El Abed, H. (eds) Guide to OCR for Arabic Scripts. Springer, London. https://doi.org/10.1007/978-1-4471-4072-6_3

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-4072-6_3

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-4471-4071-9

  • Online ISBN: 978-1-4471-4072-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics