Advertisement

Physical Layout Analysis of Complex Structured Arabic Documents Using Artificial Neural Nets

  • Karim Hadjar
  • Rolf Ingold
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3163)

Abstract

This paper describes PLANET, a recognition method to be applied on Arabic documents with complex structures allowing incremental learning in an interactive environment. The classification is driven by artificial neural nets each one being specialized in a document model. The first prototype of PLANET has been tested on five different phases of newspaper image analysis: thread recognition, frame recognition, image text separation, text line recognition and line merging into blocks. The learning capability has been tested on line merging into blocks. Some promising experimental results are reported.

Keywords

Recognition Rate Text Line Incremental Learning Average Recognition Rate Layout Analysis 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

  1. 1.
    Antonacopoulos, A., Gatos, B., Karatzas, D.: ICDAR 2003 Page Segmentation Contest. In: Proceedings of the 7th International Conference on Document Analysis and Recognition, Edinburgh, Scotland, August 2003, pp. 688–692 (2003)Google Scholar
  2. 2.
    Gatos, B., Mantzaris, S.L., Antonacopoulos, A.: First international newspaper segmentation contest. In: Proceedings of the 6th International Conference on Document Analysis and Recognition, Seattle, USA, September 2001, pp. 1190–1194 (2001)Google Scholar
  3. 3.
    Hadjar, K., Hitz, O., Ingold, R.: Newspaper Page Decomposition using a Split and Merge Approach. In: Proceedings of the 6th International Conference on Document Analysis and Recognition, Seattle, USA, September 2001, pp. 1186–1189 (2001)Google Scholar
  4. 4.
    Hadjar, K., Ingold, R.: Arabic Newspaper Page Segmentation. In: Proceedings of the 7th International Conference on Document Analysis and Recognition, Edinburgh, Scotland, August 2003, pp. 895–899 (2003)Google Scholar
  5. 5.
    Hadjar, K., Hitz, O., Robadey, L., Ingold, R.: Configuration REecognition Model for Complex Reverse Engineering Methods: 2(CREM). In: Lopresti, D.P., Hu, J., Kashi, R.S. (eds.) DAS 2002. LNCS, vol. 2423, pp. 469–479. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  6. 6.
    Haralick, R.M.: Document image understanding: Geometric and logical layout. In: Proceedings Internet. Conf. On Computer Vision and Pattern Recognition, pp. 385–390 (1994)Google Scholar
  7. 7.
    Hitz, O., Robadey, L., Ingold, R.: An architecture for editing documents recognition results using xml technology. In: Proceedings of the 4th International Workshop on Document Analysis Systems, DAS 2000, Rio de Janeiro, Brazil, December 2000, pp. 385–396 (2000)Google Scholar
  8. 8.
    Hu, J., Kashi, R., Lopresti, D., Nagy, G., Wilfong, G.: Why table ground truthing is hard. In: Proceedings of the 6th International Conference on Document Analysis and Recognition, Seattle, USA, September 2001, pp. 129–133 (2001)Google Scholar
  9. 9.
  10. 10.
    Nagy, G.: Twenty Years of Document Image Analysis in PAMI. IEEE Transactions on Pattern Analysis and Machine Intelligence 22(1), 38–62 (2000)CrossRefGoogle Scholar
  11. 11.
    Wasserman, P.D.: Neural Computing: Theory and Practice. Van Nostrand Reinhold, New York (1989)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Karim Hadjar
    • 1
  • Rolf Ingold
    • 1
  1. 1.DIUFUniversity of FribourgFribourgSwitzerland

Personalised recommendations