Logical Labeling of Document Images Using Layout Graph Matching with Adaptive Learning

  • Jian Liang
  • David Doermann
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2423)


Logical structure analysis of document images is an important problem in document image understanding. In this paper, we propose a graph matching approach to label logical components on a document page. Our system is able to learn a model for a document class, use this model to label document images through graph matching, and adaptively improve the model with error feed back. We tested our method on journal/proceeding article title pages. The experimental results show promising accuracy, and confirm the ability of adaptive learning.


Document Image Optical Character Recognition Graph Match Font Size Average Error Rate 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    O. Altamura, F. Esposito, and D. Malerba. “Transforming paper documents into xml format with WISDOM++”. Journal of Document Analysis and Recognition, 2000, 3(2):175–198.Google Scholar
  2. 2.
    T. A. Bayer and H. Walischewski. “Experiments on extracting structural information from paper documents using syntactic pattern analysis”. In Proceedings of The Third International Conference on Document Analysis And Recognition, 1995, pp. 476–479.Google Scholar
  3. 3.
    A. Dengel, R. Bleisinger, F. Fein, R. Hoch, F. Hones, and M. Malburg. “OfficeMAID-a system for office mail analysis, interpretation and delivery”. In International Workshop on Document Analysis Systems, 1994, pp. 253–276.Google Scholar
  4. 4.
    S. Gold and A. Rangarajan. “A graduated assignment algorithm for graph matching”. IEEE Trans. Pattern Anal. Machine Intell., 1996, 18(4):377–388.CrossRefGoogle Scholar
  5. 5.
    T. Hu and R. Ingold. “A mixed approach toward an efficient logical structure recognition from document images”. Electronic Publishing, 1993, 6(4):457–468.Google Scholar
  6. 6.
    Y. Ishitani. “Model-based information extraction method tolerant of OCR errors for document images”. In Proceedings of The Sixth International Conference on Document Analysis And Recognition, 2001, pp. 908–915.Google Scholar
  7. 7.
    M. Krishnamoorthy, G. Nagy, S. Seth, and M. Viswananthan. “Syntactic segmentation and labeling of digitized pp. from technical journals”. IEEE Transactions On Pattern Analysis And Machine Intelligence, 1993, 15(7):737–747.CrossRefGoogle Scholar
  8. 8.
    G. I. Palermo and Y. A. Dimitriadis. “Structured document labeling and rule extraction using a new recurrent fuzzy-neural system”. In Proceedings of The Fifth International Conference on Document Analysis And Recognition, 1999, pp. 181–184.Google Scholar
  9. 9.
    H. Walischewske. “Learning regions of interest in postal automation”. In Proceedings of The Fifth International Conference on Document Analysis And Recognition, 1999, pp. 317–340.Google Scholar
  10. 10.
    J. Liang and D. Doermann. “Page classification through logical labeling”. (To be published) In Proceedings of The International Conference of Pattern Recognition, 2002.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Jian Liang
    • 1
  • David Doermann
    • 1
  1. 1.Institute for Advanced Computer StudiesUniversity of Maryland at College ParkUSA

Personalised recommendations