Retrieval by Layout Similarity of Documents Represented with MXY Trees

  • Francesca Cesarini
  • Simone Marinai
  • Giovanni Soda
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2423)


Document image retrievalcan be carried out either processing the converted text (obtained with OCR) or by measuring the layout similarity of images. We describe a system for document image retrieval based on layout similarity. The layout is described by means of a treebased representation: the Modified X-Y tree. Each page in the database is represented by a feature vector containing both global features of the page and a vectorialrepresen tation of its layout that is derived from the corresponding MXY tree. Occurrences of tree patterns are handled similarly to index terms in Information Retrieval in order to compute the similarity. When retrieving relevant documents, the images in the collection are sorted on the basis of a measure that is the combination of two values describing the similarity of global features and of the occurrences of tree patterns. The system is applied to the retrieval of documents belonging to digital libraries. Tests of the system are made on a data-set of more than 600 pages belonging to a journal of the 19th Century, and to a collection of monographs printed in the same Century and containing more than 600 pages.


Feature Vector Image Retrieval Digital Library Global Feature Document Image 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval. Addison Wesley, 1999.Google Scholar
  2. 2.
    A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain, “Content-based image retrievalat the end of the early years,” in IEEE Trans. PAMI, vol. 22, pp. 1349–1380, December 2000.Google Scholar
  3. 3.
    D. Doermann, “The indexing and retrievalof document images: A survey,” Computer Vision and Image Understanding, vol. 70, pp. 287–298, June 1998.Google Scholar
  4. 4.
    M. Mitra and B. B. Chaudhuri, “Information retirevalfrom documents: a survey,” Information Retrieval, vol. 2, pp. 141–163, 2000.CrossRefGoogle Scholar
  5. 5.
    D. Doermann, J. Sauvola, H. Kauniskangas, C. Shin, M. Pietikainen, and A. Rosenfeld, “The development of a general framework for intelligent document image retrieval,” in Document Analysis Systems, pp. 605–632, 1996.Google Scholar
  6. 6.
    C. Shin and D. Doermann, “Classification of document page images based on visual similarity of layout structures,” in SPIE 2000, pp. 182–190, 2000.Google Scholar
  7. 7.
    J. Hu, R. Kashi, and G. Wilfong, “Comparision and classi.cation of documents based on layout similarity,” Information Retrieval, vol. 2, pp. 227–243, May 2000.Google Scholar
  8. 8.
    J. F. Cullen, J. J. Hull, and P. E. Hart, “Document image database retrieval and browsing using texture analysis,” in Proceedings of the 4th International Conference on Document Analysis and Recognition, pp. 718–721, 1997.Google Scholar
  9. 9.
    F. Cesarini, M. Lastri, S. Marinai, and G. Soda, “Encoding of modified X-Y trees for document classification,” in Proceedings of the Sixth International Conference on Document Analysis and Recognition, pp. 1131–1136, 2001.Google Scholar
  10. 10.
    F. Cesarini, M. Gori, S. Marinai, and G. Soda, “Structured document segmentation and representation by the modified X-Y tree,” in Proc. of ICDAR’ 99, pp. 563–566, 1999.Google Scholar
  11. 11.
    W. Y. Arms, Digital Libraries. MIT Press, 2000.Google Scholar
  12. 12.
    G. Baklarz and B. Wong, DB2 Universal Database V7.1. Prentice Hall, 2000.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Francesca Cesarini
    • 1
  • Simone Marinai
    • 1
  • Giovanni Soda
    • 1
  1. 1.Dipartimento di Sistemi e InformaticaUniversitá di FirenzeFirenzeItaly

Personalised recommendations