Pattern Recognition and Image Analysis, Volume 21, Issue 2, pp 324–327

Simple algorithm page layout analysis

  • A. O. Shigarov
  • R. K. Fedorov
Representation, Processing, Analysis and Understanding of Images

Abstract

This paper proposes an algorithm for page layout analysis (segmentation) that detects the whitespace between text blocks on a document page. The algorithm can be used in document analysis and recognition problems, in particular for recognizing columns in multicolumn text and tables. It is simple to implement.
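The abstract does not describe the algorithm itself, so the following is only a minimal sketch of the general idea of finding whitespace between text blocks, not the authors' method. It assumes text blocks are already available as axis-aligned bounding boxes and looks for vertical strips that no box covers, which is one simple way to find column separators. The function name `vertical_gaps`, the box format, and the `min_gap` threshold are all hypothetical choices made for illustration.

```python
from typing import List, Tuple

# Hypothetical box format: (x0, y0, x1, y1) in page coordinates.
Box = Tuple[float, float, float, float]


def vertical_gaps(boxes: List[Box], min_gap: float = 10.0) -> List[Tuple[float, float]]:
    """Return x-intervals between boxes that no box covers and that are
    at least min_gap wide (candidate column separators)."""
    # Project every box onto the x-axis and sort the intervals by left edge.
    intervals = sorted((x0, x1) for x0, _, x1, _ in boxes)
    gaps: List[Tuple[float, float]] = []
    if not intervals:
        return gaps

    covered_to = intervals[0][1]  # right edge of the merged run so far
    for x0, x1 in intervals[1:]:
        # A gap exists when the next interval starts well past the covered run.
        if x0 - covered_to >= min_gap:
            gaps.append((covered_to, x0))
        covered_to = max(covered_to, x1)
    return gaps


# Two columns of two text lines each (illustrative coordinates).
boxes = [
    (50, 100, 280, 120),
    (50, 130, 275, 150),
    (320, 100, 550, 120),
    (320, 130, 540, 150),
]
print(vertical_gaps(boxes))  # [(280, 320)]
```

This projection-based sketch only finds gaps that span the full height of the given boxes. A page with mixed single-column and multicolumn regions would need the boxes grouped by region first, or a more general whitespace search.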

Keywords

Voronoi Diagram, Document Image, Suggested Algorithm, Text Block, Document Page
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Pleiades Publishing, Ltd. 2011

Authors and Affiliations

  • A. O. Shigarov (1)
  • R. K. Fedorov (1)
  1. Institute for System Dynamics and Control Theory, Siberian Branch, Russian Academy of Sciences, Irkutsk, Russia