Document Structure and Layout Analysis

  • Chapter

Part of the Advances in Pattern Recognition book series (ACVPR)

Keywords

  • Voronoi Diagram
  • Document Image
  • Text Line
  • Document Structure
  • Grammar Rule

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© 2007 Springer-Verlag London Limited

About this chapter

Cite this chapter

Namboodiri, A.M., Jain, A.K. (2007). Document Structure and Layout Analysis. In: Chaudhuri, B.B. (eds) Digital Document Processing. Advances in Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-84628-726-8_2

  • DOI: https://doi.org/10.1007/978-1-84628-726-8_2

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-84628-501-1

  • Online ISBN: 978-1-84628-726-8

  • eBook Packages: Computer Science, Computer Science (R0)
