A method of table detection in metafiles

  • A. O. Shigarov
  • I. V. Bychkov
  • G. M. Ruzhnikov
  • A. E. Khmel’nov
Application Problems

Abstract

A method is proposed for detecting statistical tables. The method takes metafiles as input data, which allows it to be applied to documents of different formats. Table detection is treated as a bottom-up segmentation of a document page, that is, segmentation that proceeds from the simple elements of a page to more complex ones. An experimental evaluation shows that the method is efficient when applied to a wide class of statistical tables.

Key words

Document analysis and recognition, table extraction from documents, table detection

Copyright information

© Pleiades Publishing, Ltd. 2009

Authors and Affiliations

  • A. O. Shigarov (1)
  • I. V. Bychkov (1)
  • G. M. Ruzhnikov (1)
  • A. E. Khmel’nov (1)

  1. Institute of System Dynamics and Control Theory, Siberian Branch, Russian Academy of Sciences, Irkutsk, a/ya 292, Russia
