Advertisement

Semantics Driven Table Understanding in Born-Digital Documents

  • Jacek Siciarek
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 233)

Summary

This paper presents a new approach to table understanding, suitable for born-digital PDF documents. Advance beyond the current state of the art in table understanding is provided by the proposed reverse MVC method, which takes advantage of only partial logic structure loss (degradation) in born-digital PDF documents, as opposed to unrecoverable loss (deterioration) taking place in scan based PDF documents.

Keywords

Tabular Data Portable Document Format Knowledge Layer View Component Tabular Structure 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Adobe Systems Inc., Document management – Portable document format – Part 1: PDF 1.7 (2008)Google Scholar
  2. 2.
    Elsevier. Grand Paper Challenge Webpage, http://www.executablepapers.com
  3. 3.
    Gbel, M., Hassan, T., Oro, E., Orsi, G.: A Methodology for Evaluating Algorithms for Table Understanding in PDF Documents (2013)Google Scholar
  4. 4.
    Hu, J., Nagy, G., Kashi, R., Wilfong, G., Lopresti, D.: Why table ground-truthing is hard (2011)Google Scholar
  5. 5.
    Hurst, M.: A constraint-based approach to table structure derivation (2003)Google Scholar
  6. 6.
    Siciarek, J., Wiszniewski, B.: IODA - an Interactive Open Document Architecture. Procedia Computer Science 4, 668–677 (2011)CrossRefGoogle Scholar
  7. 7.
    Veit, M., Herrmann, S.: Model-View-Controller and Object Teams: A Perfect Match of Paradigms. In: AOSD 2003, pp. 140–149. ACM Press (2003)Google Scholar
  8. 8.
    Wang, X.: Tabular Abstraction, Editing and Formatting. PhD thesis, University of Waterloo (1996)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. 1.Faculty of Electronics, Telecommunications and InformaticsGdansk University of TechnologyGdanskPoland

Personalised recommendations