
Document Analysis and Recognition, Volume 7, Issue 1, pp 1–16

A survey of table recognition

Models, observations, transformations, and inferences
  • Richard Zanibbi
  • Dorothea Blostein
  • James R. Cordy

Abstract.

Table characteristics vary widely. Consequently, a great variety of computational approaches have been applied to table recognition. In this survey, the table recognition literature is presented as an interaction of table models, observations, transformations, and inferences. A table model defines the physical and logical structure of tables; the model is used to detect tables and to analyze and decompose the detected tables. Observations perform feature measurements and data lookup, transformations alter or restructure data, and inferences generate and test hypotheses. This presentation clarifies both the decisions made by a table recognizer and the assumptions and inferencing techniques that underlie these decisions.
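As a rough illustration of how these four elements might interact, the following minimal sketch classifies a block of plain-text lines as a table or not. It is not taken from the survey or from any system it reviews: the names `TableModel`, `observe_gap_columns`, `transform_to_cells`, and `infer_is_table`, the whitespace-gap heuristic, and the thresholds are all assumptions chosen for illustration.

```python
import re
from dataclasses import dataclass


@dataclass
class TableModel:
    """Hypothetical table model: the minimal structure a table is assumed to have."""
    min_rows: int = 2
    min_cols: int = 2


def observe_gap_columns(lines):
    """Observation: measure which character positions are blank in every line."""
    width = max(len(line) for line in lines)
    padded = [line.ljust(width) for line in lines]
    return [i for i in range(width) if all(row[i] == " " for row in padded)]


def transform_to_cells(lines):
    """Transformation: restructure each line into cells, splitting on runs of 2+ spaces."""
    return [re.split(r"\s{2,}", line.strip()) for line in lines]


def infer_is_table(lines, model):
    """Inference: hypothesize that the block is a table and test it against the model."""
    if len(lines) < model.min_rows:
        return False
    # No shared blank column means no evidence of column separators.
    if not observe_gap_columns(lines):
        return False
    rows = transform_to_cells(lines)
    col_counts = {len(row) for row in rows}
    # Accept only if every row has the same number of cells and enough of them.
    return len(col_counts) == 1 and col_counts.pop() >= model.min_cols


table_block = [
    "Name   Qty  Price",
    "Bolt    10   0.25",
    "Nut      7   0.10",
]
prose_block = [
    "This is an ordinary sentence.",
    "It has no column structure.",
]

print(infer_is_table(table_block, TableModel()))  # True
print(infer_is_table(prose_block, TableModel()))  # False
```

Here the model supplies the assumptions (at least two rows and two columns), the observation measures a feature of the input, the transformation restructures lines into cells, and the inference tests the table hypothesis against the model. Real recognizers surveyed in the literature use far richer models and features.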

Keywords:

Table recognition, Modeling table structure, Performance evaluation, Information retrieval

Copyright information

© Springer-Verlag Berlin/Heidelberg 2004

Authors and Affiliations

  • Richard Zanibbi (1)
  • Dorothea Blostein (1)
  • James R. Cordy (1)
  1. School of Computing, Queen’s University, Kingston, Canada
