Abstract.
Table characteristics vary widely. Consequently, a great variety of computational approaches have been applied to table recognition. In this survey, the table recognition literature is presented as an interaction of table models, observations, transformations, and inferences. A table model defines the physical and logical structure of tables; the model is used to detect tables and to analyze and decompose the detected tables. Observations perform feature measurements and data lookup, transformations alter or restructure data, and inferences generate and test hypotheses. This presentation clarifies both the decisions made by a table recognizer and the assumptions and inferencing techniques that underlie these decisions.
This is a preview of subscription content, access via your institution.
References
Abu-Tarif AA (1998) Table processing and understanding. Master’s thesis, Rensselaer Polytechnic Institute, Troy, NY
Amano A, Asada N (2002) Complex table form analysis using graph grammar. Lecture notes in computer science, vol 2423. Springer, Berlin Heidelberg New York, pp 283-386
Amano A, Asada N, Motoyama T, Sumiyoshi T, Suzuki K (2001) Table form document synthesis by grammar-based structure analysis. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 533-537
Arias JF, Chhabra A, Misra V (1996) Efficient interpretation of tabular documents. In: Proceedings of the 13th international conference on pattern recognition, Vienna, Austria, 25-30 August 1996, pp 681-685
Arias JF, Chhabra A, Misra V (1996) Interpreting and representing tabular documents. In: Proceedings of the conference on computer vision and pattern recognition, San Francisco, 18-20 June 1996, pp 600-605
Balasubramanian S, Chandran S, Arias J, Kasturi R (1994) Information extraction from tabular drawings. In: Proceedings of Document Recognition I (IS&T/SPIE Electronic Imaging), San Jose, CA, 9 February 1994, 2181:152-163
Belaïd A (2001) Recognition of table of contents for electronic library consulting. Int J Doc Anal Recog 4(1):35-45
Bing L, Zao J, Hong X (1999) New method for logical structure extraction of form document image. In: Proceedings of Document Recognition and Retrieval VI (IS&T/SPIE Electronic Imaging), San Jose, CA, 27 January 1999, 3651:183-193
Bourgeois FL, Emptoz H, Bensafi S (2001) Document understanding using probabilistic relaxation: application on tables of contents of periodicals. In: Proceedings of the international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 508-512
Bunke H (1993) Structural and syntactic pattern recognition. Chen CH, Pau LF, Wang PSP (eds) Handbook of pattern recognition and computer vision. World Scientific, Singapore, pp 163-209
Casey RG, Lecolinet E (1996) A survey of methods and strategies in character segmentation. IEEE Trans Patt Anal Mach Intell 18(7):690-706
Cesarini F, Gori M, Marinai S, Soda G (1998) INFORMys: a flexible invoice-like form-reader system. IEEE Trans Patt Anal Mach Intell 20(7):730-745
Cesarini F, Gori M, Marinai S, Soda G (1999) Structured document segmentation and representation by the modified X-Y tree. In: Proceedings of the 5th international conference on document analysis and recognition, Bangalore, India, 20-22 September 1999, pp 563-566
Cesarini F, Marinai S, Sarti L, Soda G (2002) Trainable table location in document images. In: Proceedings of the 16th international conference on pattern recognition, Québec City, Canada, 11-15 August 2002, 3:236-240
Chandran S, Kasturi R (1993) Structural recognition of tabulated data. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba Science City, Japan, 20-22 October 1993, pp 516-519
Chhabra AK, Misra V, Arias J (1996) Detection of horizontal lines in noisy run length encoded images: the FAST method. Lecture notes in computer science, vol 1072. Springer, Berlin Heidelberg New York, pp 35-48
Coll RA, Coll JH, Thakur G (1994) Graphs and tables: a four-factor experiment. Commun ACM 37(4):76-86
Coüasnon B (2001) DMOS: A generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table recognition systems. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 215-220
Coüasnon B, Pasquer L (2001) A real-world evaluation of a generic document recognition method applied to a military form of the 19th century. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 779-783
Douglas S, Hurst M (1996) Layout and language: lists and tables in technical documents. In: Proceedings of the ACL SIGPARSE workshop on punctuation in computational linguistics, Santa Cruz, CA, 28 June 1996, pp 19-24
Douglas S, Hurst M, Quinn D (1995) Using natural language processing for identifying and interpreting tables in plain text. In: Proceedings of the 4th annual symposium on document analysis and information retrieval, Las Vegas, 15-17 April 1995, pp 535-546
Green E, Krishnamoorthy M (1996) Model-based analysis of printed tables. Lecture notes in computer science, vol 1072. Springer, Berlin Heidelberg New York, pp 80-91
Grossman J (ed) (1993) Chicago manual of style, 14th edn, chap 12. University of Chicago Press, Chicago
Tsai S, Chen H, Tsai J (2000) Mining tables from large scale HTML texts. In: Proceedings of the 18th international conference on computational linguistics, Saarbrucken, Germany, 31 July-4 August 2000
Ha J, Haralick RM, Phillips IT (1995) Recursive X-Y cut using bounding boxes of connected components. In: Proceedings of the 3rd international conference on document analysis and recognition, Montreal, 14-16 August 1995, pp 952-955
Hall R (1943) Handbook of tabular presentation. Ronald Press, New York
Handley JC (1999) Electronic imaging technology, chap 8. IS&T/SPIE Optical Engineering Press, Bellingham, WA
Handley JC (2001) Table analysis for multi-line cell identification. In: Proceedings of Document Recognition and Retrieval VIII (IS&T/SPIE electronic imaging), San Jose, CA, 24 January 2001, 4307:34-43
Haralick RM (1994) Document image understanding: geometric and logical layout. In: Proceedings of the conference on computer vision and pattern recognition, Seattle, 20-24 June 1994, pp 385-390
Haralick RM, Shapiro LG (1992) Computer and robot vision (2 vols). Addison-Wesley, Reading, MA
Hinkelmann K, Kempthorne O (1994) Design and analysis of experiments: introduction to experimental design, vol 1. Wiley, New York
Hirayama Y (1993) A block segmentation method for document images with complicated column structures. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba Science City, Japan, 20-22 October 1993, pp 91-94
Hirayama Y (1995) A method for table structure analysis using DP matching. In: Proceedings of the 3rd international conference on document analysis and recognition, Montreal, 14-16 August 1995, pp 583-586
Hori O, Doermann DS (1995) Robust table-form structure analysis based on box-driven reasoning. In: Proceedings of the 3rd international conference on document analysis and recognition, Montreal, 14-16 August 1995, pp 218-221
Hu J, Kashi R, Lopresti D, Nagy G, Wilfong G (2001) Why table ground-truthing is hard. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 129-133
Hu J, Kashi R, Lopresti D, Wilfong G (2000) Medium-independent table detection. In: Proceedings of Document Recognition and Retrieval VII (IS&T/SPIE electronic imaging), San Jose, CA, 26 January 2000, 3967:291-302
Hu J, Kashi R, Lopresti D, Wilfong G (2001a) Experiments in table recognition. In: Proceedings of the workshop on document layout interpretation and applications, Seattle, 9 September 2001
Hu J, Kashi R, Lopresti D, Wilfong G (2001b) Table structure recognition and its evaluation. In: Proceedings of Document Recognition and Retrieval VIII (IS&T/SPIE Electronic Imaging), San Jose, CA, 24 January 2001, 4307:44-55
Hu J, Kashi RS, Lopresti D, Wilfong GT (2002) Evaluating the performance of table processing algorithms. Int J Doc Anal Recog 4(3):140-153
Hurst M (1999) Layout and language: beyond simple text for information interaction - modelling the table. In: Proceedings of the 2nd international conference on multimodal interfaces, Hong Kong, 5-7 January 1999
Hurst M (2001a) Layout and language: an efficient algorithm for detecting text blocks based on spatial and linguistic evidence. In: Proceedings of Document Recognition and Retrieval VIII (IS&T/SPIE Electronic Imaging), San Jose, CA, 24 January 2001, 4307:56-67
Hurst M (2001b) Layout and language: challenges for table understanding on the web. In: Proceedings of the first international workshop on Web document analysis, Seattle, 8 September 2001, pp 27-30
Hurst M, Douglas S (1997) Layout and language: preliminary investigations in recognizing the structure of tables. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, 18-20 August 1997, pp 1043-1047
Hurst M, Nasukawa T (2000) Layout and language: integrating spatial and linguistic knowledge for layout understanding tasks. In: Proceedings of the 18th international conference on computational linguistics, Saarbrucken, Germany, 31 July-4 August 2000
Itonori K (1993) Table structure recognition based on textblock arrangement and ruled line position. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba Science City, Japan, 20-22 October 1993, pp 765-768
Jain AK, Yu B (1998) Document representation and its application to page decomposition. IEEE Trans Patt Anal Mach Intell 20(3):294-308
Kieninger TG (1998) Table structure recognition based on robust block segmentation. In: Proceedings of Document Recognition V (IS&T/SPIE Electronic Imaging), San Jose, CA, 28 January 1998, 3305:22-32
Kieninger TG, Dengel A (2001) Applying the T-RECS table recognition system to the business letter domain. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 518-522
Klein B, Gökkus S, Kieninger T, Dengel A (2001) Three approaches to “industrial” table spotting. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 513-517
Kojima H, Akiyama T (1990) Table recognition for automated document entry system. High-speed inspection architectures, barcoding, and character recognition. In: (Proceedings of SPIE), Boston, MA, 5-7 November 1990, 1384:285-292
Kornfeld W, Wattecamps J (1998) Automatically locating, extracting and analyzing tabular data. In: Proceedings of the 21st ACM SIGIR international conference on research and development in information retrieval, Melbourne, Australia, 24-28 August 1998, pp 347-348
Krishnamoorthy S, Nagy G, Seth S, Viswanathan M (1993) Syntactic segmentation and labelling of digitized pages from technical journals. IEEE Trans Patt Anal Mach Intell 15(7):737-747
Lam SW, Javanbakht L, Srihari SN (1993) Anatomy of a form reader. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba Science City, Japan, 20-22 October 1993, pp 506-509
Laurentini A, Viada P (1992) Identifying and understanding tabular material in compound documents. In: Proceedings of the 11th international conference on pattern recognition, The Hague, The Netherlands, 30 August-3 September 1992, pp 405-409
Lewandowksy S, Spence I (1989) The perception of statistical graphs. Sociol Meth Res 18(2&3):200-242
Liang J (1999) Document structure analysis and performance evaluation. PhD thesis, University of Washington, Seattle
Lopresti D (2002) Exploiting WWW resources in experimental document analysis research. Lecture notes in computer science, vol 2423. Springer, Berlin Heidelberg New York, pp 532-543
Lopresti D, Nagy G (1999) Automated table processing: an (opinionated) survey. In: Proceedings of the 3rd international workshop on graphics recognition, Jaipur, India, 26-27 September 1999, pp 109-134
Lopresti D, Nagy G (2000) A tabular survey of automated table processing. Lecture notes in computer science, vol 1941. Springer, Berlin Heidelberg New York, pp 93-120
Lopresti D, Wilfong G (2001) Evaluating document analysis results via graph probing. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 116-120
Mao S, Kanungo T (2001) Empirical performance evaluation methodology and its application to page segmentation algorithms. IEEE Trans Patt Anal Mach Intell 23(3):242-256
Nagy G (2000) Twenty years of document image analysis in PAMI. IEEE Trans Patt Anal Mach Intell 22(1):38-62
Nagy G, Seth S (1984) Hierarchical representation of optically scanned documents. In: Proceedings of the 7th international conference on pattern recognition, Montreal, July 1984, pp 347-349
Nelder JA, Mead R (1965) A simplex method for function minimization. Comput J (7):308-313
Ng HT, Lim CY, Koo JLT (1999) Learning to recognize tables in free text. In: Proceedings of the 37th annual meeting of the Association of Computational Linguistics, College Park, MD, 20-26 June 1999, pp 443-450
O’Gorman L (1992) Image and document processing techniques for the RightPages electronic library system. In: Proceedings of the 11th international conference on pattern recognition, The Hague, Netherlands, 30 August-3 September 1992, pp 260-263
O’Gorman L (1993) The document spectrum for page layout analysis. IEEE Trans Patt Anal Mach Intell 15(11):1162-1173
Perlovsky LI (1998) Conundrum of combinatorial complexity. IEEE Trans Patt Anal Mach Intell 20(6):666-670
Peterman C, Chang CH, Alam H (1997) A system for table understanding. In: Proceedings of the conference on document image understanding technology, Annapolis, MD, April/May 1997, pp 55-62
Petrou M (1999) Learning in pattern recognition. Lecture notes in computer science, vol 1715. Springer, Berlin Heidelberg New York, pp 1-12
Phillips I, Chhabra AK (1999) Empirical performance evaluation of graphics recognition systems. IEEE Trans Patt Anal Mach Intell 21(9):849-870
Pizano A (1992) Extracting line features from images of business forms and tables. In: Proceedings of the 11th international conference on pattern recognition, The Hague, The Netherlands, 30 August-3 September 1992, pp 399-403
Pyreddy P, Croft WB (1997) Tintin: a system for retrieval in text tables. In: Proceedings of the 2nd international conference on digital libraries, Philadelphia, 25-28 July 1997, pp 193-200
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann, San Francisco
RAF Technology (1995) Illuminator user’s manual. Redmond, WA
Rahgozar MA, Cooperman R (1996) A graph-based table recognition system. In: Proceedings of Document Recognition III (IS&T/SPIE Electronic Imaging), San Jose, CA, 29 January 1996, 2660:192-203
Rosenfeld A, Kak AC (1982) Digital picture processing (2 vols). Academic, Orlando, FL
Rus D, Subramanian D (1997) Customizing information capture and access. ACM Trans Inf Sys 15(1):67-101
Seong-Whan L, Dae-Seok R (2001) Parameter-free geometric document layout analysis. IEEE Trans Patt Anal Mach Intell 23(11):1240-1256
Shamillian JH, Baird HS, Wood TL (1997) A retargetable table reader. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, 18-20 August 1997, pp 158-163
Souafi-Bensafi S, Parizeau M, Lebourgeois F, Emptoz H (2001) Bayesian networks classifiers applied to documents. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 508-511
Takasu A, Satoh S, Katsura E (1994) A document understanding method for database construction of an electronic library. In: Proceedings of the 12th international conference on pattern recognition, Jerusalem, Israel, 9-13 October 1994, pp 463-466
Takasu A, Satoh S, Katsura E (1995) A rule learning method for academic document image processing. In: Proceedings of the 3rd international conference on document analysis and recognition, Montreal, 14-16 August 1995, pp 239-242
Tersteegen W, Wenzel C (1998) Scantab: table recognition by reference tables. In: Proceedings of the 3rd workshop on document analysis systems, Nagano, Japan, 4-6 November 1998
Tubbs KM, Embley DW (2002) Recognizing records from the extracted cells of microfilm tables. In: Proceedings of the ACM symposium on document engineering, McLean, VA, 8-9 November 2002, pp 149-156
Tupaj S, Shi Z, Chang CH, Alam H (1996) Extracting tabular information from text files. Available from citeseer: http://citeseer.nj.nec.com/tupaj96extracting.html
Turolla E, Belaid A, Belaid Y (1996) Form item extraction based on line searching. Lecture notes in computer science, vol 1072. Springer, Berlin Heidelberg New York, pp 69-79
Viswanathan M, Green E, Krishnamoorthy MS (1996) Document recognition: an attribute grammar approach. In: Proceedings of Document Recognition III (IS&T/SPIE Electronic Imaging), San Jose, CA, 29 January 1996, 2660:101-111
Wang X (1996) Tabular abstraction, editing and formatting. PhD thesis, University of Waterloo, Canada
Wang Y, Haralick R, Phillips IT (2001) Zone content classification and its performance evaluation. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 540-544
Wang Y, Hu J (2002) Detecting tables in HTML documents. Lecture notes in computer science, vol 2423. Springer, Berlin Heidelberg New York, pp 249-260
Wang Y, Phillips IT, Haralick R (2001) Automatic table ground truth generation and a background-analysis-based table structure extraction method. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 528-532
Wang Y, Phillips IT, Haralick R (2002) Table detection via probability optimization. Lecture notes in computer science, vol 2423. Springer, Berlin Heidelberg New York, pp 272-282
Watanabe T, Luo Q, Sugie N (1995) Layout recognition of multi-kinds of table-form documents. IEEE Trans Patt Anal Mach Intell 17(4):432-445
Wong KY, Casey RG, Wahl FM (1982) Document analysis system. IBM J Res Dev 26(6):647-656
Yoshida M, Torisawa K, Tsujii J (2001) A method to integrate tables of the world wide web. In: Proceedings of the 1st international workshop on Web document analysis, Seattle, 8 September 2001, pp 31-34
Yu B, Jain AK (1996) A generic system for form dropout. IEEE Trans Patt Anal Mach Intell 18(11):1127-1134
Zuyev K (1997) Table image segmentation. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, 18-20 August 1997, pp 705-708
Author information
Authors and Affiliations
Corresponding author
Additional information
Received: 29 May 2003, Revised: 28 October 2003, Published online: 1 April 2004
Correspondence to: Richard Zanibbi
Rights and permissions
About this article
Cite this article
Zanibbi, R., Blostein, D. & Cordy, J.R. A survey of table recognition. IJDAR 7, 1–16 (2004). https://doi.org/10.1007/s10032-004-0120-9
Issue Date:
DOI: https://doi.org/10.1007/s10032-004-0120-9
Keywords:
- Table recognition
- Modeling table structure
- Performance evaluation
- Information retrieval