Abu-Tarif AA (1998) Table processing and understanding. Master’s thesis, Rensselaer Polytechnic Institute, Troy, NY
Amano A, Asada N (2002) Complex table form analysis using graph grammar. Lecture notes in computer science, vol 2423. Springer, Berlin Heidelberg New York, pp 283-386
Amano A, Asada N, Motoyama T, Sumiyoshi T, Suzuki K (2001) Table form document synthesis by grammar-based structure analysis. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 533-537
Arias JF, Chhabra A, Misra V (1996) Efficient interpretation of tabular documents. In: Proceedings of the 13th international conference on pattern recognition, Vienna, Austria, 25-30 August 1996, pp 681-685
Arias JF, Chhabra A, Misra V (1996) Interpreting and representing tabular documents. In: Proceedings of the conference on computer vision and pattern recognition, San Francisco, 18-20 June 1996, pp 600-605
Balasubramanian S, Chandran S, Arias J, Kasturi R (1994) Information extraction from tabular drawings. In: Proceedings of Document Recognition I (IS&T/SPIE Electronic Imaging), San Jose, CA, 9 February 1994, 2181:152-163
Belaïd A (2001) Recognition of table of contents for electronic library consulting. Int J Doc Anal Recog 4(1):35-45
Google Scholar
Bing L, Zao J, Hong X (1999) New method for logical structure extraction of form document image. In: Proceedings of Document Recognition and Retrieval VI (IS&T/SPIE Electronic Imaging), San Jose, CA, 27 January 1999, 3651:183-193
Bourgeois FL, Emptoz H, Bensafi S (2001) Document understanding using probabilistic relaxation: application on tables of contents of periodicals. In: Proceedings of the international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 508-512
Bunke H (1993) Structural and syntactic pattern recognition. Chen CH, Pau LF, Wang PSP (eds) Handbook of pattern recognition and computer vision. World Scientific, Singapore, pp 163-209
Casey RG, Lecolinet E (1996) A survey of methods and strategies in character segmentation. IEEE Trans Patt Anal Mach Intell 18(7):690-706
Google Scholar
Cesarini F, Gori M, Marinai S, Soda G (1998) INFORMys: a flexible invoice-like form-reader system. IEEE Trans Patt Anal Mach Intell 20(7):730-745
Google Scholar
Cesarini F, Gori M, Marinai S, Soda G (1999) Structured document segmentation and representation by the modified X-Y tree. In: Proceedings of the 5th international conference on document analysis and recognition, Bangalore, India, 20-22 September 1999, pp 563-566
Cesarini F, Marinai S, Sarti L, Soda G (2002) Trainable table location in document images. In: Proceedings of the 16th international conference on pattern recognition, Québec City, Canada, 11-15 August 2002, 3:236-240
Chandran S, Kasturi R (1993) Structural recognition of tabulated data. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba Science City, Japan, 20-22 October 1993, pp 516-519
Chhabra AK, Misra V, Arias J (1996) Detection of horizontal lines in noisy run length encoded images: the FAST method. Lecture notes in computer science, vol 1072. Springer, Berlin Heidelberg New York, pp 35-48
Coll RA, Coll JH, Thakur G (1994) Graphs and tables: a four-factor experiment. Commun ACM 37(4):76-86
Google Scholar
Coüasnon B (2001) DMOS: A generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table recognition systems. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 215-220
Coüasnon B, Pasquer L (2001) A real-world evaluation of a generic document recognition method applied to a military form of the 19th century. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 779-783
Douglas S, Hurst M (1996) Layout and language: lists and tables in technical documents. In: Proceedings of the ACL SIGPARSE workshop on punctuation in computational linguistics, Santa Cruz, CA, 28 June 1996, pp 19-24
Douglas S, Hurst M, Quinn D (1995) Using natural language processing for identifying and interpreting tables in plain text. In: Proceedings of the 4th annual symposium on document analysis and information retrieval, Las Vegas, 15-17 April 1995, pp 535-546
Green E, Krishnamoorthy M (1996) Model-based analysis of printed tables. Lecture notes in computer science, vol 1072. Springer, Berlin Heidelberg New York, pp 80-91
Grossman J (ed) (1993) Chicago manual of style, 14th edn, chap 12. University of Chicago Press, Chicago
Tsai S, Chen H, Tsai J (2000) Mining tables from large scale HTML texts. In: Proceedings of the 18th international conference on computational linguistics, Saarbrucken, Germany, 31 July-4 August 2000
Ha J, Haralick RM, Phillips IT (1995) Recursive X-Y cut using bounding boxes of connected components. In: Proceedings of the 3rd international conference on document analysis and recognition, Montreal, 14-16 August 1995, pp 952-955
Hall R (1943) Handbook of tabular presentation. Ronald Press, New York
Handley JC (1999) Electronic imaging technology, chap 8. IS&T/SPIE Optical Engineering Press, Bellingham, WA
Handley JC (2001) Table analysis for multi-line cell identification. In: Proceedings of Document Recognition and Retrieval VIII (IS&T/SPIE electronic imaging), San Jose, CA, 24 January 2001, 4307:34-43
Haralick RM (1994) Document image understanding: geometric and logical layout. In: Proceedings of the conference on computer vision and pattern recognition, Seattle, 20-24 June 1994, pp 385-390
Haralick RM, Shapiro LG (1992) Computer and robot vision (2 vols). Addison-Wesley, Reading, MA
Hinkelmann K, Kempthorne O (1994) Design and analysis of experiments: introduction to experimental design, vol 1. Wiley, New York
Hirayama Y (1993) A block segmentation method for document images with complicated column structures. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba Science City, Japan, 20-22 October 1993, pp 91-94
Hirayama Y (1995) A method for table structure analysis using DP matching. In: Proceedings of the 3rd international conference on document analysis and recognition, Montreal, 14-16 August 1995, pp 583-586
Hori O, Doermann DS (1995) Robust table-form structure analysis based on box-driven reasoning. In: Proceedings of the 3rd international conference on document analysis and recognition, Montreal, 14-16 August 1995, pp 218-221
Hu J, Kashi R, Lopresti D, Nagy G, Wilfong G (2001) Why table ground-truthing is hard. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 129-133
Hu J, Kashi R, Lopresti D, Wilfong G (2000) Medium-independent table detection. In: Proceedings of Document Recognition and Retrieval VII (IS&T/SPIE electronic imaging), San Jose, CA, 26 January 2000, 3967:291-302
Hu J, Kashi R, Lopresti D, Wilfong G (2001a) Experiments in table recognition. In: Proceedings of the workshop on document layout interpretation and applications, Seattle, 9 September 2001
Hu J, Kashi R, Lopresti D, Wilfong G (2001b) Table structure recognition and its evaluation. In: Proceedings of Document Recognition and Retrieval VIII (IS&T/SPIE Electronic Imaging), San Jose, CA, 24 January 2001, 4307:44-55
Hu J, Kashi RS, Lopresti D, Wilfong GT (2002) Evaluating the performance of table processing algorithms. Int J Doc Anal Recog 4(3):140-153
Google Scholar
Hurst M (1999) Layout and language: beyond simple text for information interaction - modelling the table. In: Proceedings of the 2nd international conference on multimodal interfaces, Hong Kong, 5-7 January 1999
Hurst M (2001a) Layout and language: an efficient algorithm for detecting text blocks based on spatial and linguistic evidence. In: Proceedings of Document Recognition and Retrieval VIII (IS&T/SPIE Electronic Imaging), San Jose, CA, 24 January 2001, 4307:56-67
Hurst M (2001b) Layout and language: challenges for table understanding on the web. In: Proceedings of the first international workshop on Web document analysis, Seattle, 8 September 2001, pp 27-30
Hurst M, Douglas S (1997) Layout and language: preliminary investigations in recognizing the structure of tables. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, 18-20 August 1997, pp 1043-1047
Hurst M, Nasukawa T (2000) Layout and language: integrating spatial and linguistic knowledge for layout understanding tasks. In: Proceedings of the 18th international conference on computational linguistics, Saarbrucken, Germany, 31 July-4 August 2000
Itonori K (1993) Table structure recognition based on textblock arrangement and ruled line position. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba Science City, Japan, 20-22 October 1993, pp 765-768
Jain AK, Yu B (1998) Document representation and its application to page decomposition. IEEE Trans Patt Anal Mach Intell 20(3):294-308
Google Scholar
Kieninger TG (1998) Table structure recognition based on robust block segmentation. In: Proceedings of Document Recognition V (IS&T/SPIE Electronic Imaging), San Jose, CA, 28 January 1998, 3305:22-32
Kieninger TG, Dengel A (2001) Applying the T-RECS table recognition system to the business letter domain. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 518-522
Klein B, Gökkus S, Kieninger T, Dengel A (2001) Three approaches to “industrial” table spotting. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 513-517
Kojima H, Akiyama T (1990) Table recognition for automated document entry system. High-speed inspection architectures, barcoding, and character recognition. In: (Proceedings of SPIE), Boston, MA, 5-7 November 1990, 1384:285-292
Kornfeld W, Wattecamps J (1998) Automatically locating, extracting and analyzing tabular data. In: Proceedings of the 21st ACM SIGIR international conference on research and development in information retrieval, Melbourne, Australia, 24-28 August 1998, pp 347-348
Krishnamoorthy S, Nagy G, Seth S, Viswanathan M (1993) Syntactic segmentation and labelling of digitized pages from technical journals. IEEE Trans Patt Anal Mach Intell 15(7):737-747
Google Scholar
Lam SW, Javanbakht L, Srihari SN (1993) Anatomy of a form reader. In: Proceedings of the 2nd international conference on document analysis and recognition, Tsukuba Science City, Japan, 20-22 October 1993, pp 506-509
Laurentini A, Viada P (1992) Identifying and understanding tabular material in compound documents. In: Proceedings of the 11th international conference on pattern recognition, The Hague, The Netherlands, 30 August-3 September 1992, pp 405-409
Lewandowksy S, Spence I (1989) The perception of statistical graphs. Sociol Meth Res 18(2&3):200-242
Google Scholar
Liang J (1999) Document structure analysis and performance evaluation. PhD thesis, University of Washington, Seattle
Lopresti D (2002) Exploiting WWW resources in experimental document analysis research. Lecture notes in computer science, vol 2423. Springer, Berlin Heidelberg New York, pp 532-543
Lopresti D, Nagy G (1999) Automated table processing: an (opinionated) survey. In: Proceedings of the 3rd international workshop on graphics recognition, Jaipur, India, 26-27 September 1999, pp 109-134
Lopresti D, Nagy G (2000) A tabular survey of automated table processing. Lecture notes in computer science, vol 1941. Springer, Berlin Heidelberg New York, pp 93-120
Lopresti D, Wilfong G (2001) Evaluating document analysis results via graph probing. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 116-120
Mao S, Kanungo T (2001) Empirical performance evaluation methodology and its application to page segmentation algorithms. IEEE Trans Patt Anal Mach Intell 23(3):242-256
Google Scholar
Nagy G (2000) Twenty years of document image analysis in PAMI. IEEE Trans Patt Anal Mach Intell 22(1):38-62
Google Scholar
Nagy G, Seth S (1984) Hierarchical representation of optically scanned documents. In: Proceedings of the 7th international conference on pattern recognition, Montreal, July 1984, pp 347-349
Nelder JA, Mead R (1965) A simplex method for function minimization. Comput J (7):308-313
Ng HT, Lim CY, Koo JLT (1999) Learning to recognize tables in free text. In: Proceedings of the 37th annual meeting of the Association of Computational Linguistics, College Park, MD, 20-26 June 1999, pp 443-450
O’Gorman L (1992) Image and document processing techniques for the RightPages electronic library system. In: Proceedings of the 11th international conference on pattern recognition, The Hague, Netherlands, 30 August-3 September 1992, pp 260-263
O’Gorman L (1993) The document spectrum for page layout analysis. IEEE Trans Patt Anal Mach Intell 15(11):1162-1173
Google Scholar
Perlovsky LI (1998) Conundrum of combinatorial complexity. IEEE Trans Patt Anal Mach Intell 20(6):666-670
Google Scholar
Peterman C, Chang CH, Alam H (1997) A system for table understanding. In: Proceedings of the conference on document image understanding technology, Annapolis, MD, April/May 1997, pp 55-62
Petrou M (1999) Learning in pattern recognition. Lecture notes in computer science, vol 1715. Springer, Berlin Heidelberg New York, pp 1-12
Phillips I, Chhabra AK (1999) Empirical performance evaluation of graphics recognition systems. IEEE Trans Patt Anal Mach Intell 21(9):849-870
Google Scholar
Pizano A (1992) Extracting line features from images of business forms and tables. In: Proceedings of the 11th international conference on pattern recognition, The Hague, The Netherlands, 30 August-3 September 1992, pp 399-403
Pyreddy P, Croft WB (1997) Tintin: a system for retrieval in text tables. In: Proceedings of the 2nd international conference on digital libraries, Philadelphia, 25-28 July 1997, pp 193-200
Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufmann, San Francisco
RAF Technology (1995) Illuminator user’s manual. Redmond, WA
Rahgozar MA, Cooperman R (1996) A graph-based table recognition system. In: Proceedings of Document Recognition III (IS&T/SPIE Electronic Imaging), San Jose, CA, 29 January 1996, 2660:192-203
Rosenfeld A, Kak AC (1982) Digital picture processing (2 vols). Academic, Orlando, FL
Rus D, Subramanian D (1997) Customizing information capture and access. ACM Trans Inf Sys 15(1):67-101
Google Scholar
Seong-Whan L, Dae-Seok R (2001) Parameter-free geometric document layout analysis. IEEE Trans Patt Anal Mach Intell 23(11):1240-1256
Google Scholar
Shamillian JH, Baird HS, Wood TL (1997) A retargetable table reader. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, 18-20 August 1997, pp 158-163
Souafi-Bensafi S, Parizeau M, Lebourgeois F, Emptoz H (2001) Bayesian networks classifiers applied to documents. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 508-511
Takasu A, Satoh S, Katsura E (1994) A document understanding method for database construction of an electronic library. In: Proceedings of the 12th international conference on pattern recognition, Jerusalem, Israel, 9-13 October 1994, pp 463-466
Takasu A, Satoh S, Katsura E (1995) A rule learning method for academic document image processing. In: Proceedings of the 3rd international conference on document analysis and recognition, Montreal, 14-16 August 1995, pp 239-242
Tersteegen W, Wenzel C (1998) Scantab: table recognition by reference tables. In: Proceedings of the 3rd workshop on document analysis systems, Nagano, Japan, 4-6 November 1998
Tubbs KM, Embley DW (2002) Recognizing records from the extracted cells of microfilm tables. In: Proceedings of the ACM symposium on document engineering, McLean, VA, 8-9 November 2002, pp 149-156
Tupaj S, Shi Z, Chang CH, Alam H (1996) Extracting tabular information from text files. Available from citeseer: http://citeseer.nj.nec.com/tupaj96extracting.html
Turolla E, Belaid A, Belaid Y (1996) Form item extraction based on line searching. Lecture notes in computer science, vol 1072. Springer, Berlin Heidelberg New York, pp 69-79
Viswanathan M, Green E, Krishnamoorthy MS (1996) Document recognition: an attribute grammar approach. In: Proceedings of Document Recognition III (IS&T/SPIE Electronic Imaging), San Jose, CA, 29 January 1996, 2660:101-111
Wang X (1996) Tabular abstraction, editing and formatting. PhD thesis, University of Waterloo, Canada
Wang Y, Haralick R, Phillips IT (2001) Zone content classification and its performance evaluation. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 540-544
Wang Y, Hu J (2002) Detecting tables in HTML documents. Lecture notes in computer science, vol 2423. Springer, Berlin Heidelberg New York, pp 249-260
Wang Y, Phillips IT, Haralick R (2001) Automatic table ground truth generation and a background-analysis-based table structure extraction method. In: Proceedings of the 6th international conference on document analysis and recognition, Seattle, 10-13 September 2001, pp 528-532
Wang Y, Phillips IT, Haralick R (2002) Table detection via probability optimization. Lecture notes in computer science, vol 2423. Springer, Berlin Heidelberg New York, pp 272-282
Watanabe T, Luo Q, Sugie N (1995) Layout recognition of multi-kinds of table-form documents. IEEE Trans Patt Anal Mach Intell 17(4):432-445
Google Scholar
Wong KY, Casey RG, Wahl FM (1982) Document analysis system. IBM J Res Dev 26(6):647-656
Google Scholar
Yoshida M, Torisawa K, Tsujii J (2001) A method to integrate tables of the world wide web. In: Proceedings of the 1st international workshop on Web document analysis, Seattle, 8 September 2001, pp 31-34
Yu B, Jain AK (1996) A generic system for form dropout. IEEE Trans Patt Anal Mach Intell 18(11):1127-1134
Google Scholar
Zuyev K (1997) Table image segmentation. In: Proceedings of the 4th international conference on document analysis and recognition, Ulm, Germany, 18-20 August 1997, pp 705-708