DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way

Coüasnon, Bertrand

doi:10.1007/s10032-005-0148-5

DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way

Regular Paper
Published: 24 March 2006

Volume 8, pages 111–122, (2006)
Cite this article

International Journal of Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Bertrand Coüasnon¹

364 Accesses
44 Citations
6 Altmetric
Explore all metrics

Abstract

We will show in this paper one of the numerous interests of designing a generic recognition system, i.e. the possibility of producing either general or specific systems. We propose the Description and Modification of Segmentation (DMOS) method, which is made of a new grammatical language (Enhanced Position Formalism—EPF) and an associated parser able to deal with noise. From an EPF description of a kind of document structure, a new recognition system is produced by compilation. This method has been successfully used to produce recognition systems on musical scores, mathematical formulae and even tennis courts in videos. This DMOS generic method separates knowledge from program. Therefore, for a same kind of document like table structures, it is possible to define with EPF, more or less specific descriptions to produce more or less specific recognition systems. For example, we have been able to produce a general recognition system of table structures. It can recognize the hierarchical organization of a table made with rulings, whatever the number/size of column/rows and the deep of the hierarchy contents in it, as soon as the document has a not too bad quality (no missing rulings for example). We will present the way the description is done using EPF to be general enough to recognize very different table organizations. With the same DMOS generic method, we have also been able to easily define a specific recognition system of the table structure of quite damaged military forms of the 19th century. This specific description was necessary to compensate some missing informations concerning the table structure of those military forms, due to a very bad quality or hidden part of the table. This system has been successfully validated on 88,745 images, showing that this DMOS generic method can be used at an industrial level.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Table Structure Recognition Using Top-Down and Bottom-Up Cues

TRACE: Table Reconstruction Aligned to Corner and Edges

Automatic Stave Discovery for Musical Facsimiles

References

Coüasnon, B.: Dealing with noise in DMOS, a generic method for structured document recognition: an example on a complete grammar. In: Lladós, J., Kwon, Y.-B. (eds.) Graphics Recognition: Recent Advances and Perspectives. LNCS, vol. 3088, pp. 38–49. Springer-Verlag, Berlin Heidelberg New York (2004)
Google Scholar
Coüasnon, B.: DMOS: A generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table structures recognition systems. In: ICDAR, International Conference on Document Analysis and Recognition, pp. 215–220, Seattle, USA (2001)
Coüasnon, B., Camillerapp, J.: Using grammars to segment and recognize music scores. In: Spitz, L., Dengel, A. (eds.) Document Analysis Systems. World Scientific, Singapore (1995)
Google Scholar
Poulain d'Andecy, V., Camillerapp, J., Leplumey, I.: Kalman filtering for segment detection: application to music scores analysis. In: ICPR, 12th International Conference on Pattern Recognition (IAPR), vol. 1, pp. 301–305, Jérusalem, Israel (1994)
Garcia, P., Coüasnon, B.: Using a generic document recognition method for mathematical formulae recognition. In: Graphics Recognition: Algorithms and Applications. LNCS, vol. 2390, pp. 236–244. Springer-Verlag, Berlin Heidelberg New York (2002)
Chapter Google Scholar
Klein, B., Dengel, A.R., Fordan, A.: smartFix: an adaptive system for document analysis and understanding. In: Dengel, A., Junker, M., Weisbecker, A. (eds.) Reading and Learning: Adaptive Content Recognition. LNCS, vol. 2956, pp. 166–186. Springer-Verlag, Berlin Heidelberg New York (2004)
Google Scholar
Schäfer, H., Thomas Bayer, T., Kreuzer, K., Miletzki, U., Schambach, M.-P., Schulte-Austum, M.: How postal address readers are made adaptive. In: Dengel, A., Junker, M., Weisbecker, A. (eds.) Reading and Learning: Adaptive Content Recognition. LNCS, vol. 2956, pp. 187–215. Springer-Verlag, Berlin Heidelberg New York (2004)
Google Scholar
Esposito, F., Malerba, D., Lisi, F.A.: Machine learning for intelligent processing of printed documents. J. Intell. Inform. Syst. 14(2–3), 175–198 (2000)
Article Google Scholar
Adam, S., Rigamonti, M., Clavier, E., Trupin, E., Ogier, J.-M., Tombre, K., Gardes, J.: Docmining: A document analysis system builder. In: Marinai, S., Dengel, A. (eds.) Document Analysis Systems VI, 6th International Workshop, DAS 2004, Florence, Italy, September 2004. Lecture Notes in Computer Science, vol. 3163, pp. 472–483. Springer-Verlag, Berlin Heidelberg New York (2004)
Middendorf, M., Peust, J., Schacht, C.: A component-based framework for recognition systems. In: Dengel, A., Junker, M., Weisbecker, A. (eds.) Reading and Learning: Adaptive Content Recognition. LNCS, vol. 2956, pp. 153–165. Springer-Verlag, Berlin Heidelberg New York (2004)
Google Scholar
Clavier, E., Masini, G., Delalandre, M., Rigamonti, M., Tombre, K., Gardes, J.: Docmining: a cooperative platform for heterogeneous document interpretation according to user-defined scenarios. In: Lladós, J., Kwon, Y.-B. (eds.) Graphics Recognition: Recent Advances and Perspectives. LNCS, vol. 3088, pp. 13–24. Springer-Verlag, Berlin Heidelberg New York (2004)
Google Scholar
Mao, S., Rosenfeld, A., Kanungo, T.: Document structure analysis algorithms: a literature survey. In: Document Recognition and Retreval X (Proceedings of SPIE/IST), vol. 5010: Santa Clara, California (2003)
Brainerd, W.S.: Tree generating regular systems. Inform. Control 14, 217–231 (1969)
Article MATH MathSciNet Google Scholar
Pfaltz, J.L., Rosenfeld, A.: Web grammars. In: Proceedings of the First International Joint Conference on Artificial Intelligence, pp. 609–619. Washington, DC (1969)
Feder, J.: Plex languages. Inform. Sci. 3, 225–241 (1971)
Article MathSciNet Google Scholar
Grbavec, A., Blostein, D.: Mathematics recognition using graph rewriting. In: ICDAR, International Conference on Document Analysis and Recognition, vol. 1, pp. 417–421. Montréal, Canada (1995)
Pereira, F.C.N., Warren, D.H.D.: Definite clauses for language analysis. Artif. Intell. 13, 231–278 (1980)
Article MATH MathSciNet Google Scholar
Searls, D.B., Taylor, S.L.: Document image analysis using logic-grammar-based syntactic pattern recognition. In: Yamamoto, K., Baird, H.S., Bunke, H. (eds.) Structured Document Image Analysis, pp. 520–545. Springer-Verlag, Berlin Heidelberg New York (1992)
Google Scholar
Coüasnon, B., Brisset, P., Stephan, I.: Using logic programming languages for optical music recognition. In: International Conference on the Practical Application of Prolog, pp. 115–134. Paris, France (1995)
Lopresti, D., Nagy, G.: A tabular survey of automated table processing. In: Chhabra, A.K., Dori, D. (eds.) Graphics Recognition, Recent Advances. Lecture Notes in Computer Science, vol. 1941, pp. 93–120. Springer-Verlag, Berlin Heidelberg New York (2000)
Zanibbi, R., Blostein, D., Cordy, J.R.: A survey of table recognition: models, observations, transformations, and inferences. Int. J. Document Anal. Recog. 7(1) (2004)
Taylor, S.L., Fritzson, R., Pastor, J.A.: Extraction of data from preprinted forms. Machine Vision Appl. 5(3), 211–222 (1992)
Article Google Scholar
Watanabe, T., Luo, Q., Sugie, N.: Toward a practical document understanding of table-form documents: its framework and knowledge representation. In: ICDAR, International Conference on Document Analysis and Recognition, pp. 510–515. Tsukuba Science City, Japan (1993)
Hori, O., Doermann, D.S.: Robust table-form structure analysis based on box-driven reasoning. In: ICDAR, International Conference on Document Analysis and Recognition, vol. 1, pp. 218–221. Montréal, Canada (1995)
Xingyuan, L., Doerman, D., Oh, W., Gao, W.: A robust method for unknown forms analysis. In: ICDAR, International Conference on Document Analysis and Recognition, pp. 531–534. Bangalore, India (1999)
Nielson, H.E., Barrett, W.A.: Consensus-based table form recognition. In: ICDAR, International Conference on Document Analysis and Recognition, vol. 2, pp. 906–910. Edinburgh, Scotland (2003)
Hu, J., Kashi, R., Lopresti, D., Wilfong, G.: System for understanding and reformulating tables. In: Fourth IAPR International Workshop on Document Analysis Systems, pp. 361–372. Rio de Janeiro, Brazil (2000)
Hurst, M.: A constraint-based approach to table structure derivation. In: ICDAR, International Conference on Document Analysis and Recognition, vol. 2, pp. 910–915. Edinburgh, Scotland (2003)
Hurst, M., Douglas, S.: Layout and language: preliminary investigations in recognizing the structure of tables. In: ICDAR, International Conference on Document Analysis and Recognition, vol. 2, pp. 1043–1047. Ulm, Germany (1997)
Kieninger, T., Dengel, A.: Applying the t-recs table recognition system to the business letter domain. In: ICDAR, International Conference on Document Analysis and Recognition, pp. 518–522, Seattle, USA, September 2001
Ramel, J.-Y., Crucianu, M., Vincent, N., Faure, C.: Detection, extraction and representation of tables. In: ICDAR, International Conference on Document Analysis and Recognition, vol. 1, pp. 374–378. Edinburgh, Scotland (2003)
Wang, Y., Phillips, I.T., Haralick, R.M.: Table detection via probability optimization. In: Hu, J., Lopresti, D., Kashi, R. (eds.) DAS 2002. LNCS 2423, pp. 272–282. Springer-Verlag, Berlin Heidelberg New York (2002)
Google Scholar
Klein, B., Gökkus, S., Kieninger, T., Dengel, A.: Three approaches to “industrial” table spotting. In: ICDAR, International Conference on Document Analysis and Recognition, pp. 513–517. Seattle, USA (2001)
Amano, A., Asada, N.: Graph grammar based analysis system of complex table form document. In: ICDAR, International Conference on Document Analysis and Recognition, vol. 2, pp. 916–920. Edinburgh, Scotland (2003)
Coüasnon, B., Pasquer, L.: A real-world evaluation of a generic document recognition method applied to a military form of the 19th century. In: ICDAR, International Conference on Document Analysis and Recognition, pp. 779–783, Seattle, USA (2001)
Coüasnon, B., Camillerapp, J., Leplumey, I.: Making handwritten archives documents accessible to public with a generic system of document image analysis. In: International Workshop on Document Image Analysis for Libraries (DIAL'04), pp. 270–277. Palo Alto, USA (2004)

Download references

Author information

Authors and Affiliations

IRISA/INRIA, Campus universitaire de Beaulieu, F-35042, Rennes Cedex, France
Bertrand Coüasnon

Authors

Bertrand Coüasnon
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bertrand Coüasnon.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Coüasnon, B. DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way. IJDAR 8, 111–122 (2006). https://doi.org/10.1007/s10032-005-0148-5

Download citation

Received: 27 July 2004
Accepted: 11 May 2005
Published: 24 March 2006
Issue Date: June 2006
DOI: https://doi.org/10.1007/s10032-005-0148-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way

Abstract

Access this article

Similar content being viewed by others

Table Structure Recognition Using Top-Down and Bottom-Up Cues

TRACE: Table Reconstruction Aligned to Corner and Edges

Automatic Stave Discovery for Musical Facsimiles

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

DMOS, a generic document recognition method: application to table structure analysis in a general and in a specific way

Abstract

Access this article

Similar content being viewed by others

Table Structure Recognition Using Top-Down and Bottom-Up Cues

TRACE: Table Reconstruction Aligned to Corner and Edges

Automatic Stave Discovery for Musical Facsimiles

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation