Abstract.
This paper is concerned with research on OCR (optical character recognition) of printed mathematical expressions. Construction of a representative corpus of technical and scientific documents containing expressions is discussed. A statistical investigation of the corpus is presented, and usefulness of this analysis is demonstrated in the related research problems, namely, (i) identification and segmentation of expression zones from the rest of the document, (ii) recognition of expression symbols, (iii) interpretation of expression structures, and (iv) performance evaluation of a mathematical expression recognition system. Moreover, a groundtruthing format has been proposed to facilitate automatic evaluation of expression recognition techniques.
Similar content being viewed by others
References
Fateman RJ, Tokuyasu T (1996) Progress in recognizing typeset mathematics. In: Proc. SPIE, San Jose, CA, 2660:37-50
Blostein D, Grbavec A (1997) Recognition of mathematical notation. In: Bunke H, Wang PSP (eds) Handbook of character recognition and document image analysis. World Scientific, Singapore, pp 557-582
Chan K-F, Yeung D-Y (2000) Mathematical expression recognition: a survey. Int J Doc Anal Recog 3(1):3-15
Phillips I (1998) Methodologies for using UW databases for OCR and image understanding systems. Proc. SPIE, Document Recognition V, 3305:112-127
Chan K-F, Yeung D-Y (2001) Error detection, error correction and performance evaluation in on-line mathematical expression recognition. Pattern Recog 34:1671-1684
Okamoto M, Imai H, Takagi K (2001) Performance evaluation of a robust method for mathematical expression recognition. In: Proc. 6th international conference on document analysis and recognition, Seattle, pp 121-128
Zanibbi R, Blostein D, Cordy JR (2002) Recognizing mathematical expressions using tree transformation. IEEE Trans Pattern Anal Mach Intell 24(11):1455-1467
Raman TV (1994) Audio system for technical readings. Doctoral dissertation, Cornell University, Ithaca, NY
Garain U, Chaudhuri BB (2001) On development and statistical analysis of a corpus for printed and handwritten mathematical expressions. In: Proc. 4th IAPR international workshop on graphics recognition (GREC2001), Canada, 2001, pp 429-439
Jain AK, Yu B (1998) Document representation and its application to page decomposition. IEEE Trans Pattern Anal Mach Intell 20(3):294-308
Pavlidis T, Zhou J (1992) Page segmentation and classification. Comput Vis Graph Image Process 54:484-496
Lee HJ, Wang J-S (1995) Design of a mathematical expression recognition system. In: Proc. 3rd international conference on document analysis and recognition, Montreal, pp 1084--1087
Toumit J-Y, Garcia-Salicetti S, Emptoz H (1999) A hierarchical and recursive model of mathematical expressions for automatic reading of mathematical documents. In: Proc. 5th international conference on document analysis and recognition, Bangalore, India, pp 119-122
Fateman RJ (2000) How to find mathematics on a scanned page. In: Proc. Document Recognition and Retrieval VII, January 2000, San Jose, CA
Kacem A, Belaid A, Ben Ahmed M (2001) Automatic extraction of printed mathematical formulas using fuzzy logic and propagation of context. Int J Doc Anal Recog 4(2):97-108
Chowdhury SP, Mandal S, Das AK, Chanda B (2003) Automated segmentation of math-zones from document images. In: Proc. 7th international conference on document analysis and recognition, Edinburgh, UK, pp 755-759
Jin J, Han X, Wang Q (2003) Mathematical formulas extraction. In: Proc. 7th international conference on document analysis and recognition, Edinburgh, UK, pp 1138-1141
Suzuki M, Tamari F, Fukuda R, Uchida S, Kanahori T (2003) INFTY - An integrated OCR system for mathematical documents. In: Proc. ACM symposium on document engineering (DocEng), Grenoble, France, pp 95-104
Chaudhuri BB, Garain U (1998) Automatic detection of italic, bold and all-capital words in document. In: Proc. 14th international conference on pattern recognition (ICPR), Brisbane, Australia, pp 610-612
Chaudhuri BB, Garain U (2001) Extraction of type style based meta-information from imaged documents. Int J Doc Anal Recog 3(3):138-149
Chaudhuri BB, Garain U (2000) An approach for recognition and interpretation of mathematical expressions in printed documents. Pattern Anal Appl 3:120-131
Mitra J, Garain U, Chaudhuri BB, Swamy HVK, Pal T (2003) Automatic understanding of structures in printed mathematical expressions. In: Proc. 7th international conference on document analysis and recognition, Edinburgh, UK, pp 540-544
Zipf KG (1949) Human behavior and the principle of least effort: an introduction to human ecology. Addison-Wesley, Reading, MA
Phillips I, Chhabra A (1999) Empirical performance evaluation of graphics recognition systems. IEEE Trans Pattern Anal Mach Intell 21(9):849-870
Author information
Authors and Affiliations
Corresponding author
Additional information
Received: 10 July 2003, Accepted: 22 November 2004, Published online: 18 March 2005
Correspondence to: Utpal Garain
Rights and permissions
About this article
Cite this article
Garain, U., Chaudhuri, B.B. A corpus for OCR research on mathematical expressions. IJDAR 7, 241–259 (2005). https://doi.org/10.1007/s10032-004-0140-5
Issue Date:
DOI: https://doi.org/10.1007/s10032-004-0140-5