An Approach for Processing Mathematical Expressions in Printed Document

  • B. B. Chaudhuri
  • U. Garain
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1655)


In this paper, we propose an approach for understanding mathematical expressions in printed document. The system consists of three main components namely (i) detection of mathematical expressions in a document, (ii) recognition of the symbols present in the expression and (iii) meaningful arrangement of the recognized symbols. However, detection of mathematical expressions is done through recognition of symbols. Moreover, some structural features of the expressions are also used for this purpose. For recognition of the symbols a hybrid of feature based and template based recognition techniques is used. The bounding-box coordinates and the size information of the symbols help to determine the spatial relationships among the symbols. A set of predefined grammar rules is used to form the meaningful symbol groups to properly arrange the symbols. Experiments conducted using these approaches on a large number of documents show high accuracy.


  1. 1.
    D. Blostein, A. Grbavec: Recognition of Mathematical Notation. In: H. Bunke, P. S. P. Wang (eds.): Handbook of Character Recognition and Document Image Analysis, World Scientific Publishing Company, (1997) 557–582Google Scholar
  2. 2.
    R. H. Anderson: Syntax-directed recognition of handprinted 2-D mathematics. Ph.D. Dissertation. Harvard University, Cambridge, M. A. (1968)Google Scholar
  3. 3.
    A. Grbavec, D. Blostein: mathematics recognition using graph rewriting. In: Proceedings of Third International Conference on Document Analysis and Recognition. Montreal, Canada (1995) 417–421Google Scholar
  4. 4.
    W. Martin: Computer input/output of mathematical expressions. In: Proceedings of Second Symposium on Symbolic and Algebraic Manipulations. New York (1971) 78–87Google Scholar
  5. 5.
    A. Belaid, J. Haton: A syntactic approach for handwritten mathematical formula Recognition. IEEE Transaction on pattern Analysis and machine Intelligence.6, 1 (1984) 105–111CrossRefGoogle Scholar
  6. 6.
    S. K. Chang: A method for the structural analysis of 2-D mathematical expressions. Information Sciences. 2, 3 (1970) 253–272zbMATHCrossRefGoogle Scholar
  7. 7.
    M. Okamoto, H. Miyazawa: An experimental implementation of a document recognition system for papers containing mathematical expressions. In: Structured Document Image Analysis. Springer-Verlag (1992) 36–53Google Scholar
  8. 8.
    M. Okamoto, H. Twaakyondo: Structure Analysis and Recognition of Mathematical Expressions. IEEE Computer Society Press (1995) 430–437Google Scholar
  9. 9.
    S. Larvirotte, L. Pottier: Mathematical formula recognition using graph grammar. In: Proceedings of SPIE, Vol. 3305. California, USA (1998)Google Scholar
  10. 10.
    H. Lee, M. Lee: Understanding mathematical expressions using procedure oriented transformation. Pattern Recognition, 27, 3 (1994) 447–457CrossRefGoogle Scholar
  11. 11.
    P. Chou: Recognition of equations using a two-dimensional context-free grammar. In: Proceedings of SPIE Visual Communication and Image Processing IV. Philadelphia PA (1989) 852–863Google Scholar
  12. 12.
    LATEX: A document Presentation System. Addison Wesley Publishing Company, Inc. (1986)Google Scholar
  13. 14.
    H. Lee and J. Wang: Design of a mathematical expression recognition system. In: Proceedings of Third International Conference on Document Analysis and Recognition. Montreal, Canada (1995) 1084–1087Google Scholar
  14. 15.
    U. Garain, B. B. Chaudhuri: Compound character recognition by a run number based metric distance. In: Proceedings of SPIE, Vol. 3305. San Jose (1998) 90–97CrossRefGoogle Scholar
  15. 16.
    B. B. Chaudhuri, U. Garain: Automatic detection of italic, bold and all-capital words from documents. In: Proceedings of International Conference on Pattern Recognition. Australia (1998) 610–612Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1999

Authors and Affiliations

  • B. B. Chaudhuri
    • 1
  • U. Garain
    • 1
  1. 1.Computer Vision & Pattern Recognition UnitIndian Statistical InstituteCalcuttaIndia

Personalised recommendations