An Integrated OCR Software for Mathematical Documents and Its Output with Accessibility

  • Masakazu Suzuki
  • Toshihiro Kanahori
  • Nobuyuki Ohtake
  • Katsuhito Yamaguchi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3118)


This paper briefly describes ‘Infty’, a practical integrated system for scientific documents that include mathematical formulae. The system consists of three components: an OCR system named ‘InftyReader’, an editor named ‘InftyEditor’, and tools for converting to various formats. These applications are linked to each other via XML files.

InftyReader recognizes scanned images of clearly printed mathematical documents and outputs the recognition results in an XML format. It recognizes the complex mathematical formulae, including matrices, that appear in mathematics research papers. InftyEditor provides an efficient interface for correcting the recognition results using the keyboard. InftyEditor also offers a handwriting interface for inputting mathematical formulae, intended for sighted users, and a speech interface for visually impaired users.

The XML files output by InftyReader and InftyEditor can be converted into various formats: LaTeX, MathML, HTML and Braille codes (UBC, Unified Braille Codes, for English texts and Japanese Braille Codes for Japanese texts).


Keywords (added by machine, not by the authors): Mathematical Formula, Recognition Result, Virtual Link, Screen Reader, Text Area





Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Masakazu Suzuki (1)
  • Toshihiro Kanahori (2)
  • Nobuyuki Ohtake (2)
  • Katsuhito Yamaguchi (3)

  1. Faculty of Mathematics, Kyushu University, Fukuoka, Japan
  2. Research Center on Educational Media, Tsukuba College of Technology, Ibaraki, Japan
  3. Junior College Funabashi Campus, Nihon University, Chiba, Japan
