An Integrated OCR Software for Mathematical Documents and Its Output with Accessibility
This paper describes shortly a practical integrated system for scientific documents including mathematical formulae, named ‘Infty’. The system consists of three components of applications: an OCR system named ‘InftyReader’, an editor named ‘InftyEditor’ and converting tools into various formats. Those applications are linked each other via XML files.
InftyReader recognizes scanned images of clearly printed mathematical documents and outputs the recognition results in a XML format. It recognizes complex mathematical formulae used in various research papers of mathematics including matrices. InftyEditor provides a very efficient interface to correct the recognition results using keyboard. Another feature of InftyEditor is its handwriting interface to input mathematical formulae for users with vision and speech interface for visually impaired uses.
The XML files output by InftyReader/Editor can be converted into various formats: LATEX, MathML, HTML and Braille Codes; in UBC (Unified Braille Codes) for English texts and in Japanese Braille Codes for Japanese texts.
KeywordsMathematical Formula Recognition Result Virtual Link Screen Reader Text Area
Unable to display preview. Download preview PDF.
- 1.Fukuda, R., Ohtake, N., Suzuki, M.: Optical Recognition and Braille Transcription of Mathematical Documents. In: Proc. ICCHP, pp. 711–718 (2000)Google Scholar
- 2.Eto, Y., Suzuki, M.: Mathematical formula recognition using virtual link network. In: Proc. ICDAR, pp. 762–767 (2001)Google Scholar
- 4.Suzuki, M., Tamari, F., Fukuda, R., Uchida, S., Kanahori, T.: INFTY – An Integrated OCR System for Mathematical Documents. In: Proc. DocEng (2003)Google Scholar
- 6.Ishikawa, J.: EXTRA for Windows ver. 1.0 users manual, Amedia Co., Ltd., Tokyo (2001) Google Scholar