A Complete OCR System for Tamil Magazine Documents

Kokku, Aparna; Chakravarthy, Srinivasa

doi:10.1007/978-1-84800-330-9_7

Aparna Kokku³ &
Srinivasa Chakravarthy³

Part of the book series: Advances in Pattern Recognition ((ACVPR))

739 Accesses
3 Citations

Abstract

We present a complete optical character recognition (OCR) system for Tamil magazines/documents. All the standard elements of OCR process like de-skewing, preprocessing, segmentation, character recognition, and reconstruction are implemented. Experience with OCR problems teaches that for most subtasks of OCR, there is no single technique that gives perfect results for every type of document image. We exploit the ability of neural networks to learn from experience in solving the problems of segmentation and character recognition. Text segmentation of Tamil newsprint poses a new challenge owing to its italic-like font type; problems that arise in recognition of touching and close characters are discussed. Character recognition efficiency varied from 94 to 97% for this type of font. The grouping of blocks into logical units and the determination of reading order within each logical unit helped us in reconstructing automatically the document image in an editable format.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Nagy, G.: Twenty years of document image analysis in PAMI. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(1), 2000, pp. 38–62.
Article Google Scholar
Chaudhuri, B.B. and Pal, U.: An OCR system to read two Indian language scripts: Bangla and Devanagari (Hindi). Proceedings of Intl’ Conference on Document Analysis and Recognition, Ulm, Germany, pp. 1011–1015, 1997.
Google Scholar
Rajasekharan, S.N.S. and Deekshatulu, B.L.: Generation and recognition of printed Telugu characters, Computer Graphics and Image Processing 6, 1977, pp. 335–360.
Article Google Scholar
Bagdanov, A. and Kanai, J.: Projection profile based Skew estimation Algorithm for JBIG compressed images. Proceedings of Intl’ Conference on Document Analysis and Recognition, Ulm, Germany, 1997, pp. 401–405.
Google Scholar
Srihari, S.N. and Govindaraju, V.: Analysis of textual images using the hough transform. Machine Vision and Applications, 2(3), 1989, pp. 141–153.
Article Google Scholar
Pal, U. and Chaudhuri, B.B.: An improved document skew angle estimation technique. Pattern Recognition Letters, 17(8), 1996, pp. 899–904.
Article Google Scholar
Yu, B. and Jain, A.K.: A robust and fast skew detection algorithm for generic documents. Pattern Recognition, 29(10), 1996, pp. 1599–1629.
Article Google Scholar
Hashizume, A., Yeh, P.S. and Rosenfeld, A.: A method of detecting the orientation of aligned components. Pattern Recognition Letters, 4, 1986, pp. 125–132.
Article Google Scholar
O’Gorman, L.: The document spectrum for page layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(11), 1993, pp. 1162–1173.
Article Google Scholar
Chaudhuri, B.B. and Pal, U.: A complete printed Bangla OCR system. Pattern Recognition, 31(5), 1998, pp. 531–549.
Article Google Scholar
Yan, H.: Skew correction of document images using interline cross correlation. CVGIP: Graphical Models and Image Processing, 55(6), 1993, pp. 538–543.
Article Google Scholar
Chen, S. and Haralick, R.M.: An automatic algorithm for Text Skew estimation in document images using recursive morphological transforms. Proceedings of First IEEE International Conference on Image Processing, 1994, pp. 139–143, Austin, Texas.
Google Scholar
Chen, S. and Haralick, R.M.: Recursive erosion, dilation, opening and closing transforms. IEEE Transaction on Image Processing, 4(3),1995, pp. 335–345.
Article Google Scholar
Aghajan, H.K. and Kailath, T.: SLIDE: Subspace-Based line detection. IEEE Trans on Pattern Analysis and Machine Intelligence, 16(11), 1994, pp. 1057–1073.
Article Google Scholar
Ostu, N.: A threshold selection method from gray scale Histograms. IEEE Transactions on Systems Man Cybernet, 8, 1979, pp. 62–66.
Google Scholar
Liu, Y. and Srihari, S.N.: Document image binarization based on texture features. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(5), 1997, pp. 540–544.
Article Google Scholar
Trier, O.D. and Taxt, T.: Evaluation of binarization methods for document images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(3), 1995, pp. 312–315.
Article Google Scholar
Abak, A.T., Baris, U., and. Sankur, B.: The performance evaluation of thresholding algorithms for optical character recognition. Proceedings of 4th International Conference on Document Analysis and Recognition, 1997, pp. 697–700, Ulm, Germany
Google Scholar
Wong, K.J., Casey, R.G., and Wahl, F.M.: Document analysis system. IBM Journal of Research and Development, 26(6) 1982, pp. 647–656.
Article Google Scholar
Wang, D. and Srihari, S.N.: Classification of newspaper image blocks using texture analysis. Computer Vision, Graphics and Image Processing 47, 1989, pp. 327–352.
Article Google Scholar
Nagy, G., Seth, S., and Viswanathan, M.: A prototype document image analysis system for technical journals. IEEE Computer, 25(7), 1992, pp. 10–22.
Google Scholar
Krishnamoorthy, M., Nagy, G., Seth, S., and Viswanathan, M.: Syntactic segmentation and labeling of digitized pages from technical journals. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(7), 1993, pp. 737–747.
Article Google Scholar
Pavlidis, T. and Zhou, J.: Page segmentation and classification. CVGIP 54(6), 1992, pp. 484–496.
Google Scholar
Jain, A. K. and Yu, B.: Document representation and its application to page decomposition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 20(3), 1998, pp. 294–307.
Article Google Scholar
Jain, A.K. and Bhattacharjee, S.: Text segmentation using Gabor filters for automatic document processing. Machine Vision and Applications, 5(3), 1992, pp. 169–184.
Article Google Scholar
Le, D.X., Thoma, G.R., and Wechsler, H.: Classification of binary document images into textual or nontextual data blocks using neural network models. Machine Vision and Applications, 8, 1995, pp. 289–304.
Article Google Scholar
Casey, R.G. and Lecolinet, E.: A survey of methods and strategies in character segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 18(7), 1996, pp. 690–706.
Article Google Scholar
Lu, Y.: Machine printed character recognition – An overview. Pattern Recognition, 28(1), 1995, pp. 67–80.
Article Google Scholar
Tsujimoto, S. and Asada, H.: Resolving ambiguity in segmenting touching characters. Proceedings of First International Conference on Document Analysis and Recognition, 1991, pp. 701–709.
Google Scholar
Hoffman, R.L. and McCullough, J.W.: Segmentation methods for recognition of machine printed characters. IBM Journal of Research and Development, 1971, pp. 153–165.
Google Scholar
Wang, J., and Jean, J.: Segmentation of merged characters by neural networks and shortest path. Pattern recognition, 27(5), 1994, pp. 649–658.
Article Google Scholar
Mori, S., Suen, C.Y. and Yamamoto, K.: Historical review of OCR research and development. Proceedings of IEEE, 80(7), 1992, pp. 1029–1058.
Article Google Scholar
Lee, S.W. and Kim, Y.J.: Direct extraction of topographic features for gray scale character recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 17(7), 1995, pp. 724–729.
Article Google Scholar
Lee, S.W., Lee, D.J., and Park, H.S.: A new methodology for gray-scale character segmentation and recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 18(10), 1996, pp. 1045–1050.
Article Google Scholar
Siromoney, G., Chandrasekaran, R., and Chandrasekaran, M.: Machine recognition of printed Tamil characters. Pattern Recognition, 10, 1978, pp. 243–247.
Article MATH Google Scholar
Sinha, R.M.K., and Mahabala, H.: Machine recognition of Devanagari script. IEEE Transactions on Systems Man Cybernet, SMC-9, 1979, pp. 435–441.
Google Scholar
Sinha, R.M.K.: Rule based contextual post-processing for Devanagiri text recognition. Pattern Recognition, 20, 1987, pp. 475–485.
Article Google Scholar
Tsujimoto, S. and Asada, H.: Major components of a complete text reading system. Proceedings of IEEE, 80(7), 1992, pp. 1133–1149.
Article Google Scholar
Niyogi, D. and Srihari, S.N.: Knowledge-based derivation of document logical structure. International Conference on Document Analysis and Recognition, 1995, pp. 472–475, Montreal, Canada
Chapter Google Scholar
Moody, J.E. and Darken, C.J.: Fast learning in networks of locally tuned processing units. Neural Computation 1, 1989, pp. 281–294.
Article Google Scholar
Haykin, S.: Neural networks: A comprehensive foundation. Prentice Hall, 1999
Google Scholar
Daugman, J.G.: Uncertainty relation for resolution in space, spatial frequency, and orientation optimized by two-dimensional visual cortial filters. Journal of the Optical Society of America A, 2(7), 1985, pp. 1160–1169.
Article Google Scholar
Aparna, H.K.: “Document image analysis: A complete OCR system development for Tamil magazine documents”, M.S. Thesis, Department of Electrical Engineering, Indian Institute of Technology, May, 2003, Madras.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Biotechnology, IIT-Madras, Chennai, 600036, India
Aparna Kokku & Srinivasa Chakravarthy

Authors

Aparna Kokku
View author publications
You can also search for this author in PubMed Google Scholar
Srinivasa Chakravarthy
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Analysis & Recognition (CEDAR), Center of Excellence for Document, Lee Entrance 520, Amherst, 14228, U.S.A.
Venu Govindaraju
Analysis & Recognition (CEDAR), Center of Excellence for Document, Lee Entrance 520, Amherst, 14228, U.S.A.
Srirangaraj (Ranga) Setlur

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Kokku, A., Chakravarthy, S. (2009). A Complete OCR System for Tamil Magazine Documents. In: Govindaraju, V., Setlur, S. (eds) Guide to OCR for Indic Scripts. Advances in Pattern Recognition. Springer, London. https://doi.org/10.1007/978-1-84800-330-9_7

Download citation

DOI: https://doi.org/10.1007/978-1-84800-330-9_7
Published: 28 August 2009
Publisher Name: Springer, London
Print ISBN: 978-1-84800-329-3
Online ISBN: 978-1-84800-330-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics