Skip to main content
Log in

A contour distance-based approach for multi-oriented and multi-sized character recognition

  • Published:
Sadhana Aims and scope Submit manuscript

Abstract

In this paper, we propose a novel scheme towards the recognition of multi-oriented and multi-sized isolated characters of printed script. For recognition, at first, distances of the outer contour points from the centroid of the individual characters are calculated and these contour distances are then arranged in a particular order to get size and rotation invariant feature. Next, based on the arranged contour distances, the features are derived from different class of characters. Finally, we use these derived features of the characters to statistically compare the features of the input character for recognition. We have tested our scheme on printed Bangla and Devnagari multi-oriented characters and we obtained encouraging results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Adam S, Ogier JM, Carlon C, Mullot R, Labiche J, Gardes J 2000 Symbol and Character recognition: application to engineering drawing. Int. Journal of Document Analysis and Recognition 3: 89–101

    Article  Google Scholar 

  • Chaudhuri B B, Pal U 1998 A complete printed Bangla OCR system. Pattern Recognition 31: 531–549

    Article  Google Scholar 

  • Dhandra B V, Nagabhushan P, Hangarge M, Hegadi R, Malemath V S 2006 Script Identification Based on Morphological Reconstruction in Document Images. In Proc. International Conf. on Pattern Recognition 950–953

  • Hase H, Shinokawa T, Yoneda M, Suen C Y 2003 Recognition of Rotated Characters by Eigen-space. In Proc. 7th International Conference on Document Analysis and Recognition 731–735

  • Hase H, Yoneda M, Shinokawa T, Suen C Y 2001 Alignment of Free layout colour texts for character recognition. In Proc. 6th Int. Conference on Document Analysis and Recognition 932–936

  • Lehal G S, Singh C 2000 A Gurumukhi Script Recognition System. In Proc. International Conf. on Pattern Recognition 2557–2560

  • Loo P K, Tan C L 2002 Word and sentence extraction using irregular pyramid. In Proc. 5th International Workshop on Document Analysis Systems 307–318

  • Lu Y, Wang Z, Tan C L 2004 Word Grouping in Document Images Based on Voronoi Tessellation. In Proc. 6th International Workshop on Document Analysis Systems 147–157

  • Monwar M, Haque W, Paul P P 2007 A New Approach For Rotation Invariant Optical Character Recognition Using Eigendigit. In Proc. Canadian Conference on Electrical and Computer Engineering 1317–1320

  • Negi A, Chakravarthy B, Krishna B 2001 An OCR System for Telugu. In Proc. 6th Int. Conference on Document Analysis and Recognition 1110–1114

  • Pal U, Chaudhuri B B 2004 Indian Script Character Recognition: A Survey. Pattern Recognition 37: 1887–1899

    Article  Google Scholar 

  • Pal U, Roy P P 2004 Multi-oriented and curved text lines extraction from Indian documents. IEEE Trans. on Systems, Man and Cybernetics-Part B 34: 1676–1684

    Article  Google Scholar 

  • Pal U, Kimura F, Roy K and Pal T 2006 Recognition of English Multi-oriented Characters. In Proc. International Conf. on Pattern Recognition 873–876

  • Roy K, Pal U, Chaudhuri B B 2004 A System for Joining and Recognition of Broken Bangla Numerals for Indian Postal Automation. In Proc. of 4th Indian Conference on Computer Vision, Graphics and Image Processing 641–646

  • Sato S, Miyake S, Aso H 2000 Evaluation of Two Neocognitron-type Models for recognition of rotated patterns. In Proc. ICONIP 295–299

  • Tang Y Y, Cheng H D, Suen C Y 1991 Translation-ring-projection (TRP) algorithm and its VLSI Implementations. Character and Handwriting Recognition Ed. Wang P S P World scientific, Singapore, 25–56

    Google Scholar 

  • Uchida S, Iwamura M, Omachi S, Kise K 2006 OCR Fonts Revisited for Camera-Based Character Recognition. In Proc. International Conf. on Pattern Recognition 1134–1137

  • Uchida S, Sakai M, Iwamura M, Omachi S, Kise K 2007 Extraction of Embedded Class Information from Universal Character Pattern. In Proc. 9th International Conference on Document Analysis and Recognition 437–441

  • Xie Q, Kobayashi A 1991 A construction of pattern recognition system invariant of translation, scalechange and rotation transformation of pattern. Trans. of the Society of Instrument and Control Engineers 27: 1167–1174

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to U. Pal.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pal, U., Tripathy, N. A contour distance-based approach for multi-oriented and multi-sized character recognition. Sadhana 34, 755–765 (2009). https://doi.org/10.1007/s12046-009-0044-7

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12046-009-0044-7

Keywords

Navigation