Abstract
In this paper, we propose a novel scheme for recognizing multi-oriented and multi-sized isolated characters of printed script. First, the distances of the outer contour points from the centroid of each character are calculated, and these contour distances are arranged in a particular order to obtain a size- and rotation-invariant feature. Next, based on the arranged contour distances, features are derived for the different classes of characters. Finally, the features of an input character are statistically compared with these derived class features for recognition. We tested the scheme on printed Bangla and Devnagari multi-oriented characters and obtained encouraging results.
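The contour-distance idea can be illustrated with a minimal sketch. The abstract does not say which ordering of the distances is used, so everything specific here is an assumption made for illustration: the function name `contour_distance_feature`, the `n_bins` parameter, the centroid taken as the mean of the contour points, normalisation by the largest distance (for size invariance), and sorting in descending order (for rotation invariance). It should not be read as the authors' exact method.

```python
import math


def contour_distance_feature(contour, n_bins=16):
    """Return a fixed-length feature from outer contour points.

    contour: list of (x, y) points on the character's outer contour,
             assumed to have been extracted beforehand.
    n_bins:  hypothetical parameter, the length of the output feature.
    """
    if not contour:
        raise ValueError("contour must contain at least one point")

    # Centroid, taken here as the mean of the contour points (assumption;
    # it could equally be computed over all character pixels).
    cx = sum(x for x, _ in contour) / len(contour)
    cy = sum(y for _, y in contour) / len(contour)

    # Distance of every contour point from the centroid.
    dists = [math.hypot(x - cx, y - cy) for x, y in contour]

    # Size invariance (assumed): divide by the largest distance.
    d_max = max(dists)
    if d_max == 0:
        raise ValueError("degenerate contour: all points coincide")
    dists = [d / d_max for d in dists]

    # Rotation invariance (assumed ordering): sort in descending order,
    # so the result does not depend on where the contour trace started.
    dists.sort(reverse=True)

    # Reduce to a fixed length by averaging consecutive chunks.
    n = len(dists)
    feature = []
    for b in range(n_bins):
        lo = b * n // n_bins
        hi = max((b + 1) * n // n_bins, lo + 1)
        chunk = dists[lo:hi]
        feature.append(sum(chunk) / len(chunk))
    return feature
```

Under these assumptions, a character and a rotated, rescaled copy of it give the same feature up to floating-point error, and features of different lengths of contour become directly comparable. Sorting discards the sequence of the contour points, so the ordering actually used in the paper may retain more shape information than this sketch does.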
Cite this article
Pal, U., Tripathy, N. A contour distance-based approach for multi-oriented and multi-sized character recognition. Sadhana 34, 755–765 (2009). https://doi.org/10.1007/s12046-009-0044-7