Structural and Syntactic Techniques for Recognition of Ethiopic Characters
OCR technology of Latin scripts is well advanced in comparison to other scripts. However, the available results from Latin are not always sufficient to directly adopt them for other scripts such as the Ethiopic script. In this paper, we propose a novel approach that uses structural and syntactic techniques for recognition of Ethiopic characters. We reveal that primitive structures and their spatial relationships form a unique set of patterns for each character. The relationships of primitives are represented by a special tree structure, which is also used to generate a pattern. A knowledge base of the alphabet that stores possibly occurring patterns for each character is built. Recognition is then achieved by matching the generated pattern against each pattern in the knowledge base. Structural features are extracted using direction field tensor. Experimental results are reported, and the recognition system is insensitive to variations on font types, sizes and styles.
- 1.Bigun, J.: Vision with Direction: A Systematic Introduction to Image Processing and Vision. Springer, Heidelberg (2006)Google Scholar
- 3.Cowell, J., Hussain, F.: Amharic character recognition using a fast signature based algorithm. In: Proc. Fourth Int’l. Conf. Information Visualization, pp. 384–389 (2003)Google Scholar
- 5.Gerard, A.S.: African Language Literatures: An introduction to the literary history of sub-Saharan Africa. Three Continents Press, Washington (1981)Google Scholar
- 6.Premaratne, L., Assabie, Y., Bigun, J.: Recognition of modification-based scripts using direction tensors. In: ICVGIP 2004, Kolkata, pp. 587–592 (2004)Google Scholar