Advertisement

Character Segmentation for Cursive Handwritten Text Using Ligature Classification and Transition Feature

  • S. Gomathi Rohini
  • R. S. Uma Devi
  • S. Mohanavel
Conference paper
Part of the Lecture Notes in Electrical Engineering book series (LNEE, volume 221)

Abstract

Splitting of touching characters in cursive handwritten text is a critical task in segmentation process. A perfect segmentation of character is required to reduce the error rate of recognition. This paper proposes an approach to segment touching/overlapping and shadow characters in the handwritten text using ligature classification. It falls under the category of dissection method, but does not over segment ‘m’, ‘n’ and ‘u’, where the existing methods do. Binarization is the pre processing step for segmentation, which is performed by global or local thresholding. Sauvola’s method of threshold calculation is employed in this approach to binarize the gray scale image. The skew of the image is corrected by MATLAB code. Statistical analysis of ligature is done, in order to classify the inter-letter links and intra-letter links for evaluating the segmentation points. The Possible Segmentation Points (PSP) is generated based on the transition feature, followed by removal of invalid PSP by incorporating ligature extraction. The integration of transition feature in dissection method avoids unnecessary segmentation points without any attempt of classification and consequently reduces computational cost. A benchmark database IAM is used for fair comparison. The paper exhibits many examples with challenging and normal cases. The experimental results show that the proposed method achieves the segmentation rate of 92 %.

Keywords

Ligature detection and classification Core detection Stroke height analysis Inter-letter links Intra-letter links 

References

  1. 1.
    Liu Z, Cai J, Buse R (2003) Handwritten recognition, soft computing and probabilistic approaches. Springer, BerlinGoogle Scholar
  2. 2.
    Madhavanath S, Govindaraju V (2001) The role of holistic paradigms in handwritten word recognition. IEEE Trans Pattern Anal and Mach Intell 23(2):1433–1446Google Scholar
  3. 3.
    Casey RG, Lecolinet E (1996) A survey of methods and strategies in character segmentation. IEEE Trans Pattern Anal Mach Intell 18(7):609–706CrossRefGoogle Scholar
  4. 4.
    Dzulkifli BM, Fajri K Off-line character segmentation technique for handwritten cursive word—A surveyGoogle Scholar
  5. 5.
    Ribas FC, Oliverira LS, Britto AS, Jr Sabourin R (2012) Handwritten digit segmentation: A comparative study. IJDAR. doi:  10.1007/s10032-012-0185-9
  6. 6.
    Xu L, Yin F, Liu C-L (2010) Touching character splitting of Chinese handwriting using contour analysis and DTW. IEEE 978-1-4244-7210-9/10Google Scholar
  7. 7.
    Louloudis G, Tamatopolous N, Gatos B (2009) A novel two stage evaluation methodology for word segmentation techniques. In: Proceedings of the 10th international conference on document analysis and recognition IEEE. doi: 10.1109
  8. 8.
    Makridis M, Nikolaou N, Gatos B (2007) An efficient word segmentation technique for historical and regarded machine-printed documents. In: 9th international conference on document analysis and recognition IEEEGoogle Scholar
  9. 9.
    Markus F, Klaus D (2003) Word segmentation of handwritten dates in historical documents by combining semantic a priori-knowledge with local features. In: Proceedings of the 7th international conference on document analysis and recognition IEEEGoogle Scholar
  10. 10.
    Lecolinet E, Jean-Pierre C A grapheme-based segmentation technique for cursive script recognitionGoogle Scholar
  11. 11.
    Kang L, David D Local segmentation of touching character using contour based shape decompositionGoogle Scholar
  12. 12.
    Ghazali S, Amjad R, Tanzila S (2010) Improved offline connected script recognition based on hybrid strategy. Int Eng Sci Technol 2(6):1603–1611Google Scholar
  13. 13.
    Tripathy N, Pal U (2006) Handwriting segmentation of unconstrained oriya text. Sadhana 31(6):755–769CrossRefGoogle Scholar
  14. 14.
    Nicchiotti G, Scagliola C, Rimassa S (2000) A simple and effective cursive word segmentation method. In: Proceedings of the 7th international workshop on frontiers in handwriting recognition, pp 499–504Google Scholar
  15. 15.
    Sauvola J, Pietikainen M (2000) Adaptive document image binarization. Pattern Recogn 33:225–236CrossRefGoogle Scholar

Copyright information

© Springer India 2013

Authors and Affiliations

  • S. Gomathi Rohini
    • 1
  • R. S. Uma Devi
    • 2
  • S. Mohanavel
    • 3
  1. 1.Sri Ramakrishna Engineering CollegeCoimbatoreIndia
  2. 2.GR Govindarajulu School of Applied Computer TechnologyCoimbatoreIndia
  3. 3.Dr. N.G.P. Institute of TechnologyCoimbatoreIndia

Personalised recommendations