Abstract
In this paper, SFF (Segmentation Facilitate Feature) technique is proposed to find the junction path to segment touched components based on the seed pixel selected among candidate pixels. Handwritten Recognition system has number of applications like reading postal address, filling forms, reading bank cheques, offering several challenges. In practice, constitute of the word images get touched in handwritten data due to variability in stroke, shortage of space which make the individual character extraction from the word image more complicated. Segmentation of individual in a word image requires a technique that takes care of the variability of writing. This paper proposed the SFF (Segmentation Facilitate Feature) technique to find seed pixel among candidate pixels based on 3-neighbouring pixels. It is used to find junction pixels which form a junction path to segregate the touched component. The junction path is selected to avoid the issues arising due to artifacts or deletion of components features. For experimentation, 1840 legal amount words containing touching components are used. The above number includes 250 words from benchmark database (ICDAR) and 1590 words are gathered from 15 different writers. On implementing, SFF (Segmentation Facilitate Feature) technique on the above mentioned database, 89.9% accuracy is achieved and a higher accuracy level 96.2% is achieved when performed on 1000 words containing two touching consonants.
Similar content being viewed by others
References
Aneja N, Aneja S (2019) Transfer learning using CNN for handwritten devanagari character recognition. 1st IEEE Int. Conf. Adv. Inf. Technol. ICAIT 2019 - Proc., pp. 293–296.
Avola D, Caschera MC, Ferri F, Grifoni P (2010) Classifying and resolving ambiguities in sketch-based interaction. Int J Virtual Technol Multimed 1(2):104
Bansal V, Sinha RMK (2002) Segmentation of touching and fused Devanagari characters. Pattern Recogn 35(4):875–893
Chen M-Y (1994) Off-line handwritten word recognition using a hidden Markov model type stochastic network. IEEE Trans Pattern Anal Mach Intell 16(5):481–496
Choudhary A, Rishi R, Ahlawat S (2013) New character segmentation approach for off-line cursive handwritten words. Procedia Comput Sci 17:88–95
Dhaka VP, Sharma MK (2015) An efficient segmentation technique for Devanagari offline handwritten scripts using the feedforward neural network. Neural Comput Appl 26(8):1881–1893
Gaurav DD, Ramesh R (2012) A feature extraction technique based on character geometry for character recognition, no. eprint 1202.3884, pp. 1–4
Gupta D, Bag S (2019) Handwritten multilingual word segmentation using polygonal approximation of digital curves for Indian languages. Multimed Tools Appl 78(14):19361–19386
Kamble SN, Kamble PM (2011) Morphological approach for segmentation of scanned handwritten devnagari text. 3:99–108
Kapoor S, Verma V (2014) Fragmentation of handwritten touching characters in devanagari script. Int J Inf Technol Model Comput 2(1):11–21
Kaur A, Singh P, Rani S (2015) Segmentation of broken and isolated characters in handwritten Gurumukhi word using neighboring pixel technique. Trans Networks Commun 3(2):37–42
Kumar M, Jindal MK, Sharma RK (2014) Segmentation of isolated and touching characters in offline handwritten Gurmukhi script recognition. Int J Inf Technol Comput Sci 6(2):58–63
Kurniawan F, Rahim MSM, Sholihah N, Rakhmadi A, Mohamad D (2011) Characters segmentation of cursive hand written words based on contour analysis and neural network validation. ITB J Inf Commun Technol 5(1):1–16
Ladwani VM, Malik L (2010) Novel approach to segmentation of handwritten Devnagari word. Proc. - 3rd Int. Conf. Emerg. Trends Eng. Technol. ICETET 2010, pp. 219–224
Lemaitre A, Camillerapp J, Coüasnon B (2011) A perceptive method for handwritten text segmentation. Doc Recognit Retr XVIII 7874:78740C
Louloudis G, Gatos B, Pratikakis I, Halatsis C (2009) Text line and word segmentation of handwritten documents. Pattern Recogn 42(12):3169–3183
Lu Y, Shridhar M (1996) Character segmentation in handwritten words - an overview. Pattern Recogn 29(1):77–96
Mamatha HR, Srikantamurthy K (2012) Morphological operations and projection profiles based segmentation of handwritten Kannada document. Int J Appl Inf Syst 4(5):13–19
Modi N, Jindal K (2013) Text line detection and segmentation in handwritten Gurumukhi scripts. IJ Adv Res Comput Sci Softw Engeenring (2013) 3(5):1075–1080
Naveena C, Manjunath Aradhya VN (2012) Handwritten character segmentation for Kannada scripts. Proc. 2012 World Congr. Inf. Commun. Technol. WICT 2012, pp. 144–149
Pal U, Datta S (2003) Segmentation of Bangla unconstrained handwritten text. in Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, vol. 2003–Janua, no. ICDAR, pp. 1128–1132.
Pal U, Behaid A, Choisy C (2003) Touching numeral segmentation using water reservoir concept. Pattern Recognit Lett 24(jan):261–272
Sarkar R (2010) A script independent technique for extraction of characters from handwritten word images. 1(23):83–88
Thakral B, Kumar M (2014) Devanagari handwritten text segmentation for overlapping and conjunct characters- A proficient technique. Proc. - 2014 3rd Int Conf Reliab Infocom Technol Optim Trends Futur Dir ICRITO 2014, 2015.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Kohli, M., Kumar, S. Segmentation of handwritten words into characters. Multimed Tools Appl 80, 22121–22133 (2021). https://doi.org/10.1007/s11042-021-10638-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-021-10638-0