Abstract
Text line segmentation in offline handwritten documents remains a challenge because the offline handwritten text lines are often inconsistency curved and skewed. More serious is the space between lines is not enough to distinguish them. In this paper, we propose a novel offline handwritten text line segmentation method by writing pheromone diffusion and convergence. According to the principle of gravity, we apply it to the lines location of the offline handwritten texts, the pheromone diffusion and convergence can learn to generate the pheromone matrix for extracting the key locations and fragments of the text line, that is made robust to deal with various offline handwritten documents with curved and multi-skewed text lines. In experiments on a commonly used database with offline handwritten text images, our method can significantly improve upon state-of-the-art text line segmentation methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ryu, J., Koo, H.I., Cho, N.I.: Language-independent text-line extraction algorithm for handwritten documents. IEEE Signal Process. Lett. 21(9), 1115–1119 (2014)
Renton, G., Soullard, Y., Chatelain, C., Adam, S., Kermorvant, C., Paquet, T.: Fully convolutional network with dilated convolutions for handwritten text line segmentation. Int. J. Doc. Anal. Recogn. (IJDAR) 21(3), 177–186 (2018). https://doi.org/10.1007/s10032-018-0304-3
Sindhushree, G.S., Amarnath, R., Nagabhushan, P.: Entropy-based approach for enabling text line segmentation in handwritten documents. In: Nagabhushan, P., Guru, D.S., Shekar, B.H., Kumar, Y.H.S. (eds.) Data Analytics and Learning. LNNS, vol. 43, pp. 169–184. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-2514-4_15
Arivazhagan, M., Srinivasan, H., Srihari, S.: A statistical approach to line segmentation in handwritten documents. In: International Society for Optics and Photonics in Document Recognition and Retrieval, vol. 65000, pp. 1–11 (2007)
Yin, F., Liu, C.-L.: Handwritten Chinese text line segmentation by clustering with distance metric learning. Pattern Recogn. 42(12), 3146–3157 (2009)
Deshmukh, M.S., Patil, M.P., Kolhe, S.R.: A hybrid text line segmentation approach for the ancient handwritten unconstrained freestyle modi script documents. Imaging Sci. J. 66(7), 433–442 (2018)
Pak, I., Teh, P.L.: Text segmentation techniques: a critical review. In: Zelinka, I., Vasant, P., Duy, V.H., Dao, T.T. (eds.) Innovative Computing, Optimization and Its Applications. SCI, vol. 741, pp. 167–181. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-66984-7_10
Nagy, G., Seth, S., Viswanathan, M.: A prototype document image analysis system for technical journals. Computer 25(7), 10–22 (1992)
Su, T.-H., Zhang, T.-W., Huang, H.-J., Zhou, Y.: Skew detection for Chinese handwriting by horizontal stroke histogram. In: Ninth International Conference on Document Analysis and Recognition, vol. 2, pp. 899–903. IEEE (2007)
Koo, H.I., Cho, N.I.: Text-line extraction in handwritten Chinese documents based on an energy minimization framework. IEEE Trans. Image Process. 21(3), 1169–1175 (2012)
Zhang, X., Tan, C.L.: Text line segmentation for handwritten documents using constrained seam carving. In: International Conference on Frontiers in Handwriting Recognition, pp. 98–103. IEEE (2014)
Vo, Q.N., Kim, S.H., Yang, H.J., Lee, G.S.: Text line segmentation using a fully convolutional network in handwritten document images. IET Image Proc. 12(3), 438–446 (2017)
Shi, Z., Setlur, S., Govindaraju, V.: Text extraction from gray scale historical document images using adaptive local connectivity map. In: Eighth International Conference on Document Analysis and Recognition, pp. 794–798. IEEE (2005)
Nguyen, T.D., Lee, G.: Text line segmentation in handwritten document images using tensor voting. Trans. Fund. Electron. Commun. Comput. Sci. 94(11), 2434–2441 (2011)
Shi, Z., Setlur, S., Govindaraju, V.: A steerable directional local profile technique for extraction of handwritten Arabic text lines. In: International Conference on Document Analysis and Recognition, pp. 176–180. IEEE (2009)
Zezhong, X., Shin, B.-S., Klette, R.: Closed form line-segment extraction using the hough transform. Pattern Recogn. 48(12), 4012–4023 (2015)
Boukharouba, A.: A new algorithm for skew correction and baseline detection based on the randomized hough transform. J. King Saud Univ. Comput. Inf. Sci. 29(1), 29–38 (2017)
Zhang, L., Weidong, Yu.: Orientation image analysis of electrospun submicro-fibers based on hough transform and regionprops function. Text. Res. J. 87(18), 2263–2274 (2017)
Guo, Y., Sun, Y., Bauer, P., Allebach, J.P., Bouman, C.A.: Text line detection based on cost optimized local text line direction estimation. In: The International Society for Optical Engineering, vol. 9395, pp. 1–7 (2015)
Adiguzel, H., Sahin, E., Duygulu, P.: A hybrid for line segmentation in handwritten documents. In: International Conference on Frontiers in Handwriting Recognition, pp. 503–508 (2012)
Ali, A.A.A., Suresha, M.: Efficient algorithms for text lines and words segmentation for recognition of Arabic handwritten script. In: Shetty, N.R., Patnaik, L.M., Nagaraj, H.C., Hamsavath, P.N., Nalini, N. (eds.) Emerging Research in Computing, Information, Communication and Applications. AISC, vol. 882, pp. 387–401. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-5953-8_32
Motulsky, H., Christopoulos, A.: Fitting Models to Biological Data Using Linear and Nonlinear Regression: A Practical Guide to Curve Fitting. Oxford University Press, Oxford (2004)
Su, T.: Chinese Handwriting Recognition: An Algorithmic Perspective. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-31812-2
Acknowledgement
This work is sponsored by the National Natural Science Fund of China (61976118, 61806098), Jiangsu Province Natural Science Foundation (BK20180142), Jiangsu Province Natural Science Foundation for Colleges and Universities (17KJB520020, 18KJB520029).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, Y., Xiao, W. (2020). Handwritten Text Line Segmentation Method by Writing Pheromone Diffusion and Convergence. In: Shen, J., Chang, YC., Su, YS., Ogata, H. (eds) Cognitive Cities. IC3 2019. Communications in Computer and Information Science, vol 1227. Springer, Singapore. https://doi.org/10.1007/978-981-15-6113-9_12
Download citation
DOI: https://doi.org/10.1007/978-981-15-6113-9_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6112-2
Online ISBN: 978-981-15-6113-9
eBook Packages: Computer ScienceComputer Science (R0)