Abstract
During the scanning of bound documents, some part of the document image is curled near the corners or near the binding resulting in bending of text lines. This hard to tackle distortion makes recognition very difficult. A method has been proposed for estimation and removal of line bending deformations introduced in document images during the process of scanning. The estimation of bend involves determining the side of the document on which curl is present and direction of the bend. The method has been tested on varieties of printed document images of Gurmukhi containing the bent text-lines at page borders. The method consists of three stages. In the first stage, a decision methodology is proposed to locate the site of deformation and the direction of deformation. An elliptical approximation model is derived to estimate the amount of deformation in the second stage. Finally, a transformation process brings out the correction. Experiments show that the method developed works well under conditions where pixel distribution is uniform.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Vasudev, T., Hemanthakumar, G., Nagabhushan, P.: An Elliptical Approximation Model for Removal of Text-line Bending Deformations at Page Borders in a Document Image. In: Proceedings of the International Conference on Cognition and Recognition, pp. 645–654 (September 2005)
Gatos, B., Ntirogiannis, K.: Restoration of arbitrarily warped document images based on text line and word detection. In: Proceedings of Fourth IASTED Int. Conf. on Signal Processing, Pattern Recognition, and Applications, pp. 203–208 (February 2007)
Yin, X.-C., Sun, J., Naoi, S.: Perspective rectification for mobile phone camera-based documents using a hybrid approach to vanishing point detection. In: Proceedings of the Second International Workshop on Camera Based Document Analysis and Rectification, pp. 37–44 (2007)
Fu, B., Wu, M., Li, R., Li, W., Xu, Z., Yang, C.: A Model based Book Dewarping Method Using Text Line Detection. In: Proceedings of 2nd International Workshop on Camera Based Document Analysis and Recognition, pp. 63–70 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sharma, D.V., Wadhwa, S. (2011). Dewarping Machine Printed Documents of Gurmukhi Script. In: Singh, C., Singh Lehal, G., Sengupta, J., Sharma, D.V., Goyal, V. (eds) Information Systems for Indian Languages. ICISIL 2011. Communications in Computer and Information Science, vol 139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19403-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-642-19403-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19402-3
Online ISBN: 978-3-642-19403-0
eBook Packages: Computer ScienceComputer Science (R0)