Abstract
Camera-captured, warped document images usually contain curled text-lines because of distortions caused by camera perspective view and page curl. Warped document images can be transformed into planar document images for improving optical character recognition accuracy and human readability using monocular dewarping techniques. Curled text-lines segmentation is a crucial initial step for most of the monocular dewarping techniques. Existing curled text-line segmentation approaches are sensitive to geometric and perspective distortions. In this paper, we introduce a novel curled text-line segmentation algorithm by adapting active contour (snake). Our algorithm performs text-line segmentation by estimating pairs of x-line and baseline. It estimates a local pair of x-line and baseline on each connected component by jointly tracing top and bottom points of neighboring connected components, and finally each group of overlapping pairs is considered as a segmented text-line. Our algorithm has achieved curled text-line segmentation accuracy of above 95% on the DFKI-I (CBDAR 2007 dewarping contest) dataset, which is significantly better than previously reported results on this dataset.
Similar content being viewed by others
References
Bukhari S.S., Shafait F., Breuel T.M.: Adaptive binarization of unconstrained hand-held camera-captured document images. J. Univers. Comput. Sci. 15(18), 3343–3363 (2009)
Shafait F., van Beusekom J., Keysers D., Breuel T.: Document cleanup using page frame detection. Int. J. Doc. Anal. Recognit. 11, 81–96 (2008)
Kao, C.-H., Don, H.-S.: Skew detection of document images using line structural information. In: International Conference on Information Technology and Applications, vol. 1, pp. 704–715. Los Alamitos (2005)
Arvind, K., Kumar, J., Ramakrishnan A.: Entropy based skew correction of document images. In: Pattern Recognition and Machine Intelligence, vol. 4815 of Lecture Notes in Computer Science, pp. 495–502 (2007)
van Beusekom J., Shafait F., Breuel T.M.: Combined orientation and skew detection using geometric text-line modeling. Int. J. Doc. Anal. Recognit. 13(2), 79–92 (2010)
O’Gorman L.: The document spectrum for page layout analysis. IEEE Trans. Pattern Anal. Mach. Intell. 15(11), 1162–1173 (1993)
Marti, U.-V., Bunke, H.: Text line segmentation and word recognition in a system for general writer independent handwriting recognition, In: Proceedings of the 6th International Conference on Document Analysis and Recognition, pp. 159–163. Los Alamitos (2001)
Bukhari, S.S., Shafait, F., Breuel, T.M.: Dewarping of document images using coupled-snakes. In: Proceedings of Third International Workshop on Camera-Based Document Analysis and Recognition, pp. 34–41. Barcelona (2009)
Tan C.L., Zhang L., Zhang Z., Xia T.: Restoring warped document images through 3D shape modeling. IEEE Trans. Pattern Anal. Mach. Intell. 28(2), 195–208 (2006)
Shafait F., Keysers D., Breuel T.M.: Performance evaluation and benchmarking of six page segmentation algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 30(6), 941–954 (2008)
Glauberman M.H.: Character recognition for business machines. Electronics 29, 132–136 (1956)
Nagy G., Seth S., Viswanathan M.: A prototype document image analysis system for technical journals. Computer 25(7), 10–22 (1992)
Fletcher L.A., Kasturi R.: A robust algorithm for text string separation from mixed text/graphics images. IEEE Trans. Pattern Anal. Mach. Intell. 10(6), 910–918 (1988)
Wong K.Y., Casey R.G., Wahl F.M.: Document analysis system. IBM J. Res. Dev. 26(6), 647–656 (1982)
Breuel, T.M.: Robust least square baseline finding using a branch and bound algorithm. In: Proceedings SPIE Document Recognition and Retrieval IX, pp. 20–27. San Jose (2002)
Liang J., Doermann D., Li H.: Camera-based analysis of text and documents: a survey. Int. J. Doc. Anal. Recognit. 7, 84–104 (2005)
Zhang, Z., Tan, C.L.: Recovery of distorted document images from bound volumes. In: Proceedings of the International Conference on Document Analysis and Recognition, pp. 429–433. (2001)
Lu, S., Tan, C.L.: The restoration of camera documents through image segmentation. In: 7th IAPR Workshop on Document Analysis Systems, pp. 484–495 (2006)
Gatos, B., Pratikakis, I., Ntirogiannis, K.: Segmentation based recovery of arbitrarily warped document images. In: Proceedings 9th International Conference on Document Analysis and Recognition, pp. 989–993. Curitiba (2007)
Fu, B., Wu, M., Li, R., Li, W., Xu, Z.: A model-based dewarping method using text line detection. In: 2nd International Workshop on Camera Based Document Analysis and Recognition (2007)
Shafait, F., Breuel, T.M.: Document image dewarping contest. In: 2nd International Workshop on Camera-Based Document Analysis and Recognition, Curitiba (2007)
Oliveira, D.M., Lins, R.D., Torreao G., Fan J., Thielo M.: A new method for text-line segmentation for warped document. In: Proceedings of International Conference on Image Analysis and Recognition, pp. 398–408. Povoa de Varzim (2010)
Liang J., DeMenthon D., Doermann D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30, 591–605 (2008)
Goto H., Aso H.: Extracting curved text lines using local linearity of the text line. Int. J. Doc. Anal. Recognit. 2, 111–119 (1999)
Loo, P.K., Tan, C.L.: Word and sentence extraction using irregular pyramid. In: Document Analysis Systems V, vol. 2423 of Lecture Notes in Computer Science, pp. 307–318. Springer, Berlin (2002)
Bukhari, S.S., Shafait, F., Breuel, T.M.: Segmentation of curled textlines using active contours. In: Proceedings of the 8th IAPR Workshop on Document Analysis Systems, pp. 270–277. Nara (2008)
Bukhari, S.S., Shafait, F., Breuel, T.M.: Ridges based curled textline region detection from grayscale camera-captured document images. In: Proceedings of the 13th International Conference on Computer Analysis of Images and Patterns, vol. 5702/2009 of Lecture Notes in Computer Science, pp. 173–180. Muenster (2009)
Bukhari, S.S., Shafait, F., Breuel, T.M.: Curled textline information extraction from grayscale camera-captured document images, In: Proceedings of the 13th International Conference on Image Processing, Cairo (2009)
Kass M., Witkin A., Terzopoulos D.: Snakes: active contour models. Int. J. Comput. Vis. 1(4), 1162–1173 (1988)
Bukhari, S.S., Shafait, F. Breuel, T.M.: Coupled snakelet model for curled textline segmentation of camera-captured document images. In: Proceedings of the 10th International Conference on Document Analysis and Recognition, pp. 61–65. Barcelona (2009)
Strouthopoulos C., Papamarkos N., Chamzas C.: Identification of text-only areas in mixed-type documents. Eng. Appl. Artif. Intell. 10(4), 387–401 (1997)
Hsieh C.T., Lai E., Wang Y.C.: An effective algorithm for fingerprint image enhancement based on wavelet transform. Pattern Recognit. 36(2), 302–312 (2003)
Xu, C., Prince, J.L.: Snakes, shapes, and gradient vector flow. In: IEEE Transaction of Image Processing, pp. 359–369 (1998)
Gunn S.R., Nixon M.S.: A robust snake implementation; a dual active contour. IEEE Trans. Pattern Anal. Mach. Intell 19(1), 63–68 (1997)
Hohnhaeuser B., Hommel G.: 3D pose estimation using coupled snakes. J. WSCG 12(1–3), 1213–6972 (2003)
Wang, Q., Ronneberger, O., Schulze, E., Baumeister, R., Burkhardt, H.: Using lateral coupled snakes for modeling the contours of worms. In: Pattern Recognition Lecture Notes in Computer Science, vol. 5748, pp. 542–551. (2009)
Ulges, A., Lampert, C., Breuel, T.: Document image dewarping using robust estimation of curled text lines. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 1001–1005 (2005)
Breuel, T.M.: Segmentation of handprinted letter strings using a dynamic programming algorithm. In: Proceedings of the Sixth International Conference on Document Analysis and Recognition pp. 821–826 (2001)
Shafait, F., Breuel, T.M.: Document image dewarping contest. In: 2nd International Workshop on Camera-Based Document Analysis and Recognition, pp. 181–188. Curitiba (2007)
Stamatopoulos, N., Gatos, B., Kesidis, A.: Automatic borders detection of camera document images. In: Proceedings of Second International Workshop on Camera-Based Document Analysis and Recognition, pp. 71–78. Curitiba (2007)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bukhari, S.S., Shafait, F. & Breuel, T.M. Coupled snakelets for curled text-line segmentation from warped document images. IJDAR 16, 33–53 (2013). https://doi.org/10.1007/s10032-011-0176-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-011-0176-2