Skip to main content

Advertisement

Log in

Coupled snakelets for curled text-line segmentation from warped document images

  • Original Paper
  • Published:
International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Abstract

Camera-captured, warped document images usually contain curled text-lines because of distortions caused by camera perspective view and page curl. Warped document images can be transformed into planar document images for improving optical character recognition accuracy and human readability using monocular dewarping techniques. Curled text-lines segmentation is a crucial initial step for most of the monocular dewarping techniques. Existing curled text-line segmentation approaches are sensitive to geometric and perspective distortions. In this paper, we introduce a novel curled text-line segmentation algorithm by adapting active contour (snake). Our algorithm performs text-line segmentation by estimating pairs of x-line and baseline. It estimates a local pair of x-line and baseline on each connected component by jointly tracing top and bottom points of neighboring connected components, and finally each group of overlapping pairs is considered as a segmented text-line. Our algorithm has achieved curled text-line segmentation accuracy of above 95% on the DFKI-I (CBDAR 2007 dewarping contest) dataset, which is significantly better than previously reported results on this dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bukhari S.S., Shafait F., Breuel T.M.: Adaptive binarization of unconstrained hand-held camera-captured document images. J. Univers. Comput. Sci. 15(18), 3343–3363 (2009)

    Google Scholar 

  2. Shafait F., van Beusekom J., Keysers D., Breuel T.: Document cleanup using page frame detection. Int. J. Doc. Anal. Recognit. 11, 81–96 (2008)

    Article  Google Scholar 

  3. Kao, C.-H., Don, H.-S.: Skew detection of document images using line structural information. In: International Conference on Information Technology and Applications, vol. 1, pp. 704–715. Los Alamitos (2005)

  4. Arvind, K., Kumar, J., Ramakrishnan A.: Entropy based skew correction of document images. In: Pattern Recognition and Machine Intelligence, vol. 4815 of Lecture Notes in Computer Science, pp. 495–502 (2007)

  5. van Beusekom J., Shafait F., Breuel T.M.: Combined orientation and skew detection using geometric text-line modeling. Int. J. Doc. Anal. Recognit. 13(2), 79–92 (2010)

    Article  Google Scholar 

  6. O’Gorman L.: The document spectrum for page layout analysis. IEEE Trans. Pattern Anal. Mach. Intell. 15(11), 1162–1173 (1993)

    Article  Google Scholar 

  7. Marti, U.-V., Bunke, H.: Text line segmentation and word recognition in a system for general writer independent handwriting recognition, In: Proceedings of the 6th International Conference on Document Analysis and Recognition, pp. 159–163. Los Alamitos (2001)

  8. Bukhari, S.S., Shafait, F., Breuel, T.M.: Dewarping of document images using coupled-snakes. In: Proceedings of Third International Workshop on Camera-Based Document Analysis and Recognition, pp. 34–41. Barcelona (2009)

  9. Tan C.L., Zhang L., Zhang Z., Xia T.: Restoring warped document images through 3D shape modeling. IEEE Trans. Pattern Anal. Mach. Intell. 28(2), 195–208 (2006)

    Article  Google Scholar 

  10. Shafait F., Keysers D., Breuel T.M.: Performance evaluation and benchmarking of six page segmentation algorithms. IEEE Trans. Pattern Anal. Mach. Intell. 30(6), 941–954 (2008)

    Article  Google Scholar 

  11. Glauberman M.H.: Character recognition for business machines. Electronics 29, 132–136 (1956)

    Google Scholar 

  12. Nagy G., Seth S., Viswanathan M.: A prototype document image analysis system for technical journals. Computer 25(7), 10–22 (1992)

    Article  Google Scholar 

  13. Fletcher L.A., Kasturi R.: A robust algorithm for text string separation from mixed text/graphics images. IEEE Trans. Pattern Anal. Mach. Intell. 10(6), 910–918 (1988)

    Article  Google Scholar 

  14. Wong K.Y., Casey R.G., Wahl F.M.: Document analysis system. IBM J. Res. Dev. 26(6), 647–656 (1982)

    Article  Google Scholar 

  15. Breuel, T.M.: Robust least square baseline finding using a branch and bound algorithm. In: Proceedings SPIE Document Recognition and Retrieval IX, pp. 20–27. San Jose (2002)

  16. Liang J., Doermann D., Li H.: Camera-based analysis of text and documents: a survey. Int. J. Doc. Anal. Recognit. 7, 84–104 (2005)

    Article  Google Scholar 

  17. Zhang, Z., Tan, C.L.: Recovery of distorted document images from bound volumes. In: Proceedings of the International Conference on Document Analysis and Recognition, pp. 429–433. (2001)

  18. Lu, S., Tan, C.L.: The restoration of camera documents through image segmentation. In: 7th IAPR Workshop on Document Analysis Systems, pp. 484–495 (2006)

  19. Gatos, B., Pratikakis, I., Ntirogiannis, K.: Segmentation based recovery of arbitrarily warped document images. In: Proceedings 9th International Conference on Document Analysis and Recognition, pp. 989–993. Curitiba (2007)

  20. Fu, B., Wu, M., Li, R., Li, W., Xu, Z.: A model-based dewarping method using text line detection. In: 2nd International Workshop on Camera Based Document Analysis and Recognition (2007)

  21. Shafait, F., Breuel, T.M.: Document image dewarping contest. In: 2nd International Workshop on Camera-Based Document Analysis and Recognition, Curitiba (2007)

  22. Oliveira, D.M., Lins, R.D., Torreao G., Fan J., Thielo M.: A new method for text-line segmentation for warped document. In: Proceedings of International Conference on Image Analysis and Recognition, pp. 398–408. Povoa de Varzim (2010)

  23. Liang J., DeMenthon D., Doermann D.: Geometric rectification of camera-captured document images. IEEE Trans. Pattern Anal. Mach. Intell. 30, 591–605 (2008)

    Article  Google Scholar 

  24. Goto H., Aso H.: Extracting curved text lines using local linearity of the text line. Int. J. Doc. Anal. Recognit. 2, 111–119 (1999)

    Article  Google Scholar 

  25. Loo, P.K., Tan, C.L.: Word and sentence extraction using irregular pyramid. In: Document Analysis Systems V, vol. 2423 of Lecture Notes in Computer Science, pp. 307–318. Springer, Berlin (2002)

  26. Bukhari, S.S., Shafait, F., Breuel, T.M.: Segmentation of curled textlines using active contours. In: Proceedings of the 8th IAPR Workshop on Document Analysis Systems, pp. 270–277. Nara (2008)

  27. Bukhari, S.S., Shafait, F., Breuel, T.M.: Ridges based curled textline region detection from grayscale camera-captured document images. In: Proceedings of the 13th International Conference on Computer Analysis of Images and Patterns, vol. 5702/2009 of Lecture Notes in Computer Science, pp. 173–180. Muenster (2009)

  28. Bukhari, S.S., Shafait, F., Breuel, T.M.: Curled textline information extraction from grayscale camera-captured document images, In: Proceedings of the 13th International Conference on Image Processing, Cairo (2009)

  29. Kass M., Witkin A., Terzopoulos D.: Snakes: active contour models. Int. J. Comput. Vis. 1(4), 1162–1173 (1988)

    Google Scholar 

  30. Bukhari, S.S., Shafait, F. Breuel, T.M.: Coupled snakelet model for curled textline segmentation of camera-captured document images. In: Proceedings of the 10th International Conference on Document Analysis and Recognition, pp. 61–65. Barcelona (2009)

  31. Strouthopoulos C., Papamarkos N., Chamzas C.: Identification of text-only areas in mixed-type documents. Eng. Appl. Artif. Intell. 10(4), 387–401 (1997)

    Article  Google Scholar 

  32. Hsieh C.T., Lai E., Wang Y.C.: An effective algorithm for fingerprint image enhancement based on wavelet transform. Pattern Recognit. 36(2), 302–312 (2003)

    Article  Google Scholar 

  33. Xu, C., Prince, J.L.: Snakes, shapes, and gradient vector flow. In: IEEE Transaction of Image Processing, pp. 359–369 (1998)

  34. Gunn S.R., Nixon M.S.: A robust snake implementation; a dual active contour. IEEE Trans. Pattern Anal. Mach. Intell 19(1), 63–68 (1997)

    Article  Google Scholar 

  35. Hohnhaeuser B., Hommel G.: 3D pose estimation using coupled snakes. J. WSCG 12(1–3), 1213–6972 (2003)

    Google Scholar 

  36. Wang, Q., Ronneberger, O., Schulze, E., Baumeister, R., Burkhardt, H.: Using lateral coupled snakes for modeling the contours of worms. In: Pattern Recognition Lecture Notes in Computer Science, vol. 5748, pp. 542–551. (2009)

  37. Ulges, A., Lampert, C., Breuel, T.: Document image dewarping using robust estimation of curled text lines. In: Proceedings of the Eighth International Conference on Document Analysis and Recognition, pp. 1001–1005 (2005)

  38. Breuel, T.M.: Segmentation of handprinted letter strings using a dynamic programming algorithm. In: Proceedings of the Sixth International Conference on Document Analysis and Recognition pp. 821–826 (2001)

  39. Shafait, F., Breuel, T.M.: Document image dewarping contest. In: 2nd International Workshop on Camera-Based Document Analysis and Recognition, pp. 181–188. Curitiba (2007)

  40. Stamatopoulos, N., Gatos, B., Kesidis, A.: Automatic borders detection of camera document images. In: Proceedings of Second International Workshop on Camera-Based Document Analysis and Recognition, pp. 71–78. Curitiba (2007)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Syed Saqib Bukhari.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Bukhari, S.S., Shafait, F. & Breuel, T.M. Coupled snakelets for curled text-line segmentation from warped document images. IJDAR 16, 33–53 (2013). https://doi.org/10.1007/s10032-011-0176-2

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-011-0176-2

Keywords

Navigation