Skip to main content

Text Localization and Extraction from Complex Gray Images

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4338))

Abstract

We propose two texture-based approaches, one involving Gabor filters and the other employing log-polar wavelets, for separating text from non-text elements in a document image. Both the proposed algorithms compute local energy at some information-rich points, which are marked by Harris’ corner detector. The advantage of this approach is that the algorithm calculates the local energy at selected points and not throughout the image, thus saving a lot of computational time. The algorithm has been tested on a large set of scanned text pages and the results have been seen to be better than the results from the existing algorithms. Among the proposed schemes, the Gabor filter based scheme marginally outperforms the wavelet based scheme.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Fan, K.C., Wang, L.S., Wang, Y.K.: page segmentation and identification for intelligent signal processing. Signal Processing 45, 329–346 (1995)

    Article  MATH  Google Scholar 

  2. Smith, M.A., Kanade, T.: Video skimming for quick browsing based on audio and image characterization, CMU-CS-95-186, Technical report, Carnegie Mellon University (1995)

    Google Scholar 

  3. Jung, K.: Neural network-based text location in color images. Pattern Recognition Letters 22, 1503–1515 (2001)

    Article  MATH  Google Scholar 

  4. Wu, J., Qu, S.-L., Zhuo, Q., Wang, W.-Y.: Automatic text detection in complex color images. In: Proc. of Intl. Conf. on Machine Learning and Cybernetics (2002)

    Google Scholar 

  5. Yuan, Q., Tan, C.L.: Text Extraction from Gray Scale Document Images Using Edge Information. In: Proc. of Sixth Intl. Conf. on Document Analysis and Recognition (2001)

    Google Scholar 

  6. Messelodi, S., Modena, C.M.: Automatic identification and skew estimation of text lines in real scene images. Pattern Recognition 32, 791–810 (1999)

    Article  Google Scholar 

  7. Jain, A.K., Yu, B.: Automatic text location in images and video frames. Pattern Recognition 31, 2055–2076 (1998)

    Article  Google Scholar 

  8. Strouthpoulos, C., Papamarkos, N., Atsalakis, A.E.: Text extraction in complex color Document. Pattern Recognition 35, 1743–1758 (2002)

    Article  Google Scholar 

  9. Sabari Raju, S., Pati, P.B., Ramakrishnan, A.G.: Text Localization and Extraction from Complex Color Images. In: Bebis, G., Boyle, R., Koracin, D., Parvin, B. (eds.) ISVC 2005. LNCS, vol. 3804, pp. 486–493. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  10. Jain, R., Antani, S., Kasturi, R.: A survey on the use of pattern recognition methods for abstraction, indexing and retrieval of images and video. Pattern Recognition 35(4), 945–965 (2002)

    Article  MATH  Google Scholar 

  11. Jung, K., Kim, K.I., Jain, A.K.: Text Information Extraction in Images and Video: A Survey. Pattern Recognition 37(5), 977–997 (2004)

    Article  Google Scholar 

  12. Pun, C.M., Lee, M.C.: Log-polar wavelet energy signature for rotation and scale invariant texture classification. IEEE Trans. PAMI 25(5), 590–603 (2003)

    Google Scholar 

  13. Harris, C., Stephens, M.: A combined corner and edge detector. In: Proc. 4th Alvey Vision Conf., pp. 147–151 (1988)

    Google Scholar 

  14. Davoine, F., et al.: Fractal images compression based on Delaunay triangulation and vector quantization. IEEE Trans. on Image Processing 5(2), 338–346 (1996)

    Article  Google Scholar 

  15. Xiao, Y., Yan, H.: Text region extraction in a document image based on the Delaunay tessellation. Pattern Recognition 36, 799–809 (2003)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nourbakhsh, F., Pati, P.B., Ramakrishnan, A.G. (2006). Text Localization and Extraction from Complex Gray Images. In: Kalra, P.K., Peleg, S. (eds) Computer Vision, Graphics and Image Processing. Lecture Notes in Computer Science, vol 4338. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949619_69

Download citation

  • DOI: https://doi.org/10.1007/11949619_69

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68301-8

  • Online ISBN: 978-3-540-68302-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics