Skip to main content

Color Segmentation of Complex Document Images

  • Conference paper
Advances in Computer Graphics and Computer Vision

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 4))

Abstract

In this paper we present a new method for color segmentation of complex document images which can be used as a preprocessing step of a text information extraction application. From the edge map of an image, we choose a representative set of samples of the input color image and built the 3D histogram of the RGB color space. These samples are used to locate a relatively large number of proper points in the 3D color space and use them in order to initially reduce the colors. From this step an oversegmented image is produced which usually has no more than 100 colors. To extract the final result, a mean shift procedure starts from the calculated points and locates the final color clusters of the RGB color distribution. Also, to overcome noise problems, a proposed edge preserving smoothing filter is used to enhance the quality of the image. Experimental results showed the method’s capability of producing correctly segmented complex color documents while removing background noise or low contrast objects which is very desirable in text information extraction applications. Additionally, our method has the ability to cluster randomly shaped distributions.

This paper was partially supported by the project Archimedes 04-3-001/4 and Pythagoras 1249-6.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognition 28(10), 1523–1535 (1995)

    Article  Google Scholar 

  • Chen, W.Y., Chen, S.Y.: Adaptive page segmentation for color technical journals cover images. Image and Vision Computing 16, 855–877 (1998)

    Article  Google Scholar 

  • Sobottka, K., et al.: Text Extraction from Colored Book and Journal Covers. International Journal on Document Analysis and Recognition 2(4), 163–176 (2000)

    Google Scholar 

  • Hase, H., Shinokawa, T., Yoneda, M., Suen, C.Y.: Character string extraction from color documents. Pattern Recognition 34(7), 1349–1365 (2001)

    Article  MATH  Google Scholar 

  • Strouthopoulos, C., Papamarkos, N., Atsalakis, A.: Text extraction in complex color documents. Pattern Recognition 35(8), 1743–1758 (2002)

    Article  MATH  Google Scholar 

  • Hase, H., Yoneda, M., Tokai, S., Kato, J., Suen, C.Y.: Color segmentation for text extraction. International Journal on Document Analysis and Recognition 6(4), 271–284 (2003)

    Article  Google Scholar 

  • Wang, B., Li, X.-F., Liu, F., Hu, F.-Q.: Color text image binarization based on binary texture analysis. Pattern Recognition Letters 26(11), 1650–1657 (2005)

    Article  Google Scholar 

  • Roerdink, J.B.T.M., Meijster, A.: The watershed transform: Definitions, algorithms and parallelization strategies. Fundamenta Informaticae 41, 187–228 (2000)

    MathSciNet  MATH  Google Scholar 

  • Perona, P., Malik, J.: Scale-Space and Edge Detection Using Anisotropic Diffusion. IEEE Trans. Pattern Analysis and Machine Intelligence 12, 629–639 (1990)

    Article  Google Scholar 

  • Fukunaga, P., Hostetler, L.D.: The Estimation of the Gradient of a Density Function, with Applications in Pattern Recognition. IEEE Trans. Information Theory 21, 32–40 (1975)

    Article  MathSciNet  MATH  Google Scholar 

  • Cheng, Y.: Mean Shift, Mode Seeking, and Clustering. IEEE Trans. Pattern Analysis and Machine Intelligence 17(8), 790–799 (1995)

    Article  Google Scholar 

  • Comaniciu, D., Meer, P.: Mean Shift: A Robust Approach Toward Feature Space Analysis. IEEE Trans. Pattern Analysis and Machine Intelligence 24(5), 603–619 (2002)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nikolaou, N., Papamarkos, N. (2007). Color Segmentation of Complex Document Images. In: Braz, J., Ranchordas, A., AraĂºjo, H., Jorge, J. (eds) Advances in Computer Graphics and Computer Vision. Communications in Computer and Information Science, vol 4. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75274-5_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-75274-5_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-75272-1

  • Online ISBN: 978-3-540-75274-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics