Abstract
In this paper we present a new method for color segmentation of complex document images which can be used as a preprocessing step of a text information extraction application. From the edge map of an image, we choose a representative set of samples of the input color image and built the 3D histogram of the RGB color space. These samples are used to locate a relatively large number of proper points in the 3D color space and use them in order to initially reduce the colors. From this step an oversegmented image is produced which usually has no more than 100 colors. To extract the final result, a mean shift procedure starts from the calculated points and locates the final color clusters of the RGB color distribution. Also, to overcome noise problems, a proposed edge preserving smoothing filter is used to enhance the quality of the image. Experimental results showed the method’s capability of producing correctly segmented complex color documents while removing background noise or low contrast objects which is very desirable in text information extraction applications. Additionally, our method has the ability to cluster randomly shaped distributions.
This paper was partially supported by the project Archimedes 04-3-001/4 and Pythagoras 1249-6.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Zhong, Y., Karu, K., Jain, A.K.: Locating text in complex color images. Pattern Recognition 28(10), 1523–1535 (1995)
Chen, W.Y., Chen, S.Y.: Adaptive page segmentation for color technical journals cover images. Image and Vision Computing 16, 855–877 (1998)
Sobottka, K., et al.: Text Extraction from Colored Book and Journal Covers. International Journal on Document Analysis and Recognition 2(4), 163–176 (2000)
Hase, H., Shinokawa, T., Yoneda, M., Suen, C.Y.: Character string extraction from color documents. Pattern Recognition 34(7), 1349–1365 (2001)
Strouthopoulos, C., Papamarkos, N., Atsalakis, A.: Text extraction in complex color documents. Pattern Recognition 35(8), 1743–1758 (2002)
Hase, H., Yoneda, M., Tokai, S., Kato, J., Suen, C.Y.: Color segmentation for text extraction. International Journal on Document Analysis and Recognition 6(4), 271–284 (2003)
Wang, B., Li, X.-F., Liu, F., Hu, F.-Q.: Color text image binarization based on binary texture analysis. Pattern Recognition Letters 26(11), 1650–1657 (2005)
Roerdink, J.B.T.M., Meijster, A.: The watershed transform: Definitions, algorithms and parallelization strategies. Fundamenta Informaticae 41, 187–228 (2000)
Perona, P., Malik, J.: Scale-Space and Edge Detection Using Anisotropic Diffusion. IEEE Trans. Pattern Analysis and Machine Intelligence 12, 629–639 (1990)
Fukunaga, P., Hostetler, L.D.: The Estimation of the Gradient of a Density Function, with Applications in Pattern Recognition. IEEE Trans. Information Theory 21, 32–40 (1975)
Cheng, Y.: Mean Shift, Mode Seeking, and Clustering. IEEE Trans. Pattern Analysis and Machine Intelligence 17(8), 790–799 (1995)
Comaniciu, D., Meer, P.: Mean Shift: A Robust Approach Toward Feature Space Analysis. IEEE Trans. Pattern Analysis and Machine Intelligence 24(5), 603–619 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nikolaou, N., Papamarkos, N. (2007). Color Segmentation of Complex Document Images. In: Braz, J., Ranchordas, A., AraĂºjo, H., Jorge, J. (eds) Advances in Computer Graphics and Computer Vision. Communications in Computer and Information Science, vol 4. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75274-5_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-75274-5_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-75272-1
Online ISBN: 978-3-540-75274-5
eBook Packages: Computer ScienceComputer Science (R0)