Skip to main content
Log in

Color space transformations for analysis and enhancement of ancient degraded manuscripts

  • Applications Problems
  • Published:
Pattern Recognition and Image Analysis Aims and scope Submit manuscript

Abstract

In this paper we focus on ancient manuscripts, acquired in the RGB modality, which are degraded by the presence of complex background textures that interfere with the text of interest. Removing these artifacts is not trivial, especially with ancient originals, where they are usually very strong. Rather than applying techniques to just cancel out the interferences, we adopt the point of view of separating, extracting and classifying the various patterns superimposed in the document. We show that representing RGB images in different color spaces can be effective for this goal. In fact, even if the RGB color representation is the most frequently used color space in image processing, it does not maximize the information contents of the image. Thus, in the literature, several color spaces have been developed for analysis tasks, such as object segmentation and edge detection. Some color spaces seem to be particularly suitable to the analysis of degraded documents, allowing for the enhancement of the contents, the improvement of the text readability, the extraction of partially hidden features, and a better performance of thresholding techniques for text binarization. We present and discuss several examples of the successful application of both fixed color spaces and self-adaptive color spaces, based on the decorrelation of the original RGB channels. We also show that even simpler arithmetic operations among the channels can be effective for removing bleed-through, refocusing and improving the contrast of the foreground text, and to recover the original RGB appearance of the enhanced document.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. L. T. Chew, R. Cao, and S. Peiyi, “Restoration of Archival Documents Using a Wavelet Technique,” IEEE Trans. Pattern Analysis Machine Intelligence 24, 1399–1404 (2002).

    Article  Google Scholar 

  2. A. Cichocki and S. Amari, Adaptive Blind Signal and Image Processing (Wiley, New York, 2002).

    Book  Google Scholar 

  3. P. Dano, “Joint Restoration and Compression of Document Images with Bleed-through Distortion,” Master Thesis (Ottawa-Carleton Institute for Electrical and Computer Engineering, School of Information Technology and Engineering, University of Ottawa, 2003).

  4. F. Drida, F. LeBourgeois, and H. Emptoz, “Restoring Ink bleed-Through Degraded Document Images Using a Recursive Unsupervised Classification Technique,” in Proc. 7th Workshop on Document Analysts Systems, 2006, pp. 38–49.

  5. R. L. De Queiroz, “On Independent Color Space Transformations for the Compression of CMYK Images,” IEEE Trans. Image Proc. 8, 1446–1451 (1999).

    Article  Google Scholar 

  6. E. Dubois and A. Pathak, “Reduction of Bleed-Through in Scanned Manuscript Documents,” in Proc. IS&T Image Processing, Image Quality, Image Capture Systems Conf., Montreal, Canada, 2001, pp. 177–180.

  7. K. Knox, R. Johnston, and R. L. Easton, Jr., “Imaging the Dead Sea Scroll,” Opt. Photonics News 8, 30–34 (1997).

    Article  Google Scholar 

  8. K. T. Knox, R. L. Easton, Jr., and W. Christens-Barry, “Image restoration of damaged or Erased Manuscripts,” EUSIPCO, 2008.

  9. A. Hyvärinen and E. Oja, “Independent Component Analysis: Algorithms and Applications,” Neural Networks 13, 411–430 (2000).

    Article  Google Scholar 

  10. Y. Leydier, F. LeBourgeois, and H. Emptoz, “Serialized Unsupervised Classifier for Adaptive Color Image Segmentation: Application to Digitized Ancient Manuscripts,” in Proc. Int. Conf. on Pattern Recognition, 2004, pp. 494–497.

  11. G. Leedham, S. Varma, A. Patankar, and V. Govindaraju, “Separating Text and Background in Degraded Document Images—a Comparison of Global Thresholding Techniques for Multi-Stage Thresholding,” in Proc. 8th Int. Workshop on Frontiers in Handwriting Recognition, Niagara on the Lake, Canada, 2002, pp. 244–24.

  12. R. D. Lins, and J. M. Monte da Silva, “Quantitative Method for Assessing Algorithms to Remove Back-to-Front Interference in Documents,” in Proc. ACM Symposium on Applied computing, 2007.

  13. R. F. Moghaddam and M. Cheriet, “Low Quality Document Image Modeling and Enhancement,” Int. J. Document Analysis Recognition 11(4), 183–201 (2009).

    Article  Google Scholar 

  14. H. Nishida and T. Suzuki, “Correction Show-Through Effects in Document Images by Multiscale Analysi,” in Proc. 16th Conf. on Pattern Recognition, Quebec City, Canada, 2002, Vol. 3.

  15. H. Nishida and T. Suzuki, “A Multiscale Approach to Restoring Scanned Color Document Images with Show-Through Effects,” in Proc. 7th Int. Conf. on Document Analysis and Recognition, 2003.

  16. Y. Ohta, T. Kanade, and T. Sakai, “Color Information for Region Segmentation Computer,” Comput. Vision, Graphics Image Processing 13, 222–241 (1980).

    Article  Google Scholar 

  17. G. Sharma, “Digital Color Imaging,” IEEE Trans. Image Processing 6, 901–932 (1997).

    Article  Google Scholar 

  18. G. Sharma, “Show-Through Cancellation in Scans of Duplex Printed Documents,” IEEE Trans. Image Processing 10(5), 736–754 (2001).

    Article  Google Scholar 

  19. R. Swift, Analysis of the Spectra of Degraded Documents (Center for Imaging Science, Rochester Inst. Tech., 2001), http://www.cis.rit.edu/research/thesis/bs/2001/swift/thesis.html.

  20. C. L. Tan, R. Cao, and P. Shen, “Restoration of Archival Documents Using a Wavelet Technique,” IEEE Trans. Pattern Analysis Machine Intelligence 24(10), 1399–1404 (2002).

    Article  Google Scholar 

  21. A. Tonazzini, L. Bedini, and E. Salerno, “Independent Component Analysis for Document Restoration,” Int. J. Document Analysis Recognition 7, 17–27 (2004).

    Google Scholar 

  22. A. Tonazzini, L. Bedini, and E. Salerno, “Image Analysis on the Archimedes Palimpsest,” Ercim News, No. 58, pp. 53–54, 2004.

  23. A. Tonazzini, E. Salerno, M. Mochi, and L. Bedini, Bleed-Through Removal from Degraded Documents Using a Color Decorrelation Method (Lecture Notes in Computer Science), in Proc. 6th Int. Workshop on Document Analysis Systems, Ed. by S. Marinai and A. Dengel (Florence, 2004), Vol. 3163, pp. 229–240.

  24. M. Tkalcic and J. Tosic, “Color Spaces-Perceptual, Historical and Application Background,” Thesis, Faculty of Electrical Engineering, University of Ljubljana, 2000.

  25. C. Vertan and N. Boujemaa, “Color Texture Classification by Normalized Color Space Representation,” in Proc. Int. Conf. on Pattern Recognition, Barcelona, Spain, 2000, pp. 3584–3587.

  26. C. Wolf, Document Ink Bleed-Through Removal with Two Hidden Markov Random Fields and a Single Observation Field (Laboratoire d’Informatique en Images et Systémes d’Information, INSA de Lyon, France, 2006), Tech. Rep. RR-LIRIS-2006-019, Nov. 2006.

  27. Color Encoding Standard (Xerox Corporation, Xerox Systems Institute, 475 Oakmead Parkway, Sunnyvale, Calif., Mar, 1989).

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to A. Tonazzini.

Additional information

The article is published in the original.

Anna Tonazzini received the degree in Mathematics (cum laude) from the University of Pisa, Italy. She is a researcher at the Signals and Images Laboratory of the Istituto di Scienza e Tecnologie dell’Informazione, Italian National Research Council (CNR) in Pisa. She was involved in a number of CNR special programs and European research projects, for basic and applied research on image processing and computer vision, and is co-author of over 80 scientific papers. Her present interests include applied inverse problems, multichannel image restoration and reconstruction, document analysis and recognition, blind source separation, computational biology.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tonazzini, A. Color space transformations for analysis and enhancement of ancient degraded manuscripts. Pattern Recognit. Image Anal. 20, 404–417 (2010). https://doi.org/10.1134/S105466181003017X

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S105466181003017X

Key words

Navigation