Skip to main content
Log in

Adaptive binarization method for degraded document images based on surface contrast variation

  • Theoretical Advances
  • Published:
Pattern Analysis and Applications Aims and scope Submit manuscript

Abstract

Document binarization is an important technique in document image analysis and recognition. Generally, binarization methods are ineffective for degraded images. Several binarization methods have been proposed; however, none of them are effective for historical and degraded document images. In this paper, a new binarization method is proposed for degraded document images. The proposed method based on the variance between pixel contrast, it consists of four stages: pre-processing, geometrical feature extraction, feature selection, and post-processing. The proposed method was evaluated based on several visual and statistical experiments. The experiments were conducted using five International Document Image Binarization Contest benchmark datasets specialized for binarization testing. The results compared with five adaptive binarization methods: Niblack, Sauvola thresholding, Sauvola compound algorithm, NICK, and Bataineh. The results show that the proposed method performs better than other methods in all binarization cases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  1. Kefali A, Sari T, Sellami M (2010) Evaluation of several binarization techniques for old Arabic documents images. In: 1st International symposium on modeling and implementing complex systems “MISC’2010”, pp 88–99

  2. Khurshid K, Siddiqi I, Faure C, Vincent N (2010) Comparison of Niblack Inspired Binarization Methods for Ancient Documents. In: 16th International conference on Document Recognition and Retrieval, pp 1–10

  3. Stathis P, Kavallieratou E, Papamarkos N (2008) An evaluation technique for binarization algorithms. J Univers Comput Sci 14:3011–3030

  4. Otsu N (1979) A thresholding selection method from gray-scale histogram. IEEE Trans Syst Man Cybern 9:62–66

    Article  Google Scholar 

  5. Niblack W (1985) An introduction to digital image processing. Prentice Hall, pp 115–116

  6. Sauvola J, Seppanen T, Haapakoski S, Pietikainen M (1997) Adaptive document binarization. In: Fourth international conference document analysis and recognition (ICDAR), pp 147–152

  7. Bataineh B, Abdullah SNHS, Omer K (2011) An adaptive local binarization method for document images based on a novel thresholding method and dynamic windows. Pattern Recogn Lett 32:1805–1813

    Article  Google Scholar 

  8. Chou C, Lin W, Chang F (2010) A binarization method with learning-built rules for document images produced by cameras. Pattern Recognit 43:1518–1530

    Article  MATH  Google Scholar 

  9. Gatos B, Pratikakis I, Perantonis S (2006) Adaptive degraded document image binarization. Pattern Recognit 39:317–327

    Article  MATH  Google Scholar 

  10. Sauvola J, Pietikainen M (2000) Adaptive document image binarization. Pattern Recognit 33:225–236

    Article  Google Scholar 

  11. Howe R (2011) A laplacian energy for document binarization, “ICDAR 2011”. In: International conference on document analysis and recognition, pp 6–10

  12. Gatos B, Ntirogiannis K, Pratikakis I (2009) ICDAR 2009 document image binarization contest. In: proceedings 10th international conference on document analysis and recognition, pp 1375–1382

  13. Gatos B, Ntirogiannis K, Pratikakis I, DIBCO 2009 (2009) Document image binarization contest. Int J Doc Anal Recognit 14(2011):35–44

    Google Scholar 

  14. Pratikakis I, Gatos B, Ntirogiannis K (2010) H-DIBCO 2010—handwritten document image binarization competition. In: 12th international conference on frontiers in handwriting recognition, pp 727–732

  15. Pratikakis I, Gatos B, Ntirogiannis K (2011) ICDAR 2011 document image binarization contest (DIBCO 2011). In: International conference on document analysis and recognition “ICDAR2011”, pp 1506–1510

  16. Pratikakis I, Gatos B, Ntirogiannis K (2012) ICFHR 2012 competition on handwritten document image binarization (H-DIBCO 2012). In: 2012 international conference on frontiers in handwriting recognition, pp 813–818

  17. Bassiou N, Kotropoulos C (2007) Color image histogram equalization by absolute discounting back-off. Comput Vis Image Underst 107:108–122

    Article  Google Scholar 

  18. Gonzalez RC, Woods RE (2007) Digital image processing, 3rd ed. Prentice Hall, Upper Saddle River, NJ

    Google Scholar 

  19. Bar-Yosef I, Beckman I, Kedem K, Dinstein I (2007) Binarization, character extraction, and writer identification of historical Hebrew calligraphy documents. Int J Doc Anal Recognit 9:89–99

    Article  Google Scholar 

  20. Chichilnisky E, Kalmar JRS (2002) Functional asymmetries in ON and OFF ganglion cells of primate retina. J Neurosci 22:2737–2747

    Google Scholar 

  21. Fiorentini A (2004) Brightness and lightness. In: The visual neurosciences, vol 2. MIT Press, Cambridge, pp 881–891

  22. Vonikakis V, Andreadis I, Papamarkos N (2011) Robust document binarization with OFF center-surround cells. Pattern Anal Appl 14:219–234

    Article  MathSciNet  Google Scholar 

  23. Konstantinidis K, Vonikakis V, Panitsidis G, Andreadis I (2011) A center-surround histogram for content based image retrieval. Pattern Anal Appl 14:251–260

    Article  MathSciNet  Google Scholar 

  24. Shapiro LG, Stockman G (2001) Computer vision, 1st edn. Prentice Hall PTR, Upper Saddle River, NJ

    Google Scholar 

  25. Chiu Y, Chung K, Yang W, Huang Y, Liao C (2012) Parameter-free based two-stage method for binarizing degraded document images. Pattern Recogn 45:4250–4262

    Article  Google Scholar 

  26. Sokolova M, Lapalme G (2009) A systematic analysis of performance measures for classification tasks. Inf Process Manag 45:427–437

    Article  Google Scholar 

  27. Narendra, Patrenahalli M (1981) A separable median filter for image noise smoothing. In: IEEE transactions on pattern analysis and machine intelligence, pp 20–29

  28. Wang J, Lin L (1997) Improved median filter using minmax algorithm for image processing. Electron Lett 33:1362–1363

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bilal Bataineh.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bataineh, B., Abdullah, S.N.H.S. & Omar, K. Adaptive binarization method for degraded document images based on surface contrast variation. Pattern Anal Applic 20, 639–652 (2017). https://doi.org/10.1007/s10044-015-0520-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10044-015-0520-0

Keywords

Navigation