Skip to main content
Log in

A statistical tool based binarization method for document images

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Binarization of document images has great importance in several applications like historical document restoration, Optical Character Recognition (OCR). It is a challenging task due to small difference between foreground and background pixel intensities, intricate font patterns and noisy background. In this article a binarization algorithm is presented for document images which has performed significantly well on handwritten document images as well as machine printed document images. First, the RGB document images are converted to a prominent gray-scale image using statistical tools like mean, variance and standard deviation. Next, the gray-scale images are binarized using edge detection. Further the noises are removed using connected component features analysis. The proposed method is experimented on publicly available DIBCO 2016 and DIBCO 2017 datasets. The performance of the proposed algorithm is satisfactory in terms of F-Measure (FM), Pseudo-FMeasure (Fps), PSNR, Distance Reciprocal Distortion (DRD) and it also provides significant results on degraded document images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Block M, Rojas R (2009) Local contrast segmentation to binarize images. In: Proc. of the 3rd international conference on digital society (ICDS 2009), vol 1, no 1. Cancun, pp 294–299

  2. Canny J (1986) A computational approach to edge detection. IEEE Trans Pattern Anal Mach Intell 8(6):679–698

    Article  Google Scholar 

  3. Chaudhuri BB, Adak C (2017) An approach for detecting and cleaning of struck-out handwritten text. Pattern Recogn 61:282–294

    Article  Google Scholar 

  4. da Silva JMM, Lins RD, Martins FMJ, Wachenchauzer R (2008) A new and efficient algorithm to binarize document images removing back-to-front interference. J Univ Comput Sci 14:299–313

    Google Scholar 

  5. Fiel S, et al. (2017) ICDAR2017 competition on historical document writer identification (historical-WI). In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR). Kyoto, pp 1377–1382

  6. Gatos B, Ntirogiannis K, Pratikakis I (2009) ICDAR 2009 document image binarization contest (DIBCO2009). In: 10th International conference on document analysis and recognition (ICDAR09). Barcelona, pp 1375–1382

  7. Gattal A, Djeddi C, Chibani Y, Siddiqi I (2016) Isolated handwritten digit recognition using oBIFs and background features. In: 12th IAPR International workshop on document analysis systems (DAS-2016). Santorini, pp 305–310

  8. Griffin L, Lillholm M, Crosier M, van Sande J (2009) Basic image features (bifs) arising from approximate symmetry type. Scale Space and Variational Methods in Computer Vision, 343–355

  9. He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. arXiv:1512.03385

  10. Howe N (2013) Document binarization with automatic parameter tuning. Int J Doc Anal Recognit 16(3):247–258

    Article  Google Scholar 

  11. Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167 [cs.CV]

  12. Kavallieratou E, Stamatatos E (2006) Improving the quality of degraded document images. In: Proc of the second international conference on document image analysis for libraries (DIAL’06), pp 340–349

  13. Kefali A, Sari T, Bahi H (2014) Text extraction from historical document images by the combination of several thresholding techniques. Adv Multimed 2014:Article ID 934656, 10 pages

    Google Scholar 

  14. Koltun V (2011) Efficient inference in fully connected CRFs with Gaussian edge potentials. Adv Neural Inf Process Syst 2.3:4. arXiv:1210.5644 [cs.CV]

    Google Scholar 

  15. Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. arXiv:1411.4038 [cs.CV]

  16. Makridis M, Papamarkos N (2010) An adaptive layer-based local binarization technique for degraded documents. Int J Pattern Recognit Artif Intell 24(02):245–279

    Article  Google Scholar 

  17. Ntirogiannis K, Gatos B, Pratikakis I (2014) A combined approach for the binarization of handwritten document images. Pattern Recogn Lett 35:3–15

    Article  Google Scholar 

  18. Ntirogiannis K, Gatos B, Pratikakis I (2014) ICFHR 2014 competition on handwritten document image binarization (H-DIBCO 2014). In: 14th International conference on frontiers in handwriting recognition (ICFHR14). ISBN-978-1-4799-4335-7. IEEE Computer Society Press, Crete island, pp 809–813

  19. Otsu N (1979) A threshold selection method from graylevel histograms. IEEE Trans Systems Man Cybernet 9(1):62–66

    Article  Google Scholar 

  20. Papadopoulos C, Pletschacher S, Clausner C, Antonacopoulos A (2013) The IMPACT dataset of historical document images. In: Proceedings of the 2Nd international workshop on historical document imaging and processing, pp 123–130

  21. Pratikakis I, Gatos B, Ntirogiannis K (2010) H-DIBCO 2010 handwritten document image binarization competition. In: 12th International conference on frontiers in handwriting recognition (ICFHR10). Kolkata, pp 727–732

  22. Pratikakis I, Gatos B, Ntirogiannis K (2011) ICDAR 2011 document image binarization contest (DIBCO 2011). In: 2011 International conference on document analysis and recognition. Beijing, pp 1506–1510

  23. Pratikakis I, Gatos B, Ntirogiannis K (2012) ICFHR 2012 competition on handwritten document image binarization (HDIBCO 2012). In: 13th International conference on frontiers in handwriting recognition (ICFHR12). Bari, 813–818

  24. Pratikakis I, Gatos B, Ntirogiannis K (2013) ICDAR 2013 document image binarization contest (DIBCO2013). In: 12th International conference on document analysis and recognition (ICDAR13). Washington DC, pp 1395–1403

  25. Pratikakis I, Zagoris K, Barlas G, Gatos B (2016) ICFHR 2016 handwritten document image binarization contest (H-DIBCO 2016). In: 15th International conference on frontiers in handwriting recognition (ICFHR16). Shenzhen, IEEE Computer Society Press, pp 619–623

  26. Pratikakis I, Zagoris K, Barlas G, Gatos B (2017) ICDAR2017 competition on document image binarization (DIBCO 2017). In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR). Kyoto, pp 1395–1403

  27. Roe E, de Mello CAB (2015) Restoring images of ancient color postcards. Vis Comput 31(5):627–641

    Article  Google Scholar 

  28. Sanchez JA, Romero V, Toselli AH, Vidal E (2016) ICFHR2016 competition on handwritten text recognition on the READ dataset. In: 2016 15th International conference on frontiers in handwriting recognition (ICFHR). Shenzhen, pp 630–635

  29. Sauvola J, Pietikainen M (2000) Adaptive document image binarization. Pattern Recogn 33(2):225–236

    Article  Google Scholar 

  30. Singh P, Vats E, Hast A (2018) Learning surrogate models of document image quality metrics for automated document image processing. In: 13th IAPR International workshop on document analysis systems (DAS). arXiv:1712.03738 [cs.CV]

  31. Su B, Lu S, Tan CL (2013) Robust document image binarization technique for degraded document images. IEEE Trans Image Process 22(4):1408–1417

    Article  MathSciNet  MATH  Google Scholar 

  32. Tensmeyer C, Davis B, Wigington C, Lee I, Barrett B (2017) PageNet: page boundary extraction in historical handwritten documents. In: Proceedings of the 4th international workshop on historical document imaging and processing. Kyoto, Japan, November 2017 (HIP17), pp 59–64

  33. Wigington C, Tensmeyer C, Davis B, Barrett B, Price B, Cohen Start S (2018) Follow, read: end-to-end full page handwriting recognition. In: European Conference on computer vision (ECCV)

  34. Wu Y, et al. (2016) Learning document image binarization from data. In: IEEE Proceedings of international conference on image processing (ICIP). arXiv:1505.00529 [cs.CV]

  35. Ziaei Nafchi H, Farrahi Moghaddam R, Cheriet M (2013) Historical document binarization based on phase information of images. In: Lecture notes in computer science: Asian conference on computer vision (ACCVFFFD;12 Workshops), vol 7729. Springer, Berlin, pp 1–12

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sayan Das.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Das, S. A statistical tool based binarization method for document images. Multimed Tools Appl 78, 27449–27462 (2019). https://doi.org/10.1007/s11042-019-07857-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-019-07857-x

Keywords

Navigation