Abstract
Binarization of document images has great importance in several applications like historical document restoration, Optical Character Recognition (OCR). It is a challenging task due to small difference between foreground and background pixel intensities, intricate font patterns and noisy background. In this article a binarization algorithm is presented for document images which has performed significantly well on handwritten document images as well as machine printed document images. First, the RGB document images are converted to a prominent gray-scale image using statistical tools like mean, variance and standard deviation. Next, the gray-scale images are binarized using edge detection. Further the noises are removed using connected component features analysis. The proposed method is experimented on publicly available DIBCO 2016 and DIBCO 2017 datasets. The performance of the proposed algorithm is satisfactory in terms of F-Measure (FM), Pseudo-FMeasure (Fps), PSNR, Distance Reciprocal Distortion (DRD) and it also provides significant results on degraded document images.
Similar content being viewed by others
References
Block M, Rojas R (2009) Local contrast segmentation to binarize images. In: Proc. of the 3rd international conference on digital society (ICDS 2009), vol 1, no 1. Cancun, pp 294–299
Canny J (1986) A computational approach to edge detection. IEEE Trans Pattern Anal Mach Intell 8(6):679–698
Chaudhuri BB, Adak C (2017) An approach for detecting and cleaning of struck-out handwritten text. Pattern Recogn 61:282–294
da Silva JMM, Lins RD, Martins FMJ, Wachenchauzer R (2008) A new and efficient algorithm to binarize document images removing back-to-front interference. J Univ Comput Sci 14:299–313
Fiel S, et al. (2017) ICDAR2017 competition on historical document writer identification (historical-WI). In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR). Kyoto, pp 1377–1382
Gatos B, Ntirogiannis K, Pratikakis I (2009) ICDAR 2009 document image binarization contest (DIBCO2009). In: 10th International conference on document analysis and recognition (ICDAR09). Barcelona, pp 1375–1382
Gattal A, Djeddi C, Chibani Y, Siddiqi I (2016) Isolated handwritten digit recognition using oBIFs and background features. In: 12th IAPR International workshop on document analysis systems (DAS-2016). Santorini, pp 305–310
Griffin L, Lillholm M, Crosier M, van Sande J (2009) Basic image features (bifs) arising from approximate symmetry type. Scale Space and Variational Methods in Computer Vision, 343–355
He K, Zhang X, Ren S, Sun J (2015) Deep residual learning for image recognition. arXiv:1512.03385
Howe N (2013) Document binarization with automatic parameter tuning. Int J Doc Anal Recognit 16(3):247–258
Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv:1502.03167 [cs.CV]
Kavallieratou E, Stamatatos E (2006) Improving the quality of degraded document images. In: Proc of the second international conference on document image analysis for libraries (DIAL’06), pp 340–349
Kefali A, Sari T, Bahi H (2014) Text extraction from historical document images by the combination of several thresholding techniques. Adv Multimed 2014:Article ID 934656, 10 pages
Koltun V (2011) Efficient inference in fully connected CRFs with Gaussian edge potentials. Adv Neural Inf Process Syst 2.3:4. arXiv:1210.5644 [cs.CV]
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition. arXiv:1411.4038 [cs.CV]
Makridis M, Papamarkos N (2010) An adaptive layer-based local binarization technique for degraded documents. Int J Pattern Recognit Artif Intell 24(02):245–279
Ntirogiannis K, Gatos B, Pratikakis I (2014) A combined approach for the binarization of handwritten document images. Pattern Recogn Lett 35:3–15
Ntirogiannis K, Gatos B, Pratikakis I (2014) ICFHR 2014 competition on handwritten document image binarization (H-DIBCO 2014). In: 14th International conference on frontiers in handwriting recognition (ICFHR14). ISBN-978-1-4799-4335-7. IEEE Computer Society Press, Crete island, pp 809–813
Otsu N (1979) A threshold selection method from graylevel histograms. IEEE Trans Systems Man Cybernet 9(1):62–66
Papadopoulos C, Pletschacher S, Clausner C, Antonacopoulos A (2013) The IMPACT dataset of historical document images. In: Proceedings of the 2Nd international workshop on historical document imaging and processing, pp 123–130
Pratikakis I, Gatos B, Ntirogiannis K (2010) H-DIBCO 2010 handwritten document image binarization competition. In: 12th International conference on frontiers in handwriting recognition (ICFHR10). Kolkata, pp 727–732
Pratikakis I, Gatos B, Ntirogiannis K (2011) ICDAR 2011 document image binarization contest (DIBCO 2011). In: 2011 International conference on document analysis and recognition. Beijing, pp 1506–1510
Pratikakis I, Gatos B, Ntirogiannis K (2012) ICFHR 2012 competition on handwritten document image binarization (HDIBCO 2012). In: 13th International conference on frontiers in handwriting recognition (ICFHR12). Bari, 813–818
Pratikakis I, Gatos B, Ntirogiannis K (2013) ICDAR 2013 document image binarization contest (DIBCO2013). In: 12th International conference on document analysis and recognition (ICDAR13). Washington DC, pp 1395–1403
Pratikakis I, Zagoris K, Barlas G, Gatos B (2016) ICFHR 2016 handwritten document image binarization contest (H-DIBCO 2016). In: 15th International conference on frontiers in handwriting recognition (ICFHR16). Shenzhen, IEEE Computer Society Press, pp 619–623
Pratikakis I, Zagoris K, Barlas G, Gatos B (2017) ICDAR2017 competition on document image binarization (DIBCO 2017). In: 2017 14th IAPR International conference on document analysis and recognition (ICDAR). Kyoto, pp 1395–1403
Roe E, de Mello CAB (2015) Restoring images of ancient color postcards. Vis Comput 31(5):627–641
Sanchez JA, Romero V, Toselli AH, Vidal E (2016) ICFHR2016 competition on handwritten text recognition on the READ dataset. In: 2016 15th International conference on frontiers in handwriting recognition (ICFHR). Shenzhen, pp 630–635
Sauvola J, Pietikainen M (2000) Adaptive document image binarization. Pattern Recogn 33(2):225–236
Singh P, Vats E, Hast A (2018) Learning surrogate models of document image quality metrics for automated document image processing. In: 13th IAPR International workshop on document analysis systems (DAS). arXiv:1712.03738 [cs.CV]
Su B, Lu S, Tan CL (2013) Robust document image binarization technique for degraded document images. IEEE Trans Image Process 22(4):1408–1417
Tensmeyer C, Davis B, Wigington C, Lee I, Barrett B (2017) PageNet: page boundary extraction in historical handwritten documents. In: Proceedings of the 4th international workshop on historical document imaging and processing. Kyoto, Japan, November 2017 (HIP17), pp 59–64
Wigington C, Tensmeyer C, Davis B, Barrett B, Price B, Cohen Start S (2018) Follow, read: end-to-end full page handwriting recognition. In: European Conference on computer vision (ECCV)
Wu Y, et al. (2016) Learning document image binarization from data. In: IEEE Proceedings of international conference on image processing (ICIP). arXiv:1505.00529 [cs.CV]
Ziaei Nafchi H, Farrahi Moghaddam R, Cheriet M (2013) Historical document binarization based on phase information of images. In: Lecture notes in computer science: Asian conference on computer vision (ACCVFFFD;12 Workshops), vol 7729. Springer, Berlin, pp 1–12
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Das, S. A statistical tool based binarization method for document images. Multimed Tools Appl 78, 27449–27462 (2019). https://doi.org/10.1007/s11042-019-07857-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-019-07857-x