Abstract
The main purpose of conducted research is the development of a new image thresholding method, which is faster than typical adaptive methods and more accurate than global binarization. Since natural images captured by cameras are usually unevenly illuminated, due to unknown and various lighting conditions, an appropriate binarization influences the results of further image analysis significantly.
In this paper, the analysis of multi-layered stack of regions, being the enhancement of the single-layer version, is proposed to calculate the local image properties. Since the balance between the global and local adaptive thresholding requires the choice of an appropriate number of shifted layers and block size, its verification has been made using a database of test images. The proposed local threshold value is chosen as the mean local intensity corrected using two additional parameters subjected to optimization.
The developed procedure allows for more accurate and faster binarization, which can be applied in many technical systems. It has been verified by the example of text recognition accuracy for the non-uniformly illuminated document images in comparison to alternative global and local methods of similar of lower computational complexity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bernsen, J.: Dynamic thresholding of grey-level images. In: Proceedings of International Conference on Pattern Recognition (ICPR), pp. 1251–1255 (1986)
Bradley, D., Roth, G.: Adaptive thresholding using the integral image. J. Graph. Tools 12(2), 13–21 (2007). https://doi.org/10.1080/2151237X.2007.10129236
Feng, M.L., Tan, Y.P.: Adaptive binarization method for document image analysis. In: 2004 IEEE International Conference on Multimedia and Expo (ICME), vol. 1, pp. 339–342 (2004). https://doi.org/10.1109/ICME.2004.1394198
Kapur, J., Sahoo, P., Wong, A.: A new method for gray-level picture thresholding using the entropy of the histogram. Comput. Vis. Graph. Image Process. 29(3), 273–285 (1985). https://doi.org/10.1016/0734-189X(85)90125-2
Khurshid, K., Siddiqi, I., Faure, C., Vincent, N.: Comparison of Niblack inspired binarization methods for ancient documents. In: Document Recognition and Retrieval XVI, vol. 7247, pp. 7247–7247-9 (2009). https://doi.org/10.1117/12.805827
Kulyukin, V., Kutiyanawala, A., Zaman, T.: Eyes-free barcode detection on smartphones with Niblack’s binarization and Support Vector Machines. In: Proceedings of the 16th International Conference on Image Processing, Computer Vision, and Pattern Recognition, IPCV 2012, vol. 1, pp. 284–290. CSREA Press (2012)
Lech, P., Okarma, K., Wojnar, D.: Binarization of document images using the modified local-global Otsu and Kapur algorithms. Przegląd Elektrotechniczny 91(1), 71–74 (2015). https://doi.org/10.15199/48.2015.02.1
Leedham, G., Yan, C., Takru, K., Tan, J.H.N., Mian, L.: Comparison of some thresholding algorithms for text/background segmentation in difficult document images. In: Proceedings of 7th International Conference on Document Analysis and Recognition, ICDAR 2003, pp. 859–864 (2003). https://doi.org/10.1109/ICDAR.2003.1227784
Michalak, H., Okarma, K.: Region based adaptive binarization for optical character recognition purposes. In: Proceedings of International Interdisciplinary PhD Workshop (IIPhDW), pp. 361–366, Świnoujście, Poland (2018). https://doi.org/10.1109/IIPHDW.2018.8388391
Michalak, H., Okarma, K.: Fast adaptive image binarization using the region based approach. In: Silhavy, R. (ed.) CSOC2018 2018. AISC, vol. 764, pp. 79–90. Springer, Cham (2019). https://doi.org/10.1007/978-3-319-91189-2_9
Niblack, W.: An Introduction to Digital Image Processing. Prentice Hall, Englewood Cliffs (1986)
Ntirogiannis, K., Gatos, B., Pratikakis, I.: Performance evaluation methodology for historical document image binarization. IEEE Trans. Image Process. 22(2), 595–609 (2013). https://doi.org/10.1109/TIP.2012.2219550
Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979). https://doi.org/10.1109/TSMC.1979.4310076
Rosin, P.L.: Unimodal thresholding. Pattern Recognit. 34(11), 2083–2096 (2001). https://doi.org/10.1016/S0031-3203(00)00136-9
Samorodova, O.A., Samorodov, A.V.: Fast implementation of the Niblack binarization algorithm for microscope image segmentation. Pattern Recognit. Image Anal. 26(3), 548–551 (2016). https://doi.org/10.1134/S1054661816030020
Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recognit. 33(2), 225–236 (2000). https://doi.org/10.1016/S0031-3203(99)00055-2
Saxena, L.P.: Niblack’s binarization method and its modifications to realtime applications: a review. Artif. Intell. Rev. 1–33 (2017). https://doi.org/10.1007/s10462-017-9574-2
Shrivastava, A., Srivastava, D.K.: A review on pixel-based binarization of gray images. In: Satapathy, S.C., Bhatt, Y.C., Joshi, A., Mishra, D.K. (eds.) Proceedings of the International Congress on Information and Communication Technology. AISC, vol. 439, pp. 357–364. Springer, Singapore (2016). https://doi.org/10.1007/978-981-10-0755-2_38
Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 45(4), 427–437 (2009). https://doi.org/10.1016/j.ipm.2009.03.002
Wolf, C., Jolion, J.M.: Extraction and recognition of artificial text in multimedia documents. Form. Pattern Anal. Appl. 6(4), 309–326 (2004). https://doi.org/10.1007/s10044-003-0197-7
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Michalak, H., Okarma, K. (2019). Adaptive Image Binarization Based on Multi-layered Stack of Regions. In: Vento, M., Percannella, G. (eds) Computer Analysis of Images and Patterns. CAIP 2019. Lecture Notes in Computer Science(), vol 11679. Springer, Cham. https://doi.org/10.1007/978-3-030-29891-3_25
Download citation
DOI: https://doi.org/10.1007/978-3-030-29891-3_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-29890-6
Online ISBN: 978-3-030-29891-3
eBook Packages: Computer ScienceComputer Science (R0)