Advertisement

Fast Adaptive Binarization with Background Estimation for Non-uniformly Lightened Document Images

  • Hubert Michalak
  • Krzysztof Okarma
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 892)

Abstract

Fast and reliable adaptive binarization of unevenly lightened document images is one of the key issues for the Optical Character Recognition (OCR) purposes applied in mobile devices with limited computational power. Considering the document image captured in unknown lighting conditions the use of a single global thresholding in the binarization step makes the text recognition impossible as some parts of it might be lost in the analysed binary image.

On the other hand some well-known adaptive binarization methods e.g. Niblack, Sauvola and their modifications, are computationally demanding and might not be efficiently applied in some applications. Therefore a method for filling the gap between those two approaches is proposed in the paper. It is based on the region based approach utilizing the lighting correction method, in which input data are taken from lighting distribution approximated using reduced resolution images. Obtained binarization results are superior in comparison to typically used adaptive thresholding algorithms in terms of computational speed as well as the final OCR accuracy.

Keywords

Binarization OCR Document image analysis 

References

  1. 1.
    Bradley, D., Roth, G.: Adaptive thresholding using the integral image. J. Gr. Tools 12(2), 13–21 (2007)CrossRefGoogle Scholar
  2. 2.
    Chou, C.H., Lin, W.H., Chang, F.: A binarization method with learning-built rules for document images produced by cameras. Pattern Recognit. 43(4), 1518–1530 (2010)CrossRefGoogle Scholar
  3. 3.
    Feng, M.L., Tan, Y.P.: Adaptive binarization method for document image analysis. In: Proceedings of the 2004 IEEE International Conference on Multimedia and Expo (ICME), vol. 1, pp. 339–342 (2004)Google Scholar
  4. 4.
    Gatos, B., Pratikakis, I., Perantonis, S.: Adaptive degraded document image binarization. Pattern Recognit. 39(3), 317–327 (2006)CrossRefGoogle Scholar
  5. 5.
    Khurshid, K., Siddiqi, I., Faure, C., Vincent, N.: Comparison of Niblack inspired binarization methods for ancient documents. In: Document Recognition and Retrieval XVI, vol. 7247, pp. 7247–7247–9 (2009)Google Scholar
  6. 6.
    Lech, P., Okarma, K.: Fast histogram based image binarization using the Monte Carlo threshold estimation. In: Chmielewski, L.J., Kozera, R., Shin, B.S., Wojciechowski, K. (eds.) Computer Vision and Graphics. LNCS, vol. 8671, pp. 382–390. Springer International Publishing, Switzerland (2014)Google Scholar
  7. 7.
    Lech, P., Okarma, K.: Optimization of the fast image binarization method based on the Monte Carlo approach. Elektron. Ir Elektrotech. 20(4), 63–66 (2014)CrossRefGoogle Scholar
  8. 8.
    Leedham, G., Yan, C., Takru, K., Tan, J.H.N., Mian, L.: Comparison of some thresholding algorithms for text/background segmentation in difficult document images. In: Proceedings of the 7th International Conference on Document Analysis and Recognition, ICDAR 2003, pp. 859–864 (2003)Google Scholar
  9. 9.
    Michalak, H., Okarma, K.: Region based adaptive binarization for optical character recognition purposes. In: 2018 International Interdisciplinary PhD Workshop (IIPhDW), pp. 361–366 (2018)Google Scholar
  10. 10.
    Michalak, H., Okarma, K.: Fast adaptive image binarization using the region based approach. In: Silhavy, R. (ed.) Artificial Intelligence and Algorithms in Intelligent Systems, AISC, vol. 764, pp. 79–90. Springer International Publishing (2019)Google Scholar
  11. 11.
    Moghaddam, R.F., Cheriet, M.: AdOtsu: an adaptive and parameterless generalization of Otsu’s method for document image binarization. Pattern Recognit. 45(6), 2419–2431 (2012)CrossRefGoogle Scholar
  12. 12.
    Niblack, W.: An Introduction to Digital Image Processing. Prentice Hall, Englewood Cliffs (1986)Google Scholar
  13. 13.
    Ntirogiannis, K., Gatos, B., Pratikakis, I.: Performance evaluation methodology for historical document image binarization. IEEE Trans. Image Process. 22(2), 595–609 (2013)MathSciNetCrossRefGoogle Scholar
  14. 14.
    Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)CrossRefGoogle Scholar
  15. 15.
    Pratikakis, I., Zagoris, K., Barlas, G., Gatos, B.: ICDAR 2017 Document Image Binarization COmpetition (DIBCO 2017) (2017). https://vc.ee.duth.gr/dibco2017/
  16. 16.
    Samorodova, O.A., Samorodov, A.V.: Fast implementation of the Niblack binarization algorithm for microscope image segmentation. Pattern Recognit. Image Anal. 26(3), 548–551 (2016)CrossRefGoogle Scholar
  17. 17.
    Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recognit. 33(2), 225–236 (2000)CrossRefGoogle Scholar
  18. 18.
    Saxena, L.P.: Niblack’s binarization method and its modifications to real-time applications: a review. Artif. Intell. Rev. 1–33 (2017)Google Scholar
  19. 19.
    Shrivastava, A., Srivastava, D.K.: A review on pixel-based binarization of gray images. In: AISC, vol. 439, pp. 357–364. Springer, Singapore (2016)Google Scholar
  20. 20.
    Su, B., Lu, S., Tan, C.L.: Robust document image binarization technique for degraded document images. IEEE Trans. Image Process. 22(4), 1408–1417 (2013)MathSciNetCrossRefGoogle Scholar
  21. 21.
    Wen, J., Li, S., Sun, J.: A new binarization method for non-uniform illuminated document images. Pattern Recognit. 46(6), 1670–1690 (2013)CrossRefGoogle Scholar
  22. 22.
    Wolf, C., Jolion, J.M.: Extraction and recognition of artificial text in multimedia documents. Form. Pattern Anal. Appl. 6(4), 309–326 (2004)MathSciNetGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. 1.Department of Signal Processing and Multimedia Engineering, Faculty of Electrical EngineeringWest Pomeranian University of Technology, SzczecinSzczecinPoland

Personalised recommendations