Skip to main content

Preprocessing of Document Images Based on the GGD and GMM for Binarization of Degraded Ancient Papyri Images

  • Conference paper
  • First Online:
Progress in Image Processing, Pattern Recognition and Communication Systems (CORES 2021, IP&C 2021, ACS 2021)

Abstract

Thresholding of document images is one of the most relevant operations that influence the final results of their further analysis. Although many image binarization methods have been proposed during recent several years, starting from global thresholding, through local and adaptive methods, to more sophisticated multi-stage algorithms and the use of deep convolutional neural networks, proper thresholding of degraded historical document images is still an open challenge. Due to the release of the recent challenging DIBCO 2019 dataset, containing two categories of images with the particularly demanding papyri dataset (track B), most of the earlier proposed methods turned out to be significantly less effective for them. Analysing the competition’s results of the DIBCO 2019, apart from the variety of submitted trendy methods based on the use of deep learning, the great importance of image preprocessing may be observed for such demanding images. Hence, in this paper, we present the applicability analysis of the previously proposed preprocessing methods based on the use of the GGD and GMM in combination with various image binarization algorithms for this purpose. The obtained results are comparable with the state-of-the-art methods, including the top algorithms submitted to the recent DIBCO series contest.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bernsen, J.: Dynamic thresholding of grey-level images. In: Proceedings of 8th International Conference on Pattern Recognition (ICPR), pp. 1251–1255 (1986)

    Google Scholar 

  2. Bradley, D., Roth, G.: Adaptive thresholding using the integral image. J. Graph. Tools 12(2), 13–21 (2007). https://doi.org/10.1080/2151237X.2007.10129236

    Article  Google Scholar 

  3. Chou, C.H., Lin, W.H., Chang, F.: A binarization method with learning-built rules for document images produced by cameras. Pattern Recogn. 43(4), 1518–1530 (2010). https://doi.org/10.1016/j.patcog.2009.10.016

    Article  MATH  Google Scholar 

  4. Clarke, R.J.: Transform Coding of Images. Academic, New York (1985)

    Google Scholar 

  5. Gatos, B., Pratikakis, I., Perantonis, S.: Adaptive degraded document image binarization. Pattern Recogn. 39(3), 317–327 (2006). https://doi.org/10.1016/j.patcog.2005.09.010

    Article  MATH  Google Scholar 

  6. Khurshid, K., Siddiqi, I., Faure, C., Vincent, N.: Comparison of Niblack inspired binarization methods for ancient documents. In: Document Recognition and Retrieval XVI, vol. 7247, pp. 72470U-1–72470U-9. SPIE (2009). https://doi.org/10.1117/12.805827

  7. Krupiński, R.: Approximated fast estimator for the shape parameter of generalized Gaussian distribution for a small sample size. Bull. Pol. Acad. Sci. Tech. Sci. 63(2), 405–411 (2015). https://doi.org/10.1515/bpasts-2015-0046

    Article  Google Scholar 

  8. Krupiński, R.: Reconstructed quantized coefficients modeled with generalized Gaussian distribution with exponent 1/3. Image Process. Commun. 21(4), 5–12 (2016)

    Article  Google Scholar 

  9. Krupiński, R.: Modeling quantized coefficients with generalized Gaussian distribution with exponent 1/m, m = 2, 3, ... In: Gruca, A., Czachórski, T., Harezlak, K., Kozielski, S., Piotrowska, A. (eds.) Man-Machine Interactions 5: 5th International Conference on Man-Machine Interactions, ICMMI 2017, pp. 228–237. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-67792-7_23

    Chapter  Google Scholar 

  10. Krupiński, R., Lech, P., Okarma, K.: Improved two-step binarization of degraded document images based on Gaussian mixture model. In: Krzhizhanovskaya, V., et al. (eds.) Computational Science - ICCS 2020. Lecture Notes in Computer Science, vol. 12141, pp. 467–480. Springer, Heidelberg (2020). https://doi.org/10.1007/978-3-030-50426-7_35

    Chapter  Google Scholar 

  11. Krupiński, R., Lech, P., Tecław, M., Okarma, K.: Binarization of degraded document images with generalized Gaussian distribution. In: Rodrigues, J., et al. (eds.) Computational Science - ICCS 2019. Lecture Notes in Computer Science, vol. 11540, pp. 177–190. Springer, Heidelberg (2019). https://doi.org/10.1007/978-3-030-22750-0_14

    Chapter  Google Scholar 

  12. Lins, R.D., Bernardino, R.B., de Jesus, D.M.: A quality and time assessment of binarization algorithms. In: Proceedings of the 15th IAPR International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia, 20–25 September 2019, pp. 1444–1450. IEEE (2019). https://doi.org/10.1109/ICDAR.2019.00232

  13. Lu, H., Kot, A.C., Shi, Y.Q.: Distance-reciprocal distortion measure for binary document images. IEEE Signal Process. Lett. 11(2), 228–231 (2004). https://doi.org/10.1109/LSP.2003.821748

    Article  Google Scholar 

  14. Michalak, H., Okarma, K.: Fast binarization of unevenly illuminated document images based on background estimation for optical character recognition purposes. J. Univ. Comput. Sci. 25(6), 627–646 (2019). https://doi.org/10.3217/jucs-025-06-0627

    Article  Google Scholar 

  15. Michalak, H., Okarma, K.: Improvement of image binarization methods using image preprocessing with local entropy filtering for alphanumerical character recognition purposes. Entropy 11(2), 286 (2019). https://doi.org/10.3390/e21060562

    Article  MathSciNet  Google Scholar 

  16. Niblack, W.: An Introduction to Digital Image Processing. Prentice Hall, Englewood Cliffs (1986)

    Google Scholar 

  17. Ntirogiannis, K., Gatos, B., Pratikakis, I.: Performance evaluation methodology for historical document image binarization. IEEE Trans. Image Process. 22(2), 595–609 (2013). https://doi.org/10.1109/TIP.2012.2219550

    Article  MathSciNet  MATH  Google Scholar 

  18. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979). https://doi.org/10.1109/TSMC.1979.4310076

    Article  MathSciNet  Google Scholar 

  19. Pratikakis, I., Zagoris, K., Karagiannis, X., Tsochatzidis, L., Mondal, T., Marthot-Santaniello, I.: ICDAR 2019 competition on document image binarization (DIBCO 2019). In: 2019 International Conference on Document Analysis and Recognition (ICDAR), Sydney, NSW, Australia, pp. 1547–1556 (2019). https://doi.org/10.1109/ICDAR.2019.00249

  20. Sauvola, J., Pietikäinen, M.: Adaptive document image binarization. Pattern Recogn. 33(2), 225–236 (2000). https://doi.org/10.1016/S0031-3203(99)00055-2

    Article  Google Scholar 

  21. Shrivastava, A., Srivastava, D.K.: A review on pixel-based binarization of gray images. In: Advances in Intelligent Systems and Computing, vol. 439, pp. 357–364. Springer, Singapore (2016). https://doi.org/10.1007/978-981-10-0755-2_38

  22. Wolf, C., Jolion, J.M.: Extraction and recognition of artificial text in multimedia documents. Formal Pattern Anal. Appl. 6(4), 309–326 (2004). https://doi.org/10.1007/s10044-003-0197-7

    Article  MathSciNet  Google Scholar 

  23. Yu, S., Zhang, A., Li, H.: A review of estimating the shape parameter of generalized Gaussian distribution. J. Comput. Inf. Syst. 21(8), 9055–9064 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Krzysztof Okarma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Michalak, H., Krupiński, R., Lech, P., Okarma, K. (2022). Preprocessing of Document Images Based on the GGD and GMM for Binarization of Degraded Ancient Papyri Images. In: Choraś, M., Choraś, R.S., Kurzyński, M., Trajdos, P., Pejaś, J., Hyla, T. (eds) Progress in Image Processing, Pattern Recognition and Communication Systems. CORES IP&C ACS 2021 2021 2021. Lecture Notes in Networks and Systems, vol 255. Springer, Cham. https://doi.org/10.1007/978-3-030-81523-3_11

Download citation

Publish with us

Policies and ethics