Skip to main content
Log in

Farsi and Arabic document images lossy compression based on the mixed raster content model

  • Original Paper
  • Published:
International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Abstract

Recently, the mixed raster content model was proposed for compound document image compression. Most state-of-the-art document image compression methods, such as DjVu, work on the basis of this model but they have some disadvantages, especially for Farsi and Arabic document images. First, the Farsi/Arabic script has some characteristics which can be used to further improve the compression performance. Second, existing segmentation methods have focused on well-separating the textual objects from the background and/or optimizing the rate-distortion trade-off; nevertheless, they have not considered the text readability and OCR facility. Third, these methods usually suffer from the undesired jaggy artifact and misclassifying the important textual details. In this paper, MRC-based document image compression method is proposed which compromises rate-distortion trade-off better than the existing state-of-the-art document compression methods. The proposed method has higher performance in the aspects of segmentation, bi-level mask layer compression, OCR facility, and the overall compression. It uses a 1D pattern matching technique for compression of mask layer. It also uses a segmentation method which is sensitive enough to the small textual objects. Experimental results show that the proposed method has considerably higher compression performance than that of the state-of-the-art compression method DjVu, as high as 1.75–2.3.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. de Queiroz R.L., Fan Z., Tran T.D.: Optimizing block-thresholding segmentation for multilayer compression of compound images. IEEE Trans. Image Proc. 9(9), 1461–1471 (2000)

    Article  MATH  Google Scholar 

  2. Feng G., Bouman C.A.: High-quality mrc document coding. IEEE Trans. Image Proc. 15(10), 3152–3169 (2006)

    Article  Google Scholar 

  3. de Queiroz, R., Buckley, R., Xu, M.: Mixed raster content (MRC) model for compound image compression. In: Proceedings IS&T/SPIE Symposium on Electronic Imaging Science and Technology Visual Communications and Image Processing, vol. 3653, San Jose, CA, pp. 1106–1117 (1999)

  4. Lam E.Y.: Compound document compression with model-based biased reconstruction. J. Electron. Imaging 13(1), 191–194 (2004)

    Article  Google Scholar 

  5. Sharpe, L. H., Manns, B.: JPEG 2000 Options for document image compression. In: Kantor, P.B., Kanungo, T., Zhou, J. (eds.) Document Recognition and Retrieval IX, Proceedings of SPIE, vol. 4670, pp. 167–173 (2002)

  6. Bottou L., Haffner P., Howard P.G., Simard P., Bengio Y., LeCun Y.: High quality document image compression with DjVu. J. Elec. Imaging 7(3), 410–425 (1998)

    Article  Google Scholar 

  7. Fan Z., Jacobs T.: Segmentation for mixed raster contents with multiple extracted constant color areas. Proc. SPIE-IS&T Electron. Imaging 5667, 251–262 (2005)

    Article  Google Scholar 

  8. Buckley, R., Venable, D., McIntyre, L.: New developments in color facsimile and internet fax. In: Proceedings of the IS&T/SID 5th Color Imaging Conference: Color Science, Systems, and Applications, Scottsdale, AZ, vol. 3, pp. 296–300, 17–20 Nov. (1997)

  9. Huttenlocher, D., Felzenszwalb, P., Rucklidge, W.: DigiPaper: A versatile color document image representation. In: Proceedings of IEEE International Conference on Image Proccessing, pp. 219–223, Kobe, Japan, October (1999)

  10. Barthel, K. U., Partlin, S. M., Thierschmann, M.: New technology for raster document image compression. In: Part of the IS&T/SPIE Conference on Document Recognition and Retrieval VII, San Jose, CA, vol. 3967, pp. 286–290 (2000)

  11. Thierschmann, M., Barthel, K. -U., Martin, U. -E.: A scalable DSP-architecture for high-speed color document compression. In: Kantor, P.B., Lopresti, D.P., Zhou, J. (eds.) Document Recognition and Retrieval VIII. Proceedings of SPIE, vol. 4307, pp. 158–166 (2001)

  12. Wu B.-F., Chiu C.-C., Chen Y.-L.: Algorithms for compressing compound document images with large text/background overlap. IEE Proc. Vis. Image Signal Process. 151(6), 453–459 (2004)

    Article  Google Scholar 

  13. Haneda, E., Yi, J., Bouman, C. A.: Segmentation for MRC compression. In: Eschbach, R., Marcu G.G. (eds.) Color Imaging XII: Processing, Hardcopy, and Applications. Proceedings of SPIE-IS&T Electronic Imaging, vol. 6493, pp. 252–262 (2007)

  14. Cheriet M., Kharma N., Liu C.-L., Suen C.Y.: Character Recognition Systems: A Guide for Students and Practioners. Wiley, NY (2007)

    Book  MATH  Google Scholar 

  15. Trier O.D., Jain A.K.: Goal-directed evaluation of binarization methods. IEEE Trans. Pattern Anal. Mach. Intell. PAMI 17(12), 1191–1201 (1995)

    Article  Google Scholar 

  16. Sarfraz, M., Zidouri, A., Nawaz, S. N.: On Offline arabic character recognition. In: Sarfraz M. (ed.) Computer-Aided Intelligent Recodgnition: Techniques and Applications. Wiley, NY (2005)

  17. Zaghetto, A., de Queiroz, R. L.: Iterative pre- and post-processing for MRC layers of scanned documents. In: Proceedings of IEEE International Conference on Image Processing, ICIP, San Diego, CA, USA, Oct. (2008)

  18. Kia O.E., Doermann D.S.: Residual coding in document image compression. IEEE Trans. Image Proc. 9(6), 961–969 (2000)

    Article  Google Scholar 

  19. Sezgin M., Sankur B.: Survey over image thresholding techniques and quantitative performance evaluation. J. Elec. Imaging 13(1), 146–165 (2004)

    Article  Google Scholar 

  20. Trier O.D., Taxt T.: Evaluation of binarization methods for document images. IEEE Trans. Pattern Anal. Mach. Intell. PAMI 17(3), 312–315 (1995)

    Article  Google Scholar 

  21. Cheng H., Bouman C.A.: Document compression using rate- distortion optimized segmentation. J. Electron. Imaging 10(2), 460–474 (2001)

    Article  Google Scholar 

  22. Cheng, H., Feng, G., Bouman, C. A.: Rate-distortion based segmentation for MRC compression. In: Proceedings of SPIE Conference on Color Imaging: Device-Independent Color, Color Hardcopy and Applications, vol. 4663, San Jose, CA, 21–23 January (2002)

  23. Simard, P., Malvar, H., Rinker, J., Renshaw, E.: A Foreground/background separation algorithm for image compression. In: Data Compression Conference 2004, pp. 498–507 (2004)

  24. Fan, Z., Xu, M.: A simple segmentation algorithm for mixed rater contents image representation. In: Proceedings of IS&T/SPIE Symposium on Electronic Imaging Science and Technology, Visual Communications and Image Processing, San Jose, CA, vol. 4663, pp. 63–71 (2002)

  25. Saykol, E., Sinop, A. K., Gudukbay, U., Ulusoy, O., Cetin, A. E.: Content-based retrieval of historical ottoman documents stored as textual images. IEEE Trans. Image Process. 13(3) (2004)

  26. Ascher R.N., Nagy G.: A means for achieving a high degree of compaction on scan-digitized printed text. IEEE Trans. Comput. 23, 1174–1179 (1974)

    Article  MATH  Google Scholar 

  27. Pratt W.K., Capitant P.J., Chen W.H., Hamilton E.R., Wallis R.H.: Combined symbol matching facsimile data compression system. Proc. IEEE 68(7), 786–796 (1980)

    Article  Google Scholar 

  28. Brickman N.F., Rosenbaum W.S.: Word autocorrelation redundancy match (WARM) technology. IBM J. Res. Devel. 26(6), 681–686 (1982)

    Article  Google Scholar 

  29. Johnsen O., Segen J., Cash G.L.: Coding of two-level pictures by pattern matching and substitution. Bell Syst. Tech. J. 62(8), 2513–2545 (1983)

    Google Scholar 

  30. Holt, M. J. J., Xydeas, C. S.: Recent developments in image data compression for digital facsimile. ICL Tech. J. 123–146 (1986)

  31. Holt M.J.: A fast bi-level template matching algorithm for document image data compression. In: Kittler, J. (eds) Pattern Recognition, pp. 230–239. Springer, Berlin (1988)

    Google Scholar 

  32. Yang Y., Yan H., Yu D.: Content-lossless document image compression based on structural analysis and pattern matching. Pattern Recognit. 33, 1277–1293 (2000)

    Article  Google Scholar 

  33. Witten, I. H., Bell, T. C., Emberson, H., Inglis, S., Moffat, A.: Textual image compression: two-stage lossy/lossless encoding of textual images. In: Proceedings of the IEEE, vol. 82, no. 6, (1994)

  34. Witten I.H., Moffat A., Bell T.C.: Managing Gigabytes: Compressing and Indexing Documents and Images, 2nd edn. Academic, London (1999)

    Google Scholar 

  35. Howard P.G.: Text image compression using soft pattern matching. Comput. J. 40(2/3), 146–156 (1997)

    Article  Google Scholar 

  36. ISO/IEC International Standard 11544: Progressive Binary Image Compression, JBIG, ITU-Recommendation T.82 (1993)

  37. Howard P.G., Kossentini F., Forchhammer S., Ruchlidge W.J.: The Emerging JBIG2 Standard. IEEE Trans. Circuits Syst. Video Tech. 8(7), 838–848 (1998)

    Article  Google Scholar 

  38. Ye Y., Cosman P.: Fast and memory efficient text image compression with JBIG2. IEEE Trans. Image Proc. 12(8), 944–956 (2003)

    Article  Google Scholar 

  39. Ye Y., Cosman P.: Dictionary design for text image compression with JBIG2. IEEE Trans. Image Process. 10(6), 818–828 (2001)

    Article  MATH  Google Scholar 

  40. Cormen T.H., Leiserson C.E., Rivest R.L., Stein C.: Introduction to Algorithms, 2nd edn. McGraw-Hill, NY (2001)

    MATH  Google Scholar 

  41. Jahne B.: Digital Image Processing. Springer, Berlin (2005)

    Google Scholar 

  42. Makhoul, J., Francis, K., Richard, S., Ralph, W.: Performance measures for information extraction. In: Proceedings of DARPA Broadcast News Workshop, Herndon, VA, February (1999)

  43. Baeza-Yates R., Ribeiro-Neto B.: Modern Information Retrieval. ACM Press/Addison-Wesley, New York/Reading (1999)

    Google Scholar 

  44. Hu, M. K. Visual pattern recognition by moment invariant. In: IRE Transactions on Information Theory, IT–8, pp. 179–187 (1962)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mojtaba Lotfizad.

Additional information

This work was supported by the Iranian Telecommunication Research Center (ITRC) under the granted number TMU87-06-30.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Grailu, H., Lotfizad, M. & Sadoghi-Yazdi, H. Farsi and Arabic document images lossy compression based on the mixed raster content model. IJDAR 12, 227–248 (2009). https://doi.org/10.1007/s10032-009-0088-6

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10032-009-0088-6

Keywords

Navigation