Farsi and Arabic document images lossy compression based on the mixed raster content model

Grailu, Hadi; Lotfizad, Mojtaba; Sadoghi-Yazdi, Hadi

doi:10.1007/s10032-009-0088-6

Farsi and Arabic document images lossy compression based on the mixed raster content model

Original Paper
Published: 30 June 2009

Volume 12, pages 227–248, (2009)
Cite this article

International Journal on Document Analysis and Recognition (IJDAR) Aims and scope Submit manuscript

Hadi Grailu¹,
Mojtaba Lotfizad¹ &
Hadi Sadoghi-Yazdi²

114 Accesses
6 Citations
3 Altmetric
Explore all metrics

Abstract

Recently, the mixed raster content model was proposed for compound document image compression. Most state-of-the-art document image compression methods, such as DjVu, work on the basis of this model but they have some disadvantages, especially for Farsi and Arabic document images. First, the Farsi/Arabic script has some characteristics which can be used to further improve the compression performance. Second, existing segmentation methods have focused on well-separating the textual objects from the background and/or optimizing the rate-distortion trade-off; nevertheless, they have not considered the text readability and OCR facility. Third, these methods usually suffer from the undesired jaggy artifact and misclassifying the important textual details. In this paper, MRC-based document image compression method is proposed which compromises rate-distortion trade-off better than the existing state-of-the-art document compression methods. The proposed method has higher performance in the aspects of segmentation, bi-level mask layer compression, OCR facility, and the overall compression. It uses a 1D pattern matching technique for compression of mask layer. It also uses a segmentation method which is sensitive enough to the small textual objects. Experimental results show that the proposed method has considerably higher compression performance than that of the state-of-the-art compression method DjVu, as high as 1.75–2.3.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Robust Approach for Arabic Document Images Segmentation and Indexation

An adaptive text-line extraction algorithm for printed Arabic documents with diacritics

Article 11 September 2020

Handwritten Text Segmentation Approach in Historical Arabic Documents

References

de Queiroz R.L., Fan Z., Tran T.D.: Optimizing block-thresholding segmentation for multilayer compression of compound images. IEEE Trans. Image Proc. 9(9), 1461–1471 (2000)
Article MATH Google Scholar
Feng G., Bouman C.A.: High-quality mrc document coding. IEEE Trans. Image Proc. 15(10), 3152–3169 (2006)
Article Google Scholar
de Queiroz, R., Buckley, R., Xu, M.: Mixed raster content (MRC) model for compound image compression. In: Proceedings IS&T/SPIE Symposium on Electronic Imaging Science and Technology Visual Communications and Image Processing, vol. 3653, San Jose, CA, pp. 1106–1117 (1999)
Lam E.Y.: Compound document compression with model-based biased reconstruction. J. Electron. Imaging 13(1), 191–194 (2004)
Article Google Scholar
Sharpe, L. H., Manns, B.: JPEG 2000 Options for document image compression. In: Kantor, P.B., Kanungo, T., Zhou, J. (eds.) Document Recognition and Retrieval IX, Proceedings of SPIE, vol. 4670, pp. 167–173 (2002)
Bottou L., Haffner P., Howard P.G., Simard P., Bengio Y., LeCun Y.: High quality document image compression with DjVu. J. Elec. Imaging 7(3), 410–425 (1998)
Article Google Scholar
Fan Z., Jacobs T.: Segmentation for mixed raster contents with multiple extracted constant color areas. Proc. SPIE-IS&T Electron. Imaging 5667, 251–262 (2005)
Article Google Scholar
Buckley, R., Venable, D., McIntyre, L.: New developments in color facsimile and internet fax. In: Proceedings of the IS&T/SID 5th Color Imaging Conference: Color Science, Systems, and Applications, Scottsdale, AZ, vol. 3, pp. 296–300, 17–20 Nov. (1997)
Huttenlocher, D., Felzenszwalb, P., Rucklidge, W.: DigiPaper: A versatile color document image representation. In: Proceedings of IEEE International Conference on Image Proccessing, pp. 219–223, Kobe, Japan, October (1999)
Barthel, K. U., Partlin, S. M., Thierschmann, M.: New technology for raster document image compression. In: Part of the IS&T/SPIE Conference on Document Recognition and Retrieval VII, San Jose, CA, vol. 3967, pp. 286–290 (2000)
Thierschmann, M., Barthel, K. -U., Martin, U. -E.: A scalable DSP-architecture for high-speed color document compression. In: Kantor, P.B., Lopresti, D.P., Zhou, J. (eds.) Document Recognition and Retrieval VIII. Proceedings of SPIE, vol. 4307, pp. 158–166 (2001)
Wu B.-F., Chiu C.-C., Chen Y.-L.: Algorithms for compressing compound document images with large text/background overlap. IEE Proc. Vis. Image Signal Process. 151(6), 453–459 (2004)
Article Google Scholar
Haneda, E., Yi, J., Bouman, C. A.: Segmentation for MRC compression. In: Eschbach, R., Marcu G.G. (eds.) Color Imaging XII: Processing, Hardcopy, and Applications. Proceedings of SPIE-IS&T Electronic Imaging, vol. 6493, pp. 252–262 (2007)
Cheriet M., Kharma N., Liu C.-L., Suen C.Y.: Character Recognition Systems: A Guide for Students and Practioners. Wiley, NY (2007)
Book MATH Google Scholar
Trier O.D., Jain A.K.: Goal-directed evaluation of binarization methods. IEEE Trans. Pattern Anal. Mach. Intell. PAMI 17(12), 1191–1201 (1995)
Article Google Scholar
Sarfraz, M., Zidouri, A., Nawaz, S. N.: On Offline arabic character recognition. In: Sarfraz M. (ed.) Computer-Aided Intelligent Recodgnition: Techniques and Applications. Wiley, NY (2005)
Zaghetto, A., de Queiroz, R. L.: Iterative pre- and post-processing for MRC layers of scanned documents. In: Proceedings of IEEE International Conference on Image Processing, ICIP, San Diego, CA, USA, Oct. (2008)
Kia O.E., Doermann D.S.: Residual coding in document image compression. IEEE Trans. Image Proc. 9(6), 961–969 (2000)
Article Google Scholar
Sezgin M., Sankur B.: Survey over image thresholding techniques and quantitative performance evaluation. J. Elec. Imaging 13(1), 146–165 (2004)
Article Google Scholar
Trier O.D., Taxt T.: Evaluation of binarization methods for document images. IEEE Trans. Pattern Anal. Mach. Intell. PAMI 17(3), 312–315 (1995)
Article Google Scholar
Cheng H., Bouman C.A.: Document compression using rate- distortion optimized segmentation. J. Electron. Imaging 10(2), 460–474 (2001)
Article Google Scholar
Cheng, H., Feng, G., Bouman, C. A.: Rate-distortion based segmentation for MRC compression. In: Proceedings of SPIE Conference on Color Imaging: Device-Independent Color, Color Hardcopy and Applications, vol. 4663, San Jose, CA, 21–23 January (2002)
Simard, P., Malvar, H., Rinker, J., Renshaw, E.: A Foreground/background separation algorithm for image compression. In: Data Compression Conference 2004, pp. 498–507 (2004)
Fan, Z., Xu, M.: A simple segmentation algorithm for mixed rater contents image representation. In: Proceedings of IS&T/SPIE Symposium on Electronic Imaging Science and Technology, Visual Communications and Image Processing, San Jose, CA, vol. 4663, pp. 63–71 (2002)
Saykol, E., Sinop, A. K., Gudukbay, U., Ulusoy, O., Cetin, A. E.: Content-based retrieval of historical ottoman documents stored as textual images. IEEE Trans. Image Process. 13(3) (2004)
Ascher R.N., Nagy G.: A means for achieving a high degree of compaction on scan-digitized printed text. IEEE Trans. Comput. 23, 1174–1179 (1974)
Article MATH Google Scholar
Pratt W.K., Capitant P.J., Chen W.H., Hamilton E.R., Wallis R.H.: Combined symbol matching facsimile data compression system. Proc. IEEE 68(7), 786–796 (1980)
Article Google Scholar
Brickman N.F., Rosenbaum W.S.: Word autocorrelation redundancy match (WARM) technology. IBM J. Res. Devel. 26(6), 681–686 (1982)
Article Google Scholar
Johnsen O., Segen J., Cash G.L.: Coding of two-level pictures by pattern matching and substitution. Bell Syst. Tech. J. 62(8), 2513–2545 (1983)
Google Scholar
Holt, M. J. J., Xydeas, C. S.: Recent developments in image data compression for digital facsimile. ICL Tech. J. 123–146 (1986)
Holt M.J.: A fast bi-level template matching algorithm for document image data compression. In: Kittler, J. (eds) Pattern Recognition, pp. 230–239. Springer, Berlin (1988)
Google Scholar
Yang Y., Yan H., Yu D.: Content-lossless document image compression based on structural analysis and pattern matching. Pattern Recognit. 33, 1277–1293 (2000)
Article Google Scholar
Witten, I. H., Bell, T. C., Emberson, H., Inglis, S., Moffat, A.: Textual image compression: two-stage lossy/lossless encoding of textual images. In: Proceedings of the IEEE, vol. 82, no. 6, (1994)
Witten I.H., Moffat A., Bell T.C.: Managing Gigabytes: Compressing and Indexing Documents and Images, 2nd edn. Academic, London (1999)
Google Scholar
Howard P.G.: Text image compression using soft pattern matching. Comput. J. 40(2/3), 146–156 (1997)
Article Google Scholar
ISO/IEC International Standard 11544: Progressive Binary Image Compression, JBIG, ITU-Recommendation T.82 (1993)
Howard P.G., Kossentini F., Forchhammer S., Ruchlidge W.J.: The Emerging JBIG2 Standard. IEEE Trans. Circuits Syst. Video Tech. 8(7), 838–848 (1998)
Article Google Scholar
Ye Y., Cosman P.: Fast and memory efficient text image compression with JBIG2. IEEE Trans. Image Proc. 12(8), 944–956 (2003)
Article Google Scholar
Ye Y., Cosman P.: Dictionary design for text image compression with JBIG2. IEEE Trans. Image Process. 10(6), 818–828 (2001)
Article MATH Google Scholar
Cormen T.H., Leiserson C.E., Rivest R.L., Stein C.: Introduction to Algorithms, 2nd edn. McGraw-Hill, NY (2001)
MATH Google Scholar
Jahne B.: Digital Image Processing. Springer, Berlin (2005)
Google Scholar
Makhoul, J., Francis, K., Richard, S., Ralph, W.: Performance measures for information extraction. In: Proceedings of DARPA Broadcast News Workshop, Herndon, VA, February (1999)
Baeza-Yates R., Ribeiro-Neto B.: Modern Information Retrieval. ACM Press/Addison-Wesley, New York/Reading (1999)
Google Scholar
Hu, M. K. Visual pattern recognition by moment invariant. In: IRE Transactions on Information Theory, IT–8, pp. 179–187 (1962)

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, Tarbiat Modares University, Tehran, Iran
Hadi Grailu & Mojtaba Lotfizad
Department of Computer Engineering, Ferdowsi University of Mashhad, Mashhad, Iran
Hadi Sadoghi-Yazdi

Authors

Hadi Grailu
View author publications
You can also search for this author in PubMed Google Scholar
Mojtaba Lotfizad
View author publications
You can also search for this author in PubMed Google Scholar
Hadi Sadoghi-Yazdi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mojtaba Lotfizad.

Additional information

This work was supported by the Iranian Telecommunication Research Center (ITRC) under the granted number TMU87-06-30.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Grailu, H., Lotfizad, M. & Sadoghi-Yazdi, H. Farsi and Arabic document images lossy compression based on the mixed raster content model. IJDAR 12, 227–248 (2009). https://doi.org/10.1007/s10032-009-0088-6

Download citation

Received: 26 October 2008
Revised: 23 April 2009
Accepted: 25 April 2009
Published: 30 June 2009
Issue Date: December 2009
DOI: https://doi.org/10.1007/s10032-009-0088-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Farsi and Arabic document images lossy compression based on the mixed raster content model

Abstract

Access this article

Similar content being viewed by others

A Robust Approach for Arabic Document Images Segmentation and Indexation

An adaptive text-line extraction algorithm for printed Arabic documents with diacritics

Handwritten Text Segmentation Approach in Historical Arabic Documents

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Farsi and Arabic document images lossy compression based on the mixed raster content model

Abstract

Access this article

Similar content being viewed by others

A Robust Approach for Arabic Document Images Segmentation and Indexation

An adaptive text-line extraction algorithm for printed Arabic documents with diacritics

Handwritten Text Segmentation Approach in Historical Arabic Documents

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation