Abstract
Recently, the mixed raster content model was proposed for compound document image compression. Most state-of-the-art document image compression methods, such as DjVu, work on the basis of this model but they have some disadvantages, especially for Farsi and Arabic document images. First, the Farsi/Arabic script has some characteristics which can be used to further improve the compression performance. Second, existing segmentation methods have focused on well-separating the textual objects from the background and/or optimizing the rate-distortion trade-off; nevertheless, they have not considered the text readability and OCR facility. Third, these methods usually suffer from the undesired jaggy artifact and misclassifying the important textual details. In this paper, MRC-based document image compression method is proposed which compromises rate-distortion trade-off better than the existing state-of-the-art document compression methods. The proposed method has higher performance in the aspects of segmentation, bi-level mask layer compression, OCR facility, and the overall compression. It uses a 1D pattern matching technique for compression of mask layer. It also uses a segmentation method which is sensitive enough to the small textual objects. Experimental results show that the proposed method has considerably higher compression performance than that of the state-of-the-art compression method DjVu, as high as 1.75–2.3.
Similar content being viewed by others
References
de Queiroz R.L., Fan Z., Tran T.D.: Optimizing block-thresholding segmentation for multilayer compression of compound images. IEEE Trans. Image Proc. 9(9), 1461–1471 (2000)
Feng G., Bouman C.A.: High-quality mrc document coding. IEEE Trans. Image Proc. 15(10), 3152–3169 (2006)
de Queiroz, R., Buckley, R., Xu, M.: Mixed raster content (MRC) model for compound image compression. In: Proceedings IS&T/SPIE Symposium on Electronic Imaging Science and Technology Visual Communications and Image Processing, vol. 3653, San Jose, CA, pp. 1106–1117 (1999)
Lam E.Y.: Compound document compression with model-based biased reconstruction. J. Electron. Imaging 13(1), 191–194 (2004)
Sharpe, L. H., Manns, B.: JPEG 2000 Options for document image compression. In: Kantor, P.B., Kanungo, T., Zhou, J. (eds.) Document Recognition and Retrieval IX, Proceedings of SPIE, vol. 4670, pp. 167–173 (2002)
Bottou L., Haffner P., Howard P.G., Simard P., Bengio Y., LeCun Y.: High quality document image compression with DjVu. J. Elec. Imaging 7(3), 410–425 (1998)
Fan Z., Jacobs T.: Segmentation for mixed raster contents with multiple extracted constant color areas. Proc. SPIE-IS&T Electron. Imaging 5667, 251–262 (2005)
Buckley, R., Venable, D., McIntyre, L.: New developments in color facsimile and internet fax. In: Proceedings of the IS&T/SID 5th Color Imaging Conference: Color Science, Systems, and Applications, Scottsdale, AZ, vol. 3, pp. 296–300, 17–20 Nov. (1997)
Huttenlocher, D., Felzenszwalb, P., Rucklidge, W.: DigiPaper: A versatile color document image representation. In: Proceedings of IEEE International Conference on Image Proccessing, pp. 219–223, Kobe, Japan, October (1999)
Barthel, K. U., Partlin, S. M., Thierschmann, M.: New technology for raster document image compression. In: Part of the IS&T/SPIE Conference on Document Recognition and Retrieval VII, San Jose, CA, vol. 3967, pp. 286–290 (2000)
Thierschmann, M., Barthel, K. -U., Martin, U. -E.: A scalable DSP-architecture for high-speed color document compression. In: Kantor, P.B., Lopresti, D.P., Zhou, J. (eds.) Document Recognition and Retrieval VIII. Proceedings of SPIE, vol. 4307, pp. 158–166 (2001)
Wu B.-F., Chiu C.-C., Chen Y.-L.: Algorithms for compressing compound document images with large text/background overlap. IEE Proc. Vis. Image Signal Process. 151(6), 453–459 (2004)
Haneda, E., Yi, J., Bouman, C. A.: Segmentation for MRC compression. In: Eschbach, R., Marcu G.G. (eds.) Color Imaging XII: Processing, Hardcopy, and Applications. Proceedings of SPIE-IS&T Electronic Imaging, vol. 6493, pp. 252–262 (2007)
Cheriet M., Kharma N., Liu C.-L., Suen C.Y.: Character Recognition Systems: A Guide for Students and Practioners. Wiley, NY (2007)
Trier O.D., Jain A.K.: Goal-directed evaluation of binarization methods. IEEE Trans. Pattern Anal. Mach. Intell. PAMI 17(12), 1191–1201 (1995)
Sarfraz, M., Zidouri, A., Nawaz, S. N.: On Offline arabic character recognition. In: Sarfraz M. (ed.) Computer-Aided Intelligent Recodgnition: Techniques and Applications. Wiley, NY (2005)
Zaghetto, A., de Queiroz, R. L.: Iterative pre- and post-processing for MRC layers of scanned documents. In: Proceedings of IEEE International Conference on Image Processing, ICIP, San Diego, CA, USA, Oct. (2008)
Kia O.E., Doermann D.S.: Residual coding in document image compression. IEEE Trans. Image Proc. 9(6), 961–969 (2000)
Sezgin M., Sankur B.: Survey over image thresholding techniques and quantitative performance evaluation. J. Elec. Imaging 13(1), 146–165 (2004)
Trier O.D., Taxt T.: Evaluation of binarization methods for document images. IEEE Trans. Pattern Anal. Mach. Intell. PAMI 17(3), 312–315 (1995)
Cheng H., Bouman C.A.: Document compression using rate- distortion optimized segmentation. J. Electron. Imaging 10(2), 460–474 (2001)
Cheng, H., Feng, G., Bouman, C. A.: Rate-distortion based segmentation for MRC compression. In: Proceedings of SPIE Conference on Color Imaging: Device-Independent Color, Color Hardcopy and Applications, vol. 4663, San Jose, CA, 21–23 January (2002)
Simard, P., Malvar, H., Rinker, J., Renshaw, E.: A Foreground/background separation algorithm for image compression. In: Data Compression Conference 2004, pp. 498–507 (2004)
Fan, Z., Xu, M.: A simple segmentation algorithm for mixed rater contents image representation. In: Proceedings of IS&T/SPIE Symposium on Electronic Imaging Science and Technology, Visual Communications and Image Processing, San Jose, CA, vol. 4663, pp. 63–71 (2002)
Saykol, E., Sinop, A. K., Gudukbay, U., Ulusoy, O., Cetin, A. E.: Content-based retrieval of historical ottoman documents stored as textual images. IEEE Trans. Image Process. 13(3) (2004)
Ascher R.N., Nagy G.: A means for achieving a high degree of compaction on scan-digitized printed text. IEEE Trans. Comput. 23, 1174–1179 (1974)
Pratt W.K., Capitant P.J., Chen W.H., Hamilton E.R., Wallis R.H.: Combined symbol matching facsimile data compression system. Proc. IEEE 68(7), 786–796 (1980)
Brickman N.F., Rosenbaum W.S.: Word autocorrelation redundancy match (WARM) technology. IBM J. Res. Devel. 26(6), 681–686 (1982)
Johnsen O., Segen J., Cash G.L.: Coding of two-level pictures by pattern matching and substitution. Bell Syst. Tech. J. 62(8), 2513–2545 (1983)
Holt, M. J. J., Xydeas, C. S.: Recent developments in image data compression for digital facsimile. ICL Tech. J. 123–146 (1986)
Holt M.J.: A fast bi-level template matching algorithm for document image data compression. In: Kittler, J. (eds) Pattern Recognition, pp. 230–239. Springer, Berlin (1988)
Yang Y., Yan H., Yu D.: Content-lossless document image compression based on structural analysis and pattern matching. Pattern Recognit. 33, 1277–1293 (2000)
Witten, I. H., Bell, T. C., Emberson, H., Inglis, S., Moffat, A.: Textual image compression: two-stage lossy/lossless encoding of textual images. In: Proceedings of the IEEE, vol. 82, no. 6, (1994)
Witten I.H., Moffat A., Bell T.C.: Managing Gigabytes: Compressing and Indexing Documents and Images, 2nd edn. Academic, London (1999)
Howard P.G.: Text image compression using soft pattern matching. Comput. J. 40(2/3), 146–156 (1997)
ISO/IEC International Standard 11544: Progressive Binary Image Compression, JBIG, ITU-Recommendation T.82 (1993)
Howard P.G., Kossentini F., Forchhammer S., Ruchlidge W.J.: The Emerging JBIG2 Standard. IEEE Trans. Circuits Syst. Video Tech. 8(7), 838–848 (1998)
Ye Y., Cosman P.: Fast and memory efficient text image compression with JBIG2. IEEE Trans. Image Proc. 12(8), 944–956 (2003)
Ye Y., Cosman P.: Dictionary design for text image compression with JBIG2. IEEE Trans. Image Process. 10(6), 818–828 (2001)
Cormen T.H., Leiserson C.E., Rivest R.L., Stein C.: Introduction to Algorithms, 2nd edn. McGraw-Hill, NY (2001)
Jahne B.: Digital Image Processing. Springer, Berlin (2005)
Makhoul, J., Francis, K., Richard, S., Ralph, W.: Performance measures for information extraction. In: Proceedings of DARPA Broadcast News Workshop, Herndon, VA, February (1999)
Baeza-Yates R., Ribeiro-Neto B.: Modern Information Retrieval. ACM Press/Addison-Wesley, New York/Reading (1999)
Hu, M. K. Visual pattern recognition by moment invariant. In: IRE Transactions on Information Theory, IT–8, pp. 179–187 (1962)
Author information
Authors and Affiliations
Corresponding author
Additional information
This work was supported by the Iranian Telecommunication Research Center (ITRC) under the granted number TMU87-06-30.
Rights and permissions
About this article
Cite this article
Grailu, H., Lotfizad, M. & Sadoghi-Yazdi, H. Farsi and Arabic document images lossy compression based on the mixed raster content model. IJDAR 12, 227–248 (2009). https://doi.org/10.1007/s10032-009-0088-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10032-009-0088-6