Abstract
Region-of-Interest (ROI) techniques are often utilized in natural still-image coding standards such as JPEG2000 [1]. In contrast, document image coding typically adopts multi-layer methods [2], using a carefully selected algorithm for each layer to optimize overall performance. In this paper, an ROI-based method is proposed for multi-component document image coding, where rectangular textual ROI’s are easily extracted using standard document image analysis techniques. Compared to multi-layer methods, the method is simpler and scalable, while preserving comparable visual quality at equivalent PSNR.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Christopoulos, C., Skodras, A., Ebrahimi, T.: The JPEG 2000 still image coding system: An overview. IEEE Transactions On Consumer Electronics 46, 1103–1127 (2000)
Bottou, L., Haffner, P., Howard, P.G., Simard, P., Bengio, Y., Le Cun, Y.: High quality document image compression with DjVu. Journal of Electronic Imageing 7, 410–425 (1998)
Howard, P.G.: Text image compression using soft pattern matching. The Computer Journal 40, 146–156 (1997)
Haffner, P., Bottou, P., Howard, P.G., Simard, P., Bengio, Y., Le Cun, Y.: Browsing through high quality document images with DjVu. In: Advances in Digital Libaries, pp. 309–318 (1998)
Yin, X.W., Fleury, M., Downton, A.C.: Archive image communication with improved compression. In: ICDAR 2003, vol. 1, pp. 92–96 (2003)
Beccaloni, G., Scoble, M., Robinson, G., Pitkin, B.: The global lepidoptera names index (2002), at http://www.nhm.ac.uk/entomology/lepindex/
Wong, K.Y., Casey, R.G., Wahl, F.M.: Document analysis system. IBM Journal of Research and Development 26, 647–656 (1982)
Ha, J., Haralick, M., Phillips, R., Recursive, I.T.: X-Y cut using bounding boxes of connected components. In: ICDAR 1995, vol. II, pp. 952–955 (1995)
He, J., Downton, A.C.: Configurable text stamp identification tool with application of fuzzy logic. In: Marinai, S., Dengel, A.R. (eds.) DAS 2004. LNCS, vol. 3163, pp. 201–212. Springer, Heidelberg (2004)
Christopoulos, C., Askelof, J., Larsson, M.: Efficient methods for encoding region of interest in the upcoming JPEG2000 still image coding standard. IEEE Signal Processing Letters 7, 247–249 (2000)
Park, K., Park, H.W.: Region-of-interest coding based on set partitioning in hierarchical trees. IEEE Transactions On Circuits and Systems for Video Technology 12, 106–113 (2002)
Said, A., Pearlman, W.A.: A new, fast, and efficient image codec based on set partitioning in hierarchical trees. IEEE Trans. on Circuits and Systems for Video Tech. 6, 243–250 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yin, X.W., Downton, A.C., Fleury, M., He, J. (2004). Multi-component Document Image Coding Using Regions-of-Interest. In: Marinai, S., Dengel, A.R. (eds) Document Analysis Systems VI. DAS 2004. Lecture Notes in Computer Science, vol 3163. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28640-0_15
Download citation
DOI: https://doi.org/10.1007/978-3-540-28640-0_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23060-1
Online ISBN: 978-3-540-28640-0
eBook Packages: Springer Book Archive