Abstract
This paper is concerned with the indexing and retrieval of images based on features extracted directly from JPEG discrete cosine transform (DCT) domain. We examine possible ways of manipulating DCT coefficients by standard image analysis approaches to describe image shape, texture, and color. Through Mandala transformation, our approach groups a subset of DCT coefficients to form ten blocks. Each block represents a particular frequency of the original image. We use two blocks to model rough object shape; nine blocks to describe subband properties; and one block to compute color distribution. As a result, the amount of data used for processing and analysis is reduced significantly. This can lead to simple yet efficient ways, of indexing and retrieval in a large scale image database. Experimental results show that it only takes approximately 6ms to index shape features, 5ms to index texture features, and 8ms to index color features from an image the size of 128 × 128 on a Sun Sparc Ultra-1 machine.
This work is supported in part by RGC Grants HKUST661/95E and HKUST6072/97E
Please direct all enquires to C. W. Ngo, Email:cwngo@cs.ust.hk
Preview
Unable to display preview. Download preview PDF.
References
Y. Rui, T. S. Huang & S. Mehrotra.: Content-based Image Retrieval with Relevancy Feedback In Mars. Proc. of IEEE Int. Conf. on Image Processing, pp. 815–818, 1997.
Kok F. Lai, H. Zhou, & S. Chan.: Query Expansion by Raw Image Features and Text Annotations in Image Retrieval. Third Asian Conf. on Computer Vision, vol. 1, pp. 402–409, 1998.
Y. S. Hsu, S. Prum, J. H. Kagel, and H. C. Andrews.: Pattern Recognition experiments in the Mandala/cosine domain. IEEE Trans. Pattern Anal. Machine Intell., vol. PAMI-5, no. 5, pp. 512–520, Sept, 1983.
B. C. Smith, & L. A. Rowe.: Algorithms for manipulating compressed images. IEEE Computer Graphics and Applications, vol. 13, no. 5, pp. 34–42, Sept 1993.
Shih-Fu Chang & D. G. Messerschmitt.: Manipulation and Compositing of MC-DCT Compressed Video. EEE Journal on Selected Areas in Communications, vol. 13, no. 1, pp. 1–11, Jan 1995.
B. Shen & I. K. Sethi.: Inner-Block Operations on Compressed Images. Proc. ACM Intl. Conf. Multimedia'95, pp. 490–499, Nov, 1995.
S. Soltane, N. Kerkeni & J. C. Angue.: The use of Two Dimensional Discrete Cosine Transform for an Adaptive Approach to Image Segmentation. Proc. SPIE Image and Video Processing IV, pp. 242–251, 1996.
B. Shen & I. K. Sethi.: Direct Feature Extraction from Compressed Images. Proc. SPIE Storage and Retrieval for Image and Video Database IV, vol. 2670, pp. 404–14, 1996.
Shih-Fu Chang.: Compressed Domain Techniques for Image/Video Indexing and Manipulation. IEEE Intern. Conf. on Image Processing, ICIP 95, pp. 314–317, 1995.
W. Brent Seales, C. J. Yuan & W. Hu.: Content Analysis of Compressed Video. Technical Report 265-96, University of Kentucky, 1996.
N. V. Patel, I. K. Sethi.: Compressed Video Processing for Cut Detection. IEE Proc. Visual Image Signal Process, vol. 143, no. 5, pp. 315–23, Oct 1996.
K. R. Rao & P. Yip.: Discrete Cosine Transform: Algorithm, Advantage, Applications. The Univeristy of Texas, Academic Press, 1990.
H.S.Hou, D.R.Tretter, M.J.Vogel.: Interesting Properties of the Discrete Cosine Transform. Journal of Visual Communication and Image Representation, vol. 3, no. 1, pp. 73–83, March 1992.
B.L. Yeo and B. Liu.: On the Extraction of DC Sequence from MPEG Compressed Video. IEEE Int. Conf. on Image Processing, vol. 2, pp. 260–30, Oct 1995.
Y. Ariki & Y.Saito.: Extraction of TV News Articles Based on Scene Cut Detection Using DCT clustering. Proc. Int. Conf. on Image Processing, vol. 3, pp. 847–50, 1996.
J. R. Smith.: Integrated Spatial and Feature Image Systems: Retrieval, Analysis and Compression. Ph.D Thesis, Chapter 2, Columbia University, 1997.
ftp.uu.net/graphics/jpeg/jpegsrc.v6a.tar.gz, The Independent JPEG Group's JPEG software.
www-white.media.mit.edu/vismod/imagery/VisionTexture/vistex.html, Vision Texture.
Chaur-Chin Chen.: Improved Moment Invariants for Shape Discrimination. Pattern Recognition, vol. 26, no. 5, pp. 683–86, 1993.
Stan Sclaroff.: Deformable Prototypes for Encoding Shape Categories im Image Databases. Pattern Recognition, vol. 30, nol. 4, pp. 627–41, April 1997.
P. Brodatz.: Textures: A Photographic Album for Artists and Designers. New York:Dover, 1966.
D. K. Harman.: The First Text REtrieval Conference (TREC-1). Information Processing and Management, vol. 29, no 4, pp. 411–414, 1993.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ngo, C.W., Pong, T.C., Chin, R.T. (1998). Exploiting image indexing techniques in DCT domain. In: Ip, H.H.S., Smeulders, A.W.M. (eds) Multimedia Information Analysis and Retrieval. MINAR 1998. Lecture Notes in Computer Science, vol 1464. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0016499
Download citation
DOI: https://doi.org/10.1007/BFb0016499
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64826-0
Online ISBN: 978-3-540-68537-1
eBook Packages: Springer Book Archive