Abstract
In this paper, we propose a three-layer visual information processing architecture for extracting concise non-textual descriptions from visual contents. These coded descriptions capture both local saliencies and spatial configurations present in visual contents via prototypical visual tokens called visual “keywords”. Categorization of images and video shots represented by keyframes can be performed by comparing their coded descriptions. We demonstrate our proposed architecture in natural scene image categorization that outperforms methods which use aggregate measures of low-level features.
Real World Computing Partnership
Kent Ridge Digital Labs
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arbib, M.A. (Ed.): The Handbook of Brain Theory and Neural Networks. The MIT Press (1995).
Bach, J.R. et al.: Virage image search engine: an open framework for image management. In Storage and Retrieval for Image and Video Databases IV, Proc. S PIE 2670 (1996) 76–87.
Bolle, R.M., Yeo, B.L., Yeung, M.M.: Video query: research directions. IBM Journal of Research and Development 42(2) (1998) 233–252.
Corel (1998). http://www.corel.com.
Deerwester. S. et al.: Indexing by latent semantic analysis. J. of the Am. Soc. for. Information Science, 41 (1990) 391–407.
Jacobs, C.E., Finkelstein, A., Salesin, D.H.: Fast multiresolution image querying. In Proc. SIGGRAPH’95 (1995).
Larkey, L.S., Croft, W.B.: Combining classifiers in text categorization. In Proc. of SIGIR’96 (1996) 289–297.
Lewis, D.D., Ringuette, M.: A comparison of two learning algorithms for text categorization. In Proc. of SIGIR’94 (1994) 81–93.
Lipson, P., Grimson, E., Sinha, P.: Configuration based scene classification and image indexing. In Proc. of CVPR’97 (1997) 1007–1013.
Lim, J.H. (1999). Learnable Visual Keywords for Image Classification. (in preparation).
Niblack, W. et al.: The QBIC project: querying images by content using color, textures and shapes. Storage and Retrieval for Image and Video Databases, Proc. SPIE 1908 (1993) 13–25.
Papageorgiou, P.C., Oren, M., Poggio, T.: A general framework for object detection. In Proc. ICCV (1998).
Pentland, A., Picard, R.W., Sclaroff, S.: Photobook: content-based manipulation of image databases. Intl. J. of Computer Vision, 18(3) (1995) 233–254.
Ratan, A.L. Grimson, W.E.L.: Training templates for scene classification using a few examples. In Proc. IEEE Workshop on Content-Based Analysis of Images and Video Libraries (1997) 90–97.
Rowe, L.A. Boreczky, J.S., Eads, C.A.: Indices for user access to large video database. Storage and Retrieval for Image and Video Databases II. Proc. SPIE. 2185 (1994) 150–161.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lim, JH. (1999). Categorizing Visual Contents by Matching Visual “Keywords”. In: Huijsmans, D.P., Smeulders, A.W.M. (eds) Visual Information and Information Systems. VISUAL 1999. Lecture Notes in Computer Science, vol 1614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48762-X_46
Download citation
DOI: https://doi.org/10.1007/3-540-48762-X_46
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66079-8
Online ISBN: 978-3-540-48762-3
eBook Packages: Springer Book Archive