Categorizing Visual Contents by Matching Visual “Keywords”

Lim, Joo-Hwee

doi:10.1007/3-540-48762-X_46

Joo-Hwee Lim⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1614))

Included in the following conference series:

International Conference on Advances in Visual Information Systems

696 Accesses
4 Citations

Abstract

In this paper, we propose a three-layer visual information processing architecture for extracting concise non-textual descriptions from visual contents. These coded descriptions capture both local saliencies and spatial configurations present in visual contents via prototypical visual tokens called visual “keywords”. Categorization of images and video shots represented by keyframes can be performed by comparing their coded descriptions. We demonstrate our proposed architecture in natural scene image categorization that outperforms methods which use aggregate measures of low-level features.

Real World Computing Partnership

Kent Ridge Digital Labs

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Arbib, M.A. (Ed.): The Handbook of Brain Theory and Neural Networks. The MIT Press (1995).
Google Scholar
Bach, J.R. et al.: Virage image search engine: an open framework for image management. In Storage and Retrieval for Image and Video Databases IV, Proc. S PIE 2670 (1996) 76–87.
Google Scholar
Bolle, R.M., Yeo, B.L., Yeung, M.M.: Video query: research directions. IBM Journal of Research and Development 42(2) (1998) 233–252.
Article Google Scholar
Corel (1998). http://www.corel.com.
Deerwester. S. et al.: Indexing by latent semantic analysis. J. of the Am. Soc. for. Information Science, 41 (1990) 391–407.
Article Google Scholar
Jacobs, C.E., Finkelstein, A., Salesin, D.H.: Fast multiresolution image querying. In Proc. SIGGRAPH’95 (1995).
Google Scholar
Larkey, L.S., Croft, W.B.: Combining classifiers in text categorization. In Proc. of SIGIR’96 (1996) 289–297.
Google Scholar
Lewis, D.D., Ringuette, M.: A comparison of two learning algorithms for text categorization. In Proc. of SIGIR’94 (1994) 81–93.
Google Scholar
Lipson, P., Grimson, E., Sinha, P.: Configuration based scene classification and image indexing. In Proc. of CVPR’97 (1997) 1007–1013.
Google Scholar
Lim, J.H. (1999). Learnable Visual Keywords for Image Classification. (in preparation).
Google Scholar
Niblack, W. et al.: The QBIC project: querying images by content using color, textures and shapes. Storage and Retrieval for Image and Video Databases, Proc. SPIE 1908 (1993) 13–25.
Google Scholar
Papageorgiou, P.C., Oren, M., Poggio, T.: A general framework for object detection. In Proc. ICCV (1998).
Google Scholar
Pentland, A., Picard, R.W., Sclaroff, S.: Photobook: content-based manipulation of image databases. Intl. J. of Computer Vision, 18(3) (1995) 233–254.
Article Google Scholar
Ratan, A.L. Grimson, W.E.L.: Training templates for scene classification using a few examples. In Proc. IEEE Workshop on Content-Based Analysis of Images and Video Libraries (1997) 90–97.
Google Scholar
Rowe, L.A. Boreczky, J.S., Eads, C.A.: Indices for user access to large video database. Storage and Retrieval for Image and Video Databases II. Proc. SPIE. 2185 (1994) 150–161.
Google Scholar

Download references

Author information

Authors and Affiliations

Information-Base Functions KRDL Lab, RWCP, 21 Heng Mui Kent Terrace, S(119613), Singapore
Joo-Hwee Lim

Authors

Joo-Hwee Lim
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department, Leiden University, Niels Bohrweg 1, 2333 CA, Leiden, The Netherlands
Dionysius P. Huijsmans
Research Institute Computer Science, University of Amsterdam, Kruislaan 403, 1098 SJ, Amsterdam, The Netherlands
Arnold W. M. Smeulders

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lim, JH. (1999). Categorizing Visual Contents by Matching Visual “Keywords”. In: Huijsmans, D.P., Smeulders, A.W.M. (eds) Visual Information and Information Systems. VISUAL 1999. Lecture Notes in Computer Science, vol 1614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48762-X_46

Download citation

DOI: https://doi.org/10.1007/3-540-48762-X_46
Published: 02 July 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66079-8
Online ISBN: 978-3-540-48762-3
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics