Skip to main content

Categorizing Visual Contents by Matching Visual “Keywords”

  • Conference paper
  • First Online:
Book cover Visual Information and Information Systems (VISUAL 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1614))

Included in the following conference series:

Abstract

In this paper, we propose a three-layer visual information processing architecture for extracting concise non-textual descriptions from visual contents. These coded descriptions capture both local saliencies and spatial configurations present in visual contents via prototypical visual tokens called visual “keywords”. Categorization of images and video shots represented by keyframes can be performed by comparing their coded descriptions. We demonstrate our proposed architecture in natural scene image categorization that outperforms methods which use aggregate measures of low-level features.

Real World Computing Partnership

Kent Ridge Digital Labs

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arbib, M.A. (Ed.): The Handbook of Brain Theory and Neural Networks. The MIT Press (1995).

    Google Scholar 

  2. Bach, J.R. et al.: Virage image search engine: an open framework for image management. In Storage and Retrieval for Image and Video Databases IV, Proc. S PIE 2670 (1996) 76–87.

    Google Scholar 

  3. Bolle, R.M., Yeo, B.L., Yeung, M.M.: Video query: research directions. IBM Journal of Research and Development 42(2) (1998) 233–252.

    Article  Google Scholar 

  4. Corel (1998). http://www.corel.com.

  5. Deerwester. S. et al.: Indexing by latent semantic analysis. J. of the Am. Soc. for. Information Science, 41 (1990) 391–407.

    Article  Google Scholar 

  6. Jacobs, C.E., Finkelstein, A., Salesin, D.H.: Fast multiresolution image querying. In Proc. SIGGRAPH’95 (1995).

    Google Scholar 

  7. Larkey, L.S., Croft, W.B.: Combining classifiers in text categorization. In Proc. of SIGIR’96 (1996) 289–297.

    Google Scholar 

  8. Lewis, D.D., Ringuette, M.: A comparison of two learning algorithms for text categorization. In Proc. of SIGIR’94 (1994) 81–93.

    Google Scholar 

  9. Lipson, P., Grimson, E., Sinha, P.: Configuration based scene classification and image indexing. In Proc. of CVPR’97 (1997) 1007–1013.

    Google Scholar 

  10. Lim, J.H. (1999). Learnable Visual Keywords for Image Classification. (in preparation).

    Google Scholar 

  11. Niblack, W. et al.: The QBIC project: querying images by content using color, textures and shapes. Storage and Retrieval for Image and Video Databases, Proc. SPIE 1908 (1993) 13–25.

    Google Scholar 

  12. Papageorgiou, P.C., Oren, M., Poggio, T.: A general framework for object detection. In Proc. ICCV (1998).

    Google Scholar 

  13. Pentland, A., Picard, R.W., Sclaroff, S.: Photobook: content-based manipulation of image databases. Intl. J. of Computer Vision, 18(3) (1995) 233–254.

    Article  Google Scholar 

  14. Ratan, A.L. Grimson, W.E.L.: Training templates for scene classification using a few examples. In Proc. IEEE Workshop on Content-Based Analysis of Images and Video Libraries (1997) 90–97.

    Google Scholar 

  15. Rowe, L.A. Boreczky, J.S., Eads, C.A.: Indices for user access to large video database. Storage and Retrieval for Image and Video Databases II. Proc. SPIE. 2185 (1994) 150–161.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lim, JH. (1999). Categorizing Visual Contents by Matching Visual “Keywords”. In: Huijsmans, D.P., Smeulders, A.W.M. (eds) Visual Information and Information Systems. VISUAL 1999. Lecture Notes in Computer Science, vol 1614. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48762-X_46

Download citation

  • DOI: https://doi.org/10.1007/3-540-48762-X_46

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66079-8

  • Online ISBN: 978-3-540-48762-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics