Semantic Multimedia

Volume 4306 of the series Lecture Notes in Computer Science pp 113-123

Image Clustering Using Multimodal Keywords

  • Rajeev AgrawalAffiliated withKettering UniversityWayne State University
  • , William GroskyAffiliated withThe University of Michigan – Dearborn
  • , Farshad FotouhiAffiliated withWayne State University

* Final gross prices may vary according to local VAT.

Get Access


Extending our previous work on visual keywords, we use the concept of template-based visual keywords using MPEG-7 color descriptors. MPEG-7, also called the Multimedia Content Description Interface, has been a standard for many years. These color descriptors have the ability to characterize perceptual color similarity and need relatively low complexity operations to extract them, besides being scalable and interoperable. We then demonstrate the power of these visual keywords for image clustering, when used in tandem with textual keyword annotations, in the context of latent semantic analysis, a popular technique in classical information retrieval which has been used to reveal the underlying semantic structure of document collections.


MPEG-7 visual keywords textual keywords latent semantic analysis singular value decomposition adjusted rand index