Hierarchical Combination of Semantic Visual Words for Image Classification and Clustering

  • Vinicius von Glehn De Filippo
  • Zenilton Kleber G. do PatrocínioJr.Email author
  • Silvio Jamil F. Guimarães
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9423)


Image classification and image clustering are two important tasks related to image analysis. In this work a two-level hierarchical model for both tasks using a hierarchical combination of image descriptors is presented. The construction of a latent semantic representation for images is also presented and its impact on the results of both tasks for the two-level hierarchical model is evaluated. Experiments have shown the superior performance attained by the hierarchical combination of descriptors when compared to the simple concatenation of them or to the use of single descriptors. The hierarchical combination of a latent semantic representation has presented results similar to the other hierarchical combinations, using only a small fraction of the time and space needed by others, which is interesting specially for those with restrictions of computer power and/or storage space.


Hierarchical combination of descriptors Image classification Image clustering Semantic visual vocabulary 


  1. 1.
    Andrade, F.S.P., Almeida, J., Pedrini, H., da S.Torres, R.: Fusion of local and global descriptors for content-based image and video retrieval. In: Alvarez, L., Mejail, M., Gomez, L., Jacobo, J. (eds.) CIARP 2012. LNCS, vol. 7441, pp. 845–853. Springer, Heidelberg (2012) CrossRefGoogle Scholar
  2. 2.
    Chen, S., Shi, W., Lv, X.: Feature coding for image classification combining global saliency and local difference. Pattern Recognition Letters 51, 44–49 (2015)CrossRefGoogle Scholar
  3. 3.
    Deerwester, S.C., Dumais, S.T., Landauer, T.K., Furnas, G.W., Harshman, R.A.: Indexing by latent semantic analysis. Journal of the American Society for Information Science 41(6), 391–407 (1990)CrossRefGoogle Scholar
  4. 4.
    Farahzadeh, E., Cham, T.J., Sluzek, A.: Scene recognition by semantic visual words. Signal, Image and Video Processing (October 2014)
  5. 5.
    Pfitzner, D., Leibbrandt, R., Powers, D.: Characterization and evaluation of similarity measures for pairs of clusterings. Knowledge and Information Systems 19(3), 361–394 (2009)CrossRefGoogle Scholar
  6. 6.
    Wang, X.Y., Zhang, B.B., Yang, H.Y.: Content-based image retrieval by integrating color and texture features. Multimedia Tools and Applications 68(3), 545–569 (2014)MathSciNetCrossRefGoogle Scholar
  7. 7.
    Yue, J., Li, Z., Liu, L., Fu, Z.: Content-based image retrieval using color and texture fused features. Mathematical and Computer Modelling 54(3–4), 1121–1127 (2011)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Vinicius von Glehn De Filippo
    • 1
  • Zenilton Kleber G. do PatrocínioJr.
    • 2
    Email author
  • Silvio Jamil F. Guimarães
    • 2
  1. 1.Instituto PolitécnicoCentro Universitário UNABelo HorizonteBrazil
  2. 2.Pontifícia Universidade Católica de Minas GeraisBelo HorizonteBrazil

Personalised recommendations