Skip to main content

Improved Content-Based Image Retrieval with Multivariate Gaussian Distribution

  • Conference paper
  • First Online:
Soft Computing in Data Analytics

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 758))

  • 838 Accesses

Abstract

This paper presents an overview of content-based image retrieval based on integrating image features obtained from scale-invariant feature transform (SIFT) with multivariate Gaussian distribution for the efficient retrieval of images in the image database. The state-of-the-art methods for image retrieval and object recognition use SIFT and HoG to extract image visual features. Though these descriptors are helpful in a variety of applications, they exploit zero-order statistics as they only collect histogram features, and this lacks high descriptiveness of object features and quantization problem. The novel method is having each pixel of the object which is associated with multivariate Gaussian distribution and approximated new features in the locality of the region. The key issue of this approach lies in space of the multivariate Gaussian distribution which lies in Riemannian manifold. But the linear space is suitable domain to discriminate image feature vectors efficiently. With the basis of Lie group structure and Riemannian geometry, multiplication operations are determined on the manifold to embed Gaussian space into linear space and are referred to as log-Euclidean multivariate Gaussian descriptors. These descriptors determine distinctive low- and high-dimensional image features efficiently. The experiments were conducted on Caltech-101, WANG database to validate thoroughly this approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hiremath, P.S., Pujari, J.: Content based image retrieval using color, texture and shape features. In: International Conference on Advanced Computing and Communications, 2007. ADCOM 2007. IEEE (2007)

    Google Scholar 

  2. Venu Gopal, T., Ramesh Naik, B., Prasad, V.K.: Image retrieval using adapted Fourier descriptors. Int. J. Signal Imag. Syst. Eng. 3(3), 188–194 (2010)

    Google Scholar 

  3. Lowe, D.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60, 91–110 (2004)

    Article  Google Scholar 

  4. Persoon, E., Fu, K.S.: Shape discrimination using Fourier descriptors. IEEE Trans. Syst. Man Cybern. 21(3), 170–179 (1997)

    Article  MathSciNet  Google Scholar 

  5. Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 886–893 (2005)

    Google Scholar 

  6. Arsigny, V., Fillard, P., Pennec, X., Ayache, N.: Geometric means in a novel vector space structure on symmetric positive-definite atrices. SIAM J. Matrix Anal., Appl (2006)

    MATH  Google Scholar 

  7. Lebanon, G.: Metric learning for text documents. IEEE Trans. Pattern Anal. Mach. Intell. 28(4) (2006)

    Google Scholar 

  8. Li, P., Wang, Q., Zeng, H., Zhang, L.: Local log euclidean multivariate gaussian descriptors and its application to image classification. IEEE Trans. PAMI 39(4), 803–817 (2017)

    Article  Google Scholar 

  9. Naik, B.Ramesh, Venugopal, T.: Object recognition using log euclidean multivariate gaussian descriptors. Int. J. Appl. Eng. Res. 12(14), 4130–4137 (2017)

    Google Scholar 

  10. Shi, H., Zhang, H., Li, G., Wang, X.: Stable embedding of Grassmann Manifold via Gaussian Random Matrices. IEEE Trans. Inf. Theory 61(5), 2924–2924 (2015)

    Article  MathSciNet  Google Scholar 

  11. Ke, Y., Sukthankar, R.: PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. II–506 (2004)

    Google Scholar 

  12. Grana, C., et al.: UNIMORE at ImageCLEF 2013: Scalable Concept Image Annotation. CLEF (Working Notes) (2013)

    Google Scholar 

  13. Serra, G., Grana, C., Manfredi, M., Cucchiara, R.: Modeling local descriptors with multivariate Gaussians for object and scene recognition. In: Proceedings of the ACM International Conference on Multimedia, pp. 709–712 (2013)

    Google Scholar 

  14. Lebanon, G.: Riemannian geometry and statistical machine learning. Thesis, Language Technologies Institute, 2005

    Google Scholar 

  15. Huang, Y., Wu, Z., Wang, L., Tan, T.: Feature coding in image classification: a comprehensive study. IEEE Trans. Pattern Anal. Mach. Intell. 36(3), 493–506 (2014)

    Article  Google Scholar 

  16. Hall, B.C.: Lie groups, Lie algebras, and Representations: an elementary entroduction. Graduate Texts in Mathematics, vol. 222, 2nd edn. Springer. https://doi.org/10.1007/978-3-319-13467-3. ISBN 978-3319134666 ISSN 0072-5285. 2015

  17. Skovgaard, L.T.: A Riemannian geometry of the multivariate normal model. Scand. J. Stat. 11(4), 211–223 (1984)

    MathSciNet  MATH  Google Scholar 

  18. Sánchez, J., Perronnin, F., Mensink, T., Verbeek, J.: Image classification with the Fisher vector: theory and practice. Int. J. Comput. Vis. 105(3), 222–245 (2013)

    Article  MathSciNet  Google Scholar 

  19. Calvo, M., Oller, J.M.: A distance between multivariate normal distributions based in an embedding into the siegel group. J. Multivar. Anal. 35(2), 223–242 (1990)

    Article  MathSciNet  Google Scholar 

  20. Gallier, J.: Logarithms and square roots of real matrices. CoRR (2013). arXiv:0805.0245

  21. Cimpoi, M., Maji, S., Kokkinos, I., Mohamed, S., Vedaldi, A.: Describing textures in the wild. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3606–3613 (2014)

    Google Scholar 

  22. Garnett, R., Osborne, M.A., Hennig, P.: Active learning of linear embeddings for Gaussian processes. arXiv:1310.6740 (2013)

  23. Donahue, J., Jia, Y., Vinyals, O., Hoffman, J., Zhang, N., Tzeng, E., Darrell, T.: DeCAF: a deep convolutional activation feature forgeneric visual recognition. In: Proceedings of the International Conference on Machine Learning, pp. 647–655 (2014)

    Google Scholar 

  24. van de Sande, K., Gevers, T., Snoek, C.: Evaluating color descriptors for object and scene recognition. IEEE Trans. Pattern Anal. Mach. Intell. 32(9), 1582–1596 (2010)

    Article  Google Scholar 

  25. Griffin, G., Holub, A., Perona, P.: The Caltech-256. Tech. Rep, California Institute of Technology (2007)

    Google Scholar 

  26. Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2169–2178 (2006)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to B. Ramesh Naik .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ramesh Naik, B., Venu Gopal, T. (2019). Improved Content-Based Image Retrieval with Multivariate Gaussian Distribution. In: Nayak, J., Abraham, A., Krishna, B., Chandra Sekhar, G., Das, A. (eds) Soft Computing in Data Analytics . Advances in Intelligent Systems and Computing, vol 758. Springer, Singapore. https://doi.org/10.1007/978-981-13-0514-6_40

Download citation

Publish with us

Policies and ethics