Skip to main content
Log in

Feature representation for 3D object retrieval based on unconstrained multi-view

  • Regular Paper
  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract

Reasonable and accurate image feature representation is the key to successful object retrieval. In this paper, we propose a 3D object feature representation method based on multiple views rather than a shape model. Unlike existing view-based methods that use pre-designed camera arrays to capture views, our method is flexible to implement by using several unconstrained views. Firstly, we generate a histogram of word frequencies to represent each view through local feature quantization. Then we integrate the histogram vectors of views belonging to the same object to generate a complete feature representation. Finally, similarity between two features is calculated for object retrieval. Several criteria are employed to evaluate the retrieval quality of the proposed method. Experimental results show that the integrated model feature is more effective and efficient than a set of individual image features and our approach is also competitive among several state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Liu, Y., Zhang, D., Lu, G., et al.: A survey of content-based image retrieval with high-level semantics. Pattern Recogn. 40(1), 262–282 (2007)

    Article  Google Scholar 

  2. Gao, Y., Dai, Q.H.: View-based 3D object retrieval: challenges and approaches. IEEE Multimedia 21(3), 52–57 (2014)

    Article  Google Scholar 

  3. Ohbuchi, R., Osada, K., Furuya, T., Banno T.: Salient local visual features for shape-based 3D model retrieval. In: IEEE International Conference on Shape Modeling And Applications 2008, Proceedings, pp. 93–102 (2008)

  4. Chen, X., Li, J., Shi, Z., et al.: Distinctive local surface descriptor for three-dimensional objects based on bispectrum of spherical harmonics. J. Electron. Imaging 25(1), 013021 (2016)

    Article  Google Scholar 

  5. Tabia, H., Colot, O., Daoudi, M., et al.: Three-dimensional object retrieval based on vector quantization of invariant descriptors. J. Electron. Imaging 21(2), 023011 (2012)

    Article  Google Scholar 

  6. Wang, P.S., et al.: O-CNN: octree-based convolutional neural networks for 3D shape analysis. ACM Trans. Graphics 36(4), 72 (2017)

    Article  Google Scholar 

  7. Qi, R.C., Su, H., Niebner, M., Dai, A., Yan, M., Guibas, L.J.: Volumetric and multi-view CNNs for object classification on 3D data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5648–5656 (2016)

  8. Bai, S., Bai, X., Zhou, Z., Zhang, Z., Latecki, L.J.: GIFT: a real-time and scalable 3D shape search engine. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)

  9. Gao, Y., Wang, M., Ji, R.R., et al.: 3-D object retrieval with Hausdorff distance learning. IEEE Trans. Industr. Electron. 61(4), 2088–2098 (2014)

    Article  Google Scholar 

  10. Gao, Y., Dai, Q.H., Wang, M., et al.: 3D model retrieval using weighted bipartite graph matching. Signal Process.-Image Commun. 26(1), 39–47 (2011)

    Article  Google Scholar 

  11. Gao, Y., Wang, M., Tao, D.C., et al.: 3-D object retrieval and recognition with Hypergraph analysis. IEEE Trans. Image Process. 21(9), 4290–4303 (2012)

    Article  MathSciNet  Google Scholar 

  12. Wang, M., Gao, Y., Lu, K., et al.: View-based discriminative probabilistic modeling for 3D object retrieval and recognition. IEEE Trans. Image Process. 22(4), 1395–1407 (2013)

    Article  MathSciNet  Google Scholar 

  13. Zhao, S., Yao, H., Zhang, Y., et al.: View-based 3D object retrieval via multi-modal graph learning. Signal Process. 112, 110–118 (2015)

    Article  Google Scholar 

  14. Liu, A., Wang, Z.Y., Nie, W.Z., et al.: Graph-based characteristic view set extraction and matching for 3D model retrieval. Inf. Sci. 320, 429–442 (2015)

    Article  Google Scholar 

  15. Chen, D.Y., Tian, X.P., Shen, Y.T., et al.: On visual similarity based 3D model retrieval. Comput. Graph. Forum 22(3), 223–232 (2003)

    Article  Google Scholar 

  16. Daras, P., Axenopoulos, A.: A 3D shape retrieval framework supporting multimodal queries. Int. J. Comput. Vis. 89(2–3), 229–247 (2010)

    Article  Google Scholar 

  17. Ansary, T.F., Daoudi, M., Vandeborre, J.P.: A Bayesian 3-D search engine using adaptive views clustering. IEEE Trans. Multimedia 9(1), 78–88 (2007)

    Article  Google Scholar 

  18. Gao, Y., Tang, J.H., Hong, R.C., et al.: Camera constraint-free view-based 3-D object retrieval. IEEE Trans. Image Process. 21(4), 2269–2281 (2012)

    Article  MathSciNet  Google Scholar 

  19. Mahmoudi, S., Daoudi, M.: 3D models retrieval by using characteristic views. In: 16th International Conference on Pattern Recognition, Vol Ii, Proceedings, pp. 457–460 (2002)

  20. Gao, Y., Dai, Q.H., Zhang, N.Y.: 3D model comparison using spatial structure circular descriptor. Pattern Recogn. 43(3), 1142–1151 (2010)

    Article  Google Scholar 

  21. Papadakis, P., Pratikakis, I., Theoharis, T., et al.: PANORAMA: a 3D shape descriptor based on panoramic views for unsupervised 3D object retrieval. Int. J. Comput. Vis. 89(2–3), 177–192 (2010)

    Article  Google Scholar 

  22. Kim, W.Y., Kim, Y.S.: A region-based shape descriptor using Zernike moments. Signal Process.-Image Commun. 16(1–2), 95–102 (2000)

    Article  Google Scholar 

  23. Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)

    Article  Google Scholar 

  24. Gao, Z., Li, Y., Wan, S.: Exploring deep learning for view-based 3D model retrieval. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 16(1), 1–21 (2020)

    Article  Google Scholar 

  25. Gao, Z., Xue, K.X., Wan, S.H.: Multiple discrimination and pairwise CNN for view-based 3D object retrieval. Neural Netw. 125, 290–302 (2020)

    Article  Google Scholar 

  26. Gao, Z., et al.: Adaptive fusion and category-level dictionary learning model for multiview human action recognition. IEEE Internet Things J. 6(6), 9280–9293 (2019)

    Article  Google Scholar 

  27. Li, F., Perona, P.: A Bayesian hierarchical model for learning natural scene categories. In: Proceeding of IEEE Computer Vision and Pattern Recognition. pp. 524–531 (2005)

  28. Passalis, N., Tefas, A.: Entropy optimized feature-based bag-of-words representation for information retrieval[J]. IEEE Trans. Knowl. Data Eng. 28(7), 1664–1677 (2016)

    Article  Google Scholar 

  29. Ergun, H., Sert, M.: Efficient bag of words based concept extraction for visual object retrieval. Springer International Publishing (2016)

  30. Lavoue, G.: Combination of bag-of-words descriptors for robust partial shape retrieval[J]. Vis. Comput. 28(9), 931–942 (2012)

    Article  Google Scholar 

  31. Toldo, R., Castellani, U., Fusiello, A.: The bag of words approach for retrieval and categorization of 3D objects. Vis. Comput. 26(10), 1257–1268 (2010)

    Article  Google Scholar 

  32. Sedmidubsky. J., Budikova, P., Dohnal, V., Zezula, P.: Motion words: a text-like representation of 3D skeleton sequences. In: 42nd European Conference on Information Retrieval (ECIR) (2020)

  33. Budikova, P., et al.: Efficient Indexing of 3D Human Motions. In: ACM International Conference on Multimedia Retrieval (ICMR), pp. 10–18 (2021)

  34. Duda, O., Hart, P.E., Stork, D.G.: Pattern Classification. John Wiley & Sons, Hoboken (2012)

    MATH  Google Scholar 

  35. Van Gemert, J.C., et al.: Visual word ambiguity. IEEE Trans. Pattern Anal. Mach. Intell. 32(7), 1271–1283 (2009)

    Article  Google Scholar 

  36. Leibe, B., Schiele, B.: Analyzing appearance and contour based methods for object categorization. In: 2003 IEEE Computer Society Conference on Computer Vision And Pattern Recognition, Vol Ii, Proceedings, pp. 409–415 (2003)

Download references

Acknowledgements

The authors thank the editor and anonymous reviewers for their helpful comments and valuable suggestions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xuanyin Wang.

Additional information

Communicated by K. Schoeffmann.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhou, B., Wang, X. Feature representation for 3D object retrieval based on unconstrained multi-view. Multimedia Systems 28, 1699–1711 (2022). https://doi.org/10.1007/s00530-022-00939-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00530-022-00939-1

Keywords

Navigation