Feature representation for 3D object retrieval based on unconstrained multi-view

Zhou, Bin; Wang, Xuanyin

doi:10.1007/s00530-022-00939-1

Feature representation for 3D object retrieval based on unconstrained multi-view

Regular Paper
Published: 04 May 2022

Volume 28, pages 1699–1711, (2022)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Bin Zhou¹ &
Xuanyin Wang¹

196 Accesses
Explore all metrics

Abstract

Reasonable and accurate image feature representation is the key to successful object retrieval. In this paper, we propose a 3D object feature representation method based on multiple views rather than a shape model. Unlike existing view-based methods that use pre-designed camera arrays to capture views, our method is flexible to implement by using several unconstrained views. Firstly, we generate a histogram of word frequencies to represent each view through local feature quantization. Then we integrate the histogram vectors of views belonging to the same object to generate a complete feature representation. Finally, similarity between two features is calculated for object retrieval. Several criteria are employed to evaluate the retrieval quality of the proposed method. Experimental results show that the integrated model feature is more effective and efficient than a set of individual image features and our approach is also competitive among several state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-view and multivariate gaussian descriptor for 3D object retrieval

Article 11 October 2017

Visual information quantification for object recognition and retrieval

Article 27 October 2021

Cross-View Feature Hashing for Image Retrieval

References

Liu, Y., Zhang, D., Lu, G., et al.: A survey of content-based image retrieval with high-level semantics. Pattern Recogn. 40(1), 262–282 (2007)
Article Google Scholar
Gao, Y., Dai, Q.H.: View-based 3D object retrieval: challenges and approaches. IEEE Multimedia 21(3), 52–57 (2014)
Article Google Scholar
Ohbuchi, R., Osada, K., Furuya, T., Banno T.: Salient local visual features for shape-based 3D model retrieval. In: IEEE International Conference on Shape Modeling And Applications 2008, Proceedings, pp. 93–102 (2008)
Chen, X., Li, J., Shi, Z., et al.: Distinctive local surface descriptor for three-dimensional objects based on bispectrum of spherical harmonics. J. Electron. Imaging 25(1), 013021 (2016)
Article Google Scholar
Tabia, H., Colot, O., Daoudi, M., et al.: Three-dimensional object retrieval based on vector quantization of invariant descriptors. J. Electron. Imaging 21(2), 023011 (2012)
Article Google Scholar
Wang, P.S., et al.: O-CNN: octree-based convolutional neural networks for 3D shape analysis. ACM Trans. Graphics 36(4), 72 (2017)
Article Google Scholar
Qi, R.C., Su, H., Niebner, M., Dai, A., Yan, M., Guibas, L.J.: Volumetric and multi-view CNNs for object classification on 3D data. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5648–5656 (2016)
Bai, S., Bai, X., Zhou, Z., Zhang, Z., Latecki, L.J.: GIFT: a real-time and scalable 3D shape search engine. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016)
Gao, Y., Wang, M., Ji, R.R., et al.: 3-D object retrieval with Hausdorff distance learning. IEEE Trans. Industr. Electron. 61(4), 2088–2098 (2014)
Article Google Scholar
Gao, Y., Dai, Q.H., Wang, M., et al.: 3D model retrieval using weighted bipartite graph matching. Signal Process.-Image Commun. 26(1), 39–47 (2011)
Article Google Scholar
Gao, Y., Wang, M., Tao, D.C., et al.: 3-D object retrieval and recognition with Hypergraph analysis. IEEE Trans. Image Process. 21(9), 4290–4303 (2012)
Article MathSciNet Google Scholar
Wang, M., Gao, Y., Lu, K., et al.: View-based discriminative probabilistic modeling for 3D object retrieval and recognition. IEEE Trans. Image Process. 22(4), 1395–1407 (2013)
Article MathSciNet Google Scholar
Zhao, S., Yao, H., Zhang, Y., et al.: View-based 3D object retrieval via multi-modal graph learning. Signal Process. 112, 110–118 (2015)
Article Google Scholar
Liu, A., Wang, Z.Y., Nie, W.Z., et al.: Graph-based characteristic view set extraction and matching for 3D model retrieval. Inf. Sci. 320, 429–442 (2015)
Article Google Scholar
Chen, D.Y., Tian, X.P., Shen, Y.T., et al.: On visual similarity based 3D model retrieval. Comput. Graph. Forum 22(3), 223–232 (2003)
Article Google Scholar
Daras, P., Axenopoulos, A.: A 3D shape retrieval framework supporting multimodal queries. Int. J. Comput. Vis. 89(2–3), 229–247 (2010)
Article Google Scholar
Ansary, T.F., Daoudi, M., Vandeborre, J.P.: A Bayesian 3-D search engine using adaptive views clustering. IEEE Trans. Multimedia 9(1), 78–88 (2007)
Article Google Scholar
Gao, Y., Tang, J.H., Hong, R.C., et al.: Camera constraint-free view-based 3-D object retrieval. IEEE Trans. Image Process. 21(4), 2269–2281 (2012)
Article MathSciNet Google Scholar
Mahmoudi, S., Daoudi, M.: 3D models retrieval by using characteristic views. In: 16th International Conference on Pattern Recognition, Vol Ii, Proceedings, pp. 457–460 (2002)
Gao, Y., Dai, Q.H., Zhang, N.Y.: 3D model comparison using spatial structure circular descriptor. Pattern Recogn. 43(3), 1142–1151 (2010)
Article Google Scholar
Papadakis, P., Pratikakis, I., Theoharis, T., et al.: PANORAMA: a 3D shape descriptor based on panoramic views for unsupervised 3D object retrieval. Int. J. Comput. Vis. 89(2–3), 177–192 (2010)
Article Google Scholar
Kim, W.Y., Kim, Y.S.: A region-based shape descriptor using Zernike moments. Signal Process.-Image Commun. 16(1–2), 95–102 (2000)
Article Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
Gao, Z., Li, Y., Wan, S.: Exploring deep learning for view-based 3D model retrieval. ACM Trans. Multimed. Comput. Commun. Appl. (TOMM) 16(1), 1–21 (2020)
Article Google Scholar
Gao, Z., Xue, K.X., Wan, S.H.: Multiple discrimination and pairwise CNN for view-based 3D object retrieval. Neural Netw. 125, 290–302 (2020)
Article Google Scholar
Gao, Z., et al.: Adaptive fusion and category-level dictionary learning model for multiview human action recognition. IEEE Internet Things J. 6(6), 9280–9293 (2019)
Article Google Scholar
Li, F., Perona, P.: A Bayesian hierarchical model for learning natural scene categories. In: Proceeding of IEEE Computer Vision and Pattern Recognition. pp. 524–531 (2005)
Passalis, N., Tefas, A.: Entropy optimized feature-based bag-of-words representation for information retrieval[J]. IEEE Trans. Knowl. Data Eng. 28(7), 1664–1677 (2016)
Article Google Scholar
Ergun, H., Sert, M.: Efficient bag of words based concept extraction for visual object retrieval. Springer International Publishing (2016)
Lavoue, G.: Combination of bag-of-words descriptors for robust partial shape retrieval[J]. Vis. Comput. 28(9), 931–942 (2012)
Article Google Scholar
Toldo, R., Castellani, U., Fusiello, A.: The bag of words approach for retrieval and categorization of 3D objects. Vis. Comput. 26(10), 1257–1268 (2010)
Article Google Scholar
Sedmidubsky. J., Budikova, P., Dohnal, V., Zezula, P.: Motion words: a text-like representation of 3D skeleton sequences. In: 42nd European Conference on Information Retrieval (ECIR) (2020)
Budikova, P., et al.: Efficient Indexing of 3D Human Motions. In: ACM International Conference on Multimedia Retrieval (ICMR), pp. 10–18 (2021)
Duda, O., Hart, P.E., Stork, D.G.: Pattern Classification. John Wiley & Sons, Hoboken (2012)
MATH Google Scholar
Van Gemert, J.C., et al.: Visual word ambiguity. IEEE Trans. Pattern Anal. Mach. Intell. 32(7), 1271–1283 (2009)
Article Google Scholar
Leibe, B., Schiele, B.: Analyzing appearance and contour based methods for object categorization. In: 2003 IEEE Computer Society Conference on Computer Vision And Pattern Recognition, Vol Ii, Proceedings, pp. 409–415 (2003)

Download references

Acknowledgements

The authors thank the editor and anonymous reviewers for their helpful comments and valuable suggestions.

Author information

Authors and Affiliations

State Key Laboratory of Fluid Power and Mechatronic Systems, Zhejiang University, No. 38 Zheda Road, Hangzhou, 310027, Zhejiang, People’s Republic of China
Bin Zhou & Xuanyin Wang

Authors

Bin Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Xuanyin Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xuanyin Wang.

Additional information

Communicated by K. Schoeffmann.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhou, B., Wang, X. Feature representation for 3D object retrieval based on unconstrained multi-view. Multimedia Systems 28, 1699–1711 (2022). https://doi.org/10.1007/s00530-022-00939-1

Download citation

Received: 16 May 2021
Accepted: 07 April 2022
Published: 04 May 2022
Issue Date: October 2022
DOI: https://doi.org/10.1007/s00530-022-00939-1

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Feature representation for 3D object retrieval based on unconstrained multi-view

Abstract

Access this article

Similar content being viewed by others

Multi-view and multivariate gaussian descriptor for 3D object retrieval

Visual information quantification for object recognition and retrieval

Cross-View Feature Hashing for Image Retrieval

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Feature representation for 3D object retrieval based on unconstrained multi-view

Abstract

Access this article

Similar content being viewed by others

Multi-view and multivariate gaussian descriptor for 3D object retrieval

Visual information quantification for object recognition and retrieval

Cross-View Feature Hashing for Image Retrieval

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation