Analysis of Compact Features for RGB-D Visual Search

Petrelli, Alioscia; Pau, Danilo; Di Stefano, Luigi

doi:10.1007/978-3-319-23234-8_2

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9280)

Included in the following conference series:

International Conference on Image Analysis and Processing

Abstract

Anticipating the oncoming integration of depth sensing into mobile devices, we experimentally compare different compact features for representing RGB-D images in mobile visual search. Experiments on three state-of-the-art datasets, addressing both category and instance recognition, show that Deep Features provided by Convolutional Neural Networks better represent appearance information, whereas shape is more effectively encoded through Kernel Descriptors. Moreover, our evaluation suggests that learning to weight the relative contributions of depth and appearance is key to deploying depth sensing effectively in forthcoming mobile visual search scenarios.
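
As a rough illustration of what weighting depth against appearance can mean at matching time, the sketch below fuses two distances with a single weight. It is not the method evaluated in the paper: the use of binary codes, the normalised Hamming distance, the names `hamming` and `fused_nearest`, and the fixed weight `w` (which would in practice be learned or tuned on training data) are all assumptions made for this example.

```python
import numpy as np

def hamming(code, database):
    """Hamming distance between one binary code (1-D) and every row of `database` (2-D)."""
    return np.count_nonzero(code != database, axis=1)

def fused_nearest(q_app, q_depth, db_app, db_depth, w):
    """Return the index of the database item with the smallest fused distance.

    `w` in [0, 1] is the (hypothetical) weight given to appearance;
    depth receives 1 - w. Distances are divided by the code length so
    that codes of different sizes contribute on the same scale.
    """
    d_app = hamming(q_app, db_app) / db_app.shape[1]
    d_depth = hamming(q_depth, db_depth) / db_depth.shape[1]
    return int(np.argmin(w * d_app + (1.0 - w) * d_depth))

# Toy database of two items whose appearance and depth codes disagree.
db_app = np.array([[0, 0, 0, 0], [1, 1, 1, 1]], dtype=np.uint8)
db_depth = np.array([[1, 1, 1, 1], [0, 0, 0, 0]], dtype=np.uint8)
q_app = np.array([0, 0, 0, 1], dtype=np.uint8)
q_depth = np.array([0, 0, 0, 1], dtype=np.uint8)

best_if_appearance_dominates = fused_nearest(q_app, q_depth, db_app, db_depth, w=0.9)
best_if_depth_dominates = fused_nearest(q_app, q_depth, db_app, db_depth, w=0.1)
```

In this toy setup the retrieved item changes with the weight, which is why the choice of `w` matters when the two cues disagree.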

Author information

Authors and Affiliations

University of Bologna, Bologna, Italy
Alioscia Petrelli & Luigi Di Stefano
ST Microelectronics, Agrate Brianza, Italy
Danilo Pau

Corresponding author

Correspondence to Alioscia Petrelli.

Editor information

Editors and Affiliations

Pattern Analysis and Computer Vision, Istituto Italiano di Tecnologia (IIT), Genoa, Italy
Vittorio Murino
Università di Genova, Genoa, Italy
Enrico Puppo

Cite this paper

Petrelli, A., Pau, D., Di Stefano, L. (2015). Analysis of Compact Features for RGB-D Visual Search. In: Murino, V., Puppo, E. (eds) Image Analysis and Processing - ICIAP 2015. Lecture Notes in Computer Science, vol 9280. Springer, Cham. https://doi.org/10.1007/978-3-319-23234-8_2

DOI: https://doi.org/10.1007/978-3-319-23234-8_2
Published: 21 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23233-1
Online ISBN: 978-3-319-23234-8
eBook Packages: Computer Science, Computer Science (R0)
