Discriminatively learning for representing local image features with quadruplet model

Optoelectronics Letters

Abstract

Hand-crafted features for representing local image patches are giving way to data-driven, learned image features, but learning a robust and discriminative descriptor that can handle a variety of patch-level computer vision tasks remains an open problem. In this work, we propose a novel deep convolutional neural network (CNN) for learning local feature descriptors. We train on quadruplets of positive and negative samples, together with a constraint that restricts the intra-class variance, to learn discriminative CNN representations. Compared with previous work, our model reduces the overlap in feature space between corresponding and non-corresponding patch pairs and mitigates the varying-margin problem caused by the commonly used triplet loss. We demonstrate that our method achieves better embedding results than recent methods such as PN-Net and TN-TG on a benchmark dataset.
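The abstract does not give the loss function itself, so the following is only a minimal sketch of a generic quadruplet-style loss with an intra-class penalty, not the formulation used in the paper. The function names, the two margins, the weight `intra_weight`, and the use of squared Euclidean distance are all assumptions made for illustration.

```python
def sq_dist(u, v):
    """Squared Euclidean distance between two descriptors (plain lists of floats)."""
    return sum((x - y) ** 2 for x, y in zip(u, v))


def quadruplet_loss(anchor, positive, negative1, negative2,
                    margin1=1.0, margin2=0.5, intra_weight=0.1):
    """Generic quadruplet-style loss for one sample (hypothetical formulation).

    anchor / positive : descriptors of two corresponding patches
    negative1         : descriptor of a patch that does not correspond to anchor
    negative2         : descriptor of a patch that does not correspond to negative1
    """
    d_pos = sq_dist(anchor, positive)            # matching pair
    d_neg = sq_dist(anchor, negative1)           # non-matching pair sharing the anchor
    d_neg_pair = sq_dist(negative1, negative2)   # non-matching pair without the anchor

    # Usual triplet term: the matching pair should be closer than the
    # non-matching pair that shares the same anchor.
    triplet_term = max(0.0, d_pos - d_neg + margin1)

    # Extra quadruplet term: the matching pair should also be closer than a
    # non-matching pair built from other patches, so the comparison does not
    # depend on a single anchor.
    cross_term = max(0.0, d_pos - d_neg_pair + margin2)

    # Assumed intra-class constraint: directly penalise the distance between
    # corresponding patches to keep matching descriptors tightly clustered.
    intra_term = intra_weight * d_pos

    return triplet_term + cross_term + intra_term


# Well-separated quadruplet: every term is zero.
print(quadruplet_loss([0.0, 0.0], [0.0, 0.0], [3.0, 0.0], [0.0, 3.0]))
```

In a sketch like this, the second hinge term compares the matching distance against a pair that does not involve the anchor, which is one common way quadruplet losses try to make the learned margin less dependent on the particular anchor chosen.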

References

  1. N. Molton, A. J. Davison and I. Reid, Locally Planar Patch Features for Real-Time Structure from Motion, British Machine Vision Conference, 1 (2004).

  2. S. M. Seitz, B. Curless, J. Diebel, D. Scharstein and R. Szeliski, A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 519 (2006).

  3. R. Szeliski, Foundations and Trends in Computer Graphics and Vision 2, 1 (2006).

  4. D. G. Lowe, International Journal of Computer Vision 60, 91 (2004).

  5. H. Bay, A. Ess, T. Tuytelaars and L. Van Gool, Computer Vision and Image Understanding 110, 346 (2008).

  6. E. Simo-Serra, E. Trulls, L. Ferraz, I. Kokkinos, P. Fua and F. Moreno-Noguer, Discriminative Learning of Deep Convolutional Feature Point Descriptors, IEEE International Conference on Computer Vision, 118 (2015).

  7. S. Zagoruyko and N. Komodakis, Learning to Compare Image Patches via Convolutional Neural Networks, IEEE Conference on Computer Vision and Pattern Recognition, 4353 (2015).

  8. V. Balntas, E. Johns, L. Tang and K. Mikolajczyk, PN-Net: Conjoined Triple Deep Network for Learning Local Image Descriptors, arXiv: 1601.05030, (2016).

  9. B. G. V. Kumar, G. Carneiro and I. Reid, Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimising Global Loss Functions, IEEE Conference on Computer Vision and Pattern Recognition, 5385 (2016).

  10. Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama and T. Darrell, Caffe: Convolutional Architecture for Fast Feature Embedding, arXiv: 1408.5093, (2014).

  11. M. Brown, G. Hua and S. Winder, IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 43 (2011).

  12. K. Simonyan, A. Vedaldi and A. Zisserman, IEEE Transactions on Pattern Analysis and Machine Intelligence 36, 1573 (2014).

Author information

Corresponding author

Correspondence to Lei Zhao  (赵磊).

Additional information

This work was supported by the Natural Science Foundation of Zhejiang Province (No. Y16F020023). This paper was presented in part at the CCF Chinese Conference on Computer Vision, Tianjin, 2017, and was recommended by the program committee.

About this article

Cite this article

Zhang, Dl., Zhao, L., Xu, Dq. et al. Discriminatively learning for representing local image features with quadruplet model. Optoelectron. Lett. 13, 462–465 (2017). https://doi.org/10.1007/s11801-017-7198-z

  • DOI: https://doi.org/10.1007/s11801-017-7198-z
