Advertisement

Artificial Intelligence Review

, Volume 52, Issue 1, pp 323–355 | Cite as

A comparative study of hash based approximate nearest neighbor learning and its application in image retrieval

  • P. ArulmozhiEmail author
  • S. Abirami
Article
  • 410 Downloads

Abstract

Plenty of data are available due to the growth of digital technology that creates a high expectation in retrieving the relevant images, accurately and efficiently for a given query image. For searching the relevant images efficiently for the Large Scale dataset, the searching algorithm should have fast access capability. The existing Exact Nearest Neighbor search performs in linear time and so it takes more time as both the dataset and data dimension increases. As a remedy to provide sub-linear/logarithmic time complexity, usage of Approximate Nearest Neighbor (ANN) algorithms is emerging at a rapid rate. This paper discusses about the importance of ANN and their general classification; the different categories involved in Learning to Hash has been analyzed with their pros and cons; different bit assignment types and methods to minimize the Quantization Errors have been reviewed along with its merits and demerits. Therefore, it serves to increase the efficiency of the Image Retrieval process in Large Scale.

Keywords

Approximate nearest neighbor Hash based ANN Learning to hash Deep hashing 

Notes

Acknowledgements

This work is supported by the Department of Science and Technology- 7.1.3.69 PURSE phase II programme, Sanction Proceedings No: 9500/PD2/2014.

References

  1. Aly M, Munich M, Perona P (2011) Distributed kd-trees for retrieval from very large image collections. In: Proceedings of the British machine vision conference (BMVC)Google Scholar
  2. Basri R, Hassner T, Zelnik-Manor L (2011) Approximate nearest subspace search. IEEE Trans Pattern Anal Mach Intell 33(2):266–278CrossRefGoogle Scholar
  3. Bellet A, Habrard A, Sebban M (2013) A survey on metric learning for feature vectors and structured data. arXiv preprint arXiv:1306.6709
  4. Boato G, Dang-Nguyen DT, Muratov O, Alajlan N, Natale FG (2016) Exploiting visual saliency for increasing diversity of image retrieval results. Multimed Tools Appl 75(10):5581–5602CrossRefGoogle Scholar
  5. Calonder M, Lepetit V, Strecha C, Fua P (2010) Brief: binary robust independent elementary features. In: European conference on computer vision. Springer, pp 778–792Google Scholar
  6. Cao Z, Long M, Wang J, Yu PS (2017) Hashnet: deep learning to hash by continuation. arXiv preprint arXiv:1702.00758
  7. Cha SH (2007) Comprehensive survey on distance/similarity measures between probability density functions. City 1(2):1MathSciNetGoogle Scholar
  8. Chandra B, Sharma RK (2016) Fast learning in deep neural networks. Neurocomputing 171:1205–1215CrossRefGoogle Scholar
  9. Chatfield K, Lempitsky VS, Vedaldi A, Zisserman A (2011) The devil is in the details: an evaluation of recent feature encoding methods. In: BMVC, vol 2, p 8Google Scholar
  10. Chum O, Perd’och M, Matas J (2009) Geometric min-hashing: finding a (thick) needle in a haystack. In: IEEE conference on computer vision and pattern recognition, CVPR 2009, IEEE, pp 17–24Google Scholar
  11. Dai L, Sun X, Wu F, Yu N (2013) Large scale image retrieval with visual groups. In: 2013 IEEE international conference on image processing, IEEE, pp 2582–2586Google Scholar
  12. Dai Q, Li J, Wang J, Chen Y, Jiang YG (2016) A bayesian hashing approach and its application to face recognition. Neurocomputing 213:5–13CrossRefGoogle Scholar
  13. Datar M, Immorlica N, Indyk P, Mirrokni VS (2004) Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the twentieth annual symposium on computational geometry, ACM, pp 253–262Google Scholar
  14. Deng C, Deng H, Liu X, Yuan Y (2015) Adaptive multi-bit quantization for hashing. Neurocomputing 151:319–326CrossRefGoogle Scholar
  15. Ding G, Zhou J, Guo Y, Lin Z, Zhao S, Han J (2017) Large-scale image retrieval with sparse embedded hashing. Neurocomputing 257:24–36Google Scholar
  16. Erin Liong V, Lu J, Wang G, Moulin P, Zhou J (2015) Deep hashing for compact binary codes learning. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2475–2483Google Scholar
  17. Feng D, Yang J, Liu C (2013) An efficient indexing method for content-based image retrieval. Neurocomputing 106:103–114CrossRefGoogle Scholar
  18. Goh H, Thome N, Cord M, Lim JH (2014) Learning deep hierarchical visual feature coding. IEEE Trans Neural Netw Learn Syst 25(12):2212–2225CrossRefGoogle Scholar
  19. Gong Y, Kumar S, Verma V, Lazebnik S (2012) Angular quantization-based binary codes for fast similarity search. In: Advances in neural information processing systems, pp 1196–1204Google Scholar
  20. Gong Y, Kumar S, Rowley HA, Lazebnik S (2013a) Learning binary codes for high-dimensional data using bilinear projections. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 484–491Google Scholar
  21. Gong Y, Lazebnik S, Gordo A, Perronnin F (2013b) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans Pattern Anal Mach Intell 35(12):2916–2929CrossRefGoogle Scholar
  22. Grauman K (2010) Efficiently searching for similar images. Commun ACM 53(6):84–94CrossRefGoogle Scholar
  23. Grauman K, Fergus R (2013) Learning binary hash codes for large-scale image search. In: Cipolla R, Battiato S, Farinella GM (eds) Machine learning for computer vision. Springer, pp 49–87Google Scholar
  24. Guo QZ, Zeng Z, Zhang S (2015) Adaptive bit allocation hashing for approximate nearest neighbor search. Neurocomputing 151:719–728CrossRefGoogle Scholar
  25. Hadjieleftheriou M, Manolopoulos Y, Theodoridis Y, Tsotras VJ (2008) R-trees—a dynamic index structure for spatial searching. In: Shekhar S, Xiong H (eds) Encyclopedia of GIS. Springer, pp 993–1002Google Scholar
  26. He J, Feng J, Liu X, Cheng T, Lin TH, Chung H, Chang SF (2012) Mobile product search with bag of hash bits and boundary reranking. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 3005–3012Google Scholar
  27. He K, Wen F, Sun J (2013) K-means hashing: an affinity-preserving quantization method for learning binary compact codes. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2938–2945Google Scholar
  28. Heo JP, Lee Y, He J, Chang SF, Yoon SE (2012) Spherical hashing. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 2957–2964Google Scholar
  29. Hou G, Cui R, Pan Z, Zhang C (2015) Tree-based compact hashing for approximate nearest neighbor search. Neurocomputing 166:271–281CrossRefGoogle Scholar
  30. Huang Y, Wu Z, Wang L, Tan T (2014) Feature coding in image classification: a comprehensive study. IEEE Trans Pattern Anal Mach Intell 36(3):493–506CrossRefGoogle Scholar
  31. Jiang YG, Wang J, Chang SF (2011) Lost in binarization: query-adaptive ranking for similar image search with compact codes. In: Proceedings of the 1st ACM international conference on multimedia retrieval, ACM, p 16Google Scholar
  32. Jiang YG, Wang J, Xue X, Chang SF (2013) Query-adaptive image search with hash codes. IEEE Trans Multimed 15(2):442–453CrossRefGoogle Scholar
  33. Johnson J, Krishna R, Stark M, Li LJ, Shamma D, Bernstein M, Fei-Fei L (2015) Image retrieval using scene graphs. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3668–3678Google Scholar
  34. Ke Y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of the 2004 IEEE computer society conference on computer vision and pattern recognition, CVPR 2004, IEEE, vol 2, pp II–506Google Scholar
  35. Kim S, Choi S (2011) Semi-supervised discriminant hashing. In: 2011 IEEE 11th international conference on data mining (ICDM), IEEE, pp 1122–1127Google Scholar
  36. Kim S, Choi S (2015) Bilinear random projections for locality-sensitive binary codes. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1338–1346Google Scholar
  37. Kim IJ, Xie X (2015) Handwritten Hangul recognition using deep convolutional neural networks. Int J Doc Anal Recognit (IJDAR) 18(1):1–13CrossRefGoogle Scholar
  38. Kong W, Li WJ (2012a) Double-bit quantization for hashing. In: AAAI, vol 1, p 5Google Scholar
  39. Kong W, Li WJ (2012b) Isotropic hashing. In: Advances in neural information processing systems, pp 1646–1654Google Scholar
  40. Kong W, Li WJ, Guo M (2012) Manhattan hashing for large-scale image retrieval. In: Proceedings of the 35th international ACM SIGIR conference on research and development in information retrieval, ACM, pp 45–54Google Scholar
  41. Kulis B, Darrell T (2009) Learning to hash with binary reconstructive embeddings. In: Advances in neural information processing systems, pp 1042–1050Google Scholar
  42. Kulis B, Grauman K (2012) Kernelized locality-sensitive hashing. IEEE Trans Pattern Anal Mach Intell 34(6):1092–1104CrossRefGoogle Scholar
  43. Kumar N, Zhang L, Nayar S (2008) What is a good nearest neighbors algorithm for finding similar patches in images? In: European conference on computer vision, Springer, pp 364–378Google Scholar
  44. Kuo CH, Chou YH, Chang PC (2016) Using deep convolutional neural networks for image retrieval. Electron Imaging 2:1–6CrossRefGoogle Scholar
  45. Lai H, Yan P, Shu X, Wei Y, Yan S (2016) Instance-aware hashing for multi-label image retrieval. IEEE Trans Image Process 25(6):2469–2479MathSciNetzbMATHCrossRefGoogle Scholar
  46. LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444CrossRefGoogle Scholar
  47. Lee Y, Heo JP, Yoon SE (2014) Quadra-embedding: binary code embedding with low quantization error. Comput Vis Image Underst 125:214–222CrossRefGoogle Scholar
  48. Leutenegger S, Chli M, Siegwart RY (2011) Brisk: binary robust invariant scalable keypoints. In: 2011 International conference on computer vision, IEEE, pp 2548–2555Google Scholar
  49. Li P, Cheng J, Lu H (2013) Hashing with dual complementary projection learning for fast image retrieval. Neurocomputing 120:83–89CrossRefGoogle Scholar
  50. Lin G, Shen C, Suter D, van den Hengel A (2013a) A general two-step approach to learning-based hashing. In: Proceedings of the IEEE international conference on computer vision, pp 2552–2559Google Scholar
  51. Lin Y, Jin R, Cai D, Yan S, Li X (2013b) Compressed hashing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 446–451Google Scholar
  52. Liu W, Wang J, Kumar S, Chang SF (2011a) Hashing with graphs. In: Proceedings of the 28th international conference on machine learning (ICML-11), pp 1–8Google Scholar
  53. Liu Y, Zhou S, Chen Q (2011b) Discriminative deep belief networks for visual data classification. Pattern Recognit 44(10):2287–2296zbMATHCrossRefGoogle Scholar
  54. Liu W, Wang J, Ji R, Jiang YG, Chang SF (2012a) Supervised hashing with kernels. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 2074–2081Google Scholar
  55. Liu W, Wang J, Mu Y, Kumar S, Chang SF (2012b) Compact hyperplane hashing with bilinear functions. arXiv preprint arXiv:1206.4618
  56. Liu X, Lang B, Xu Y, Cheng B (2012c) Feature grouping and local soft match for mobile visual search. Pattern Recognit Lett 33(3):239–246CrossRefGoogle Scholar
  57. Liu X, He J, Lang B, Chang SF (2013) Hash bit selection: a unified solution for selection problems in hashing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1570–1577Google Scholar
  58. Liu W, Mu C, Kumar S, Chang SF (2014a) Discrete graph hashing. In: Advances in neural information processing systems, pp 3419–3427Google Scholar
  59. Liu X, He J, Lang B (2014b) Multiple feature kernel hashing for large-scale visual search. Pattern Recognit 47(2):748–757zbMATHCrossRefGoogle Scholar
  60. Liu X, Mu Y, Lang B, Chang SF (2014c) Mixed image-keyword query adaptive hashing over multilabel images. ACM Trans Multimed Comput Commun Appl (TOMM) 10(2):22Google Scholar
  61. Liu X, Mu Y, Zhang D, Lang B, Li X (2015) Large-scale unsupervised hashing with shared structure learning. IEEE Trans Cybern 45(9):1811–1822CrossRefGoogle Scholar
  62. Liu H, Wang R, Shan S, Chen X (2016a) Deep supervised hashing for fast image retrieval. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2064–2072Google Scholar
  63. Liu X, Du B, Deng C, Liu M, Lang B (2016b) Structure sensitive hashing with adaptive product quantization. IEEE Trans Cybern 46(10):2252–2264CrossRefGoogle Scholar
  64. Liu X, Fan X, Deng C, Li Z, Su H, Tao D (2016c) Multilinear hyperplane hashing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5119–5127Google Scholar
  65. Liu X, Huang L, Deng C, Lang B, Tao D (2016d) Query-adaptive hash code ranking for large-scale multi-view visual search. IEEE Trans Image Process 25(10):4514–4524MathSciNetzbMATHCrossRefGoogle Scholar
  66. Liu L, Yu M, Shao L (2017a) Learning short binary codes for large-scale image retrieval. IEEE Trans Image Process 26(3):1289–1299MathSciNetzbMATHCrossRefGoogle Scholar
  67. Liu W, Ma H, Qi H, Zhao D, Chen Z (2017b) Deep learning hashing for mobile visual search. EURASIP J Image Video Process 1:17CrossRefGoogle Scholar
  68. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110CrossRefGoogle Scholar
  69. Moran S, Lavrenko V, Osborne M (2013a) Neighbourhood preserving quantisation for LSH. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval, ACM, pp 1009–1012Google Scholar
  70. Moran S, Lavrenko V, Osborne M (2013b) Variable bit quantisation for LSH. In: ACL (2), pp 753–758Google Scholar
  71. Mu Y, Liu Z (2017) Deep hashing: a joint approach for image signature learning. In: AAAI, pp 2380–2386Google Scholar
  72. Nister D, Stewenius H (2006) Scalable recognition with a vocabulary tree. In: 2006 IEEE computer society conference on computer vision and pattern recognition, IEEE, vol 2, pp 2161–2168Google Scholar
  73. Norouzi M, Blei DM (2011) Minimal loss hashing for compact binary codes. In: Proceedings of the 28th international conference on machine learning (ICML-11), pp 353–360Google Scholar
  74. Norouzi M, Punjani A, Fleet DJ (2012) Fast search in hamming space with multi-index hashing. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 3108–3115Google Scholar
  75. Raginsky M, Lazebnik S (2009) Locality-sensitive binary codes from shift-invariant kernels. In: Advances in neural information processing systems, pp 1509–1517Google Scholar
  76. Rublee E, Rabaud V, Konolige K, Bradski G (2011) ORB: an efficient alternative to sift or surf. In: 2011 International conference on computer vision, IEEE, pp 2564–2571Google Scholar
  77. Salakhutdinov R, Hinton GE (2009) Deep boltzmann machines. In: AISTATS, vol 1, p 3Google Scholar
  78. Samet H (1984) The quadtree and related hierarchical data structures. ACM Comput Surv (CSUR) 16(2):187–260MathSciNetCrossRefGoogle Scholar
  79. Shen F, Shen C, Liu W, Tao Shen H (2015) Supervised discrete hashing. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 37–45Google Scholar
  80. Shi M, Xu R, Tao D, Xu C (2013) W-tree indexing for fast visual word generation. IEEE Trans Image Process 22(3):1209–1222MathSciNetzbMATHCrossRefGoogle Scholar
  81. Song J, Gao L, Zou F, Yan Y, Sebe N (2016) Deep and fast: deep learning hashing with semi-supervised graph construction. Image Vis Comput 55:101–108Google Scholar
  82. Tang J, Li Z, Wang M, Zhao R (2015) Neighborhood discriminant hashing for large-scale image retrieval. IEEE Trans Image Process 24(9):2827–2840MathSciNetzbMATHCrossRefGoogle Scholar
  83. Wang J, Kumar S, Chang SF (2010) Semi-supervised hashing for scalable image retrieval. In: 2010 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 3424–3431Google Scholar
  84. Wang X, Yang M, Cour T, Zhu S, Yu K, Han TX (2011) Contextual weighting for vocabulary tree based image retrieval. In: 2011 IEEE international conference on computer vision (ICCV), IEEE, pp 209–216Google Scholar
  85. Wang J, Kumar S, Chang SF (2012) Semi-supervised hashing for large-scale search. IEEE Trans Pattern Anal Mach Intell 34(12):2393–2406CrossRefGoogle Scholar
  86. Wang J, Wang J, Yu N, Li S (2013) Order preserving hashing for approximate nearest neighbor search. In: Proceedings of the 21st ACM international conference on multimedia, ACM, pp 133–142Google Scholar
  87. Wang X, Qiu S, Liu K, Tang X (2014) Web image re-ranking using query-specific semantic signatures. IEEE Trans Pattern Anal Mach Intell 36(4):810–823CrossRefGoogle Scholar
  88. Wang Z, Duan LY, Lin J, Wang X, Huang T, Gao W (2015) Hamming compatible quantization for hashing. In: Proceedings of the twenty-fourth international joint conference on artificial intelligence, IJCAI, vol 15Google Scholar
  89. Wang J, Liu W, Kumar S, Chang SF (2016a) Learning to hash for indexing big data—a survey. Proc IEEE 104(1):34–57CrossRefGoogle Scholar
  90. Wang J, Xu XS, Guo S, Cui L, Wang XL (2016b) Linear unsupervised hashing for ANN search in Euclidean space. Neurocomputing 171:283–292CrossRefGoogle Scholar
  91. Wang J, Zhang T, Song J, Sebe N, Shen HT (2016c) A survey on learning to hash. arXiv preprint arXiv:1606.00185
  92. Wang K, Tang J, Wang N, Shao L (2016d) Semantic boosting cross-modal hashing for efficient multimedia retrieval. Inf Sci 330:199–210CrossRefGoogle Scholar
  93. Wang X, Shi Y, Kitani KM (2016e) Deep supervised hashing with triplet labels. arXiv preprint arXiv:1612.03900
  94. Wang Y, Yao H, Zhao S (2016f) Auto-encoder based dimensionality reduction. Neurocomputing 184:232–242CrossRefGoogle Scholar
  95. Weiss Y, Torralba A, Fergus R (2009) Spectral hashing. In: Advances in neural information processing systems, pp 1753–1760Google Scholar
  96. Weiss Y, Fergus R, Torralba A (2012) Multidimensional spectral hashing. In: European conference on computer vision, Springer, pp 340–353Google Scholar
  97. Wu L, Zhao K, Lu H, Wei Z, Lu B (2015) Distance preserving marginal hashing for image retrieval. In: 2015 IEEE international conference on multimedia and expo (ICME), IEEE, pp 1–6Google Scholar
  98. Wu J, Feng L, Liu S, Sun M (2017) Image retrieval framework based on texton uniform descriptor and modified manifold ranking. J Vis Commun Image Represent 49:78–88CrossRefGoogle Scholar
  99. Xiong C, Chen W, Chen G, Johnson DM, Corso JJ (2014) Adaptive quantization for hashing: an information-based approach to learning binary codes. In: SDM, SIAM, vol 1, p 2Google Scholar
  100. Xu C, Liu Q, Ye M (2017) Age invariant face recognition and retrieval by coupled auto-encoder networks. Neurocomputing 222:62–71CrossRefGoogle Scholar
  101. Yang B, Shang X, Pang S (2017a) Isometric hashing for image retrieval. Signal Process Image Commun 59:117–130Google Scholar
  102. Yang E, Deng C, Liu W, Liu X, Tao D, Gao X (2017b) Pairwise relationship guided deep hashing for cross-modal retrieval. In: AAAI, pp 1618–1625Google Scholar
  103. Yao T, Kong X, Fu H, Tian Q (2016) Semantic consistency hashing for cross-modal retrieval. Neurocomputing 193:250–259CrossRefGoogle Scholar
  104. Yu X, Zhang S, Liu B, Zhong L, Metaxas D (2013) Large scale medical image search via unsupervised PCA hashing. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 393–398Google Scholar
  105. Zhang D, Wang J, Cai D, Lu J (2010) Self-taught hashing for fast similarity search. In: Proceedings of the 33rd international ACM SIGIR conference on research and development in information retrieval, ACM, pp 18–25Google Scholar
  106. Zhang X, Zhang L, Shum HY (2012) Qsrank: query-sensitive hash code ranking for efficient-neighbor search. In: 2012 IEEE conference on computer vision and pattern recognition (CVPR), IEEE, pp 2058–2065Google Scholar
  107. Zhang R, Lin L, Zhang R, Zuo W, Zhang L (2015) Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. IEEE Trans Image Process 24(12):4766–4779MathSciNetzbMATHCrossRefGoogle Scholar
  108. Zhang W, Ji J, Zhu J, Li J, Xu H, Zhang B (2016) Bithash: an efficient bitwise locality sensitive hashing method with applications. Knowl Based Syst 97:40–47CrossRefGoogle Scholar
  109. Zhao H, Wang Z, Liu P, Wu B (2016) A fast binary encoding mechanism for approximate nearest neighbor search. Neurocomputing 178:112–122CrossRefGoogle Scholar
  110. Zhou J, Fu H, Kong X (2011) A balanced semi-supervised hashing method for CBIR. In: 2011 18th IEEE international conference on image processing (ICIP), IEEE, pp 2481–2484Google Scholar
  111. Zhu H, Long M, Wang J, Cao Y (2016) Deep hashing network for efficient similarity retrieval. In: AAAI, pp 2415–2421Google Scholar
  112. Zhu S, Jin D, Liang Z, Wang Q, Sun Y, Xu G (2017) Integration of semantic and visual hashing for image retrieval. J Vis Commun Image Represent 44:229–235CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media B.V., part of Springer Nature 2017

Authors and Affiliations

  1. 1.Department of Information Science and Technology, College of EngineeringAnna UniversityChennaiIndia

Personalised recommendations