Locally Optimized Hashing for Nearest Neighbor Search

Tokui, Seiya; Sato, Issei; Nakagawa, Hiroshi

doi:10.1007/978-3-319-18032-8_39

Seiya Tokui¹⁰,
Issei Sato¹¹ &
Hiroshi Nakagawa¹¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9078))

Included in the following conference series:

Pacific-Asia Conference on Knowledge Discovery and Data Mining

4158 Accesses
1 Citations

Abstract

Fast nearest neighbor search (NNS) is becoming important to utilize massive data. Recent work shows that hash learning is effective for NNS in terms of computational time and space. Existing hash learning methods try to convert neighboring samples to similar binary codes, and their hash functions are globally optimized on the data manifold. However, such hash functions often have low resolution of binary codes; each bucket, a set of samples with same binary code, may contain a large number of samples in these methods, which makes it infeasible to obtain the nearest neighbors of given query with high precision. As a result, existing methods require long binary codes for precise NNS. In this paper, we propose Locally Optimized Hashing to overcome this drawback, which explicitly partitions each bucket by solving optimization problem based on that of Spectral Hashing with stronger constraints. Our method outperforms existing methods in image and document datasets in terms of quality of both the hash table and query, especially when the code length is short.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fan, R.-E., Chang, K.-W., Hsieh, C.-J., Wang, X.-R., Lin, C.-J.: Liblinear: A library for large linear classification. Journal of Machine Learning Research 9, 1871–1874 (2008)
MATH Google Scholar
Indyk, P., Motwani, R.: Approximate nearest neighbors: towards removing the curse of dimensionality. In: Proceedings of the 30th Annual ACM Symposium on Theory of Computing, pp. 604–613 (1998)
Google Scholar
Kulis, B., Grauman, K.: Kernelized locality-sensitive hashing for scalable image search. In: IEEE 12th International Conference on Computer Vision, pp. 2130–2137 (2009)
Google Scholar
Lewis, D.D., Yang, Y., Rose, T.G., Li, F.: Rcv1: A new benchmark collection for text categorization research. Journal of Machine Learning Research 5, 361–397 (2004)
Google Scholar
Li, P., König, A.C.: b-Bit minwise hashing. In: Proceedings of the 19th International Conference on World Wide Web, pp. 671–680 (2010)
Google Scholar
Liu, W., Wang, J., Kumar, S., Chang, S.-F.: Hashing with graphs. In: Proceedings of the 28th International Conference on Machine Learning, pp. 1–8 (2011)
Google Scholar
Norouzi, M., Fleet, D.: Minimal loss hashing for compact binary codes. In: Proceedings of the 28th International Conference on Machine Learning, pp. 353–360 (2011)
Google Scholar
Raginsky, M., Lazebnik, S.: Locality-sensitive binary codes from shift-invariant kernels. In: Advances in Neural Information Processing Systems, vol. 22, pp. 1509–1517 (2009)
Google Scholar
Wang, J., Kumar, S., Chang, S.-F.: Sequential projection learning for hashing with compact codes. In: Proceedings of the 27th International Conference on Machine Learning, pp. 1127–1134 (2010)
Google Scholar
Watkins, D.S.: Fundamentals of Matrix Computations. Pure and Applied Mathematics: A Wiley Series of Texts, Monographs and Tracts. Wiley, third edition edition (2010)
Google Scholar
Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Advances in Neural Information Processing Systems, vol. 21, pp. 1753–1760 (2009)
Google Scholar
Zhang, D., Wang, J., Cai, D., Lu, J.: Self-taught hashing for fast similarity search. In: Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 18–25 (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Preferred Networks, Inc., Tokyo, Japan
Seiya Tokui
The University of Tokyo, Tokyo, Japan
Issei Sato & Hiroshi Nakagawa

Authors

Seiya Tokui
View author publications
You can also search for this author in PubMed Google Scholar
Issei Sato
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Nakagawa
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Seiya Tokui .

Editor information

Editors and Affiliations

Ho Chi Minh City University of Technology, Ho Chi Minh City, Vietnam
Tru Cao
Singapore Management University, Singapore, Singapore
Ee-Peng Lim
Nanjing University, Nanjing, China
Zhi-Hua Zhou
Japan Advanced Institute of Science and Technology, Nomi City, Japan
Tu-Bao Ho
The University of Hong Kong, Hong Kong, Hong Kong SAR
David Cheung
Osaka University, Osaka, Japan
Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tokui, S., Sato, I., Nakagawa, H. (2015). Locally Optimized Hashing for Nearest Neighbor Search. In: Cao, T., Lim, EP., Zhou, ZH., Ho, TB., Cheung, D., Motoda, H. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2015. Lecture Notes in Computer Science(), vol 9078. Springer, Cham. https://doi.org/10.1007/978-3-319-18032-8_39

Download citation

DOI: https://doi.org/10.1007/978-3-319-18032-8_39
Published: 09 May 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18031-1
Online ISBN: 978-3-319-18032-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics