Abstract
In recent years, some approximate high-dimensional indexing techniques have shown promising results by trading off quality guarantees for improved query performance. While the query performance and quality of these methods has been well studied, however, the performance of index maintenance has not yet been reported in any detail. Here, we focus on the dynamic behavior of the balanced NV-tree, which is a disk-based approximate index for very large collections. We report on an initial study of the effects of several implementation choices for the balanced NV-tree, and show that with appropriate implementation, significant performance improvements are possible. Overall, the proposed techniques not only reduce maintenance cost, but can also improve search performance significantly with minimal loss of search quality.
Similar content being viewed by others
Notes
Since the split partition contained at least p descriptors, as many as p random disk reads may be required to find the descriptors. Unless the collection is very large, p random reads cost far more than a sequential scan.
Note that the lack of quality in single index searches is due to the index configuration used, which is a balanced NV-tree with leaves of 32 pages; this configuration was chosen since it generates small indices very quickly. In [8], it was shown that with smaller leaves, better line selections and other configurations, the unbalanced NV-tree is quite effective for single index searches.
References
Amsaleg, L., Gros, P.: Content-based retrieval using local descriptors: problems and issues from a database perspective. Pattern Anal. Appl. 4(2/3), 108–124 (2001)
Bainbridge, D., Browne, P., Cairns, P., Rüger, S., Li-Qun, X.: Managing the growth of multimedia digital content. ERCIM News (62), 16–17 (2005)
Casey, M., Slaney, M.: Song intersection by approximate nearest neighbor search. In: Proceedings of International Conference on Music Information Retrieval (ISMIR), Victoria, BC, Canada (2006)
Chierichetti, F., Panconesi, A., Raghavan, P., Sozio, M., Tiberi, A., Upfal, E.: Finding near neighbors through cluster pruning. In: Proceedings of the ACM Symposium on Principles of Database Systems (PODS), Beijing, China (2007)
Datar, M., Indyk, P., Immorlica, N., Mirrokni, V.S.: Locality-Sensitive Hashing using Stable Distributions. MIT Press, Cambridge (2006)
Fagin, R., Kumar, R., Sivakumar, D.: Efficient similarity search and classification via rank aggregation. In: Proceedings of the ACM SIGMOD International Conference on Management of Data. San Diego, CA, USA (2003)
Lejsek, H., Ásmundsson, F.H., Jónsson, B.Þ., Amsaleg, L.: Scalability of local image descriptors: a comparative study. In: Proceedings of the ACM International Conference on Multimedia. Santa Barbara, CA, USA (2006)
Lejsek, H., Ásmundsson, F.H., Jónsson, B.Þ., Amsaleg, L.: NV-tree: an efficient disk-based index for approximate search in very large high-dimensional collections. IEEE Trans. Pattern Anal. Mach. Intell. 31(5), 869–883 (2009)
Li, C., Chang, E., Garcia-Molina, H., Wiederhold, G.: Clustering for approximate similarity search in high-dimensional spaces. IEEE Trans. Knowl. Data Eng. 14(4), 792–808 (2002)
Liu, T., Moore, A., Gray, A., Yang, K.: An investigation of practical approximate nearest neighbor algorithms. In: Proceedings of Neural Information Processing Systems (NIPS). Vancouver, BC, Canada (2004)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), (2004)
Ólafsson, A.: Dynamic behavior of balanced NV-trees. Master’s Thesis, School of Computer Science, Reykjavík University (2008)
Ólafsson, A., Jónsson, B.Þ., Amsaleg, L.: Dynamic behavior of balanced NV-trees. In: Proceedings of Content-Based Multimedia Indexing. London, England (2008)
Shaft, U., Ramakrishnan, R.: Theory of nearest neighbors indexability. ACM Trans. Database Syst. 31(3), 814–838 (2006)
Zhang, L., Tian, Q.: Multimedia content analysis: model-based approaches vs. data-driven approaches. In: Proceedings of the ACM International Conference on Multimedia. Beijing, China (2009)
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Balakrishnan Prabhakaran.
Rights and permissions
About this article
Cite this article
Ólafsson, A., Þór Jónsson, B., Amsaleg, L. et al. Dynamic behavior of balanced NV-trees. Multimedia Systems 17, 83–100 (2011). https://doi.org/10.1007/s00530-010-0199-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00530-010-0199-4