An Effective Permutant Selection Heuristic for Proximity Searching in Metric Spaces
The permutation based index has shown to be very effective in medium and high dimensional metric spaces, even in difficult problems such as solving reverse k-nearest neighbor queries. Nevertheless, currently there is no study about which are the desirable features one can ask to a permutant set, or how to select good permutants. Similar to the case of pivots, our experimental results show that, compared with a randomly chosen set, a good permutant set yields to fast query response or to reduce the amount of space used by the index. In this paper, we start by characterizing permutants and studying their predictive power; then we propose an effective heuristic to select a good set of permutant candidates. We also show empirical evidence that supports our technique.
- 2.Bustos, B., Pedreira, O., Brisaboa, N.R.: A dynamic pivot selection technique for similarity search. In: Proc. 1st Workshop on Similarity Search and Applications (SISAP 2008), pp. 105–112 (2008)Google Scholar
- 3.Chávez, E., Figueroa, K., Navarro, G.: Effective proximity retrieval by ordering permutations. IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI) 30(9), 1647–1658 (2009)Google Scholar
- 6.Falchi, F., Kacimi, M., Mass, Y., Rabitti, F., Zezula, P.: SAPIR: Scalable and distributed image searching. In: SAMT (Posters and Demos). CEUR Workshop Proceedings, vol. 300, pp. 11–12 (2007)Google Scholar
- 10.Samet, H.: Foundations of Multidimensional and Metric Data Structures. Morgan Kaufmann (2006)Google Scholar
- 11.Yianilos, P.: Data structures and algorithms for nearest neighbor search in general metric spaces. In: Proc. 4th ACM-SIAM Symposium on Discrete Algorithms (SODA 1993), pp. 311–321 (1993)Google Scholar
- 12.Zezula, P., Amato, G., Dohnal, V., Batko, M.: Similarity Search – The Metric Space Approach. Advances in Database System, vol. 32. Springer (2006)Google Scholar