Abstract
The Distance Index (D-index) is a recently introduced metric indexing structure which capable of state-of-the-art performance in large scale metric search applications. In this paper we address the problem of how to balance the D-index structure for more efficient similarity search. A group of evaluation functions measuring the balance property of a D-index structure are introduced to guide the construction of the indexing structure. The optimization is formulated in a genetic representation that is effectively solved by a generic genetic algorithm (GA). Compared with the classic D-index, balanced D-index structures show a significant improvement in reduction of distance calculations while maintaining a good input-output (IO) performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Yianilos, P.N.: Data structures and algorithms for nearest neighbor search in general metric spaces. In: ACM-SIAM Symposium on Discrete Algorithms, pp. 311–321 (1993)
Uhlmann, J.K.: Satisfying general proximity/similarity queries with metric trees. Information Processing Letters 40(4), 175–179 (1991)
Ciaccia, P., Patella, M., Zezula, P.: M-tree: an efficient access method for similarity search in matric spaces. In: Proceedings of the 23rd International Conference on Very Large Data Bases (VLDB 1997), pp. 426–435 (1997)
Vidal, E.: New formulation and improvements of the nearest-neighbor approximating and eliminating search algorithm (AESA). Pattern Recognition Letters 15(1), 1–7 (1994)
Micó, M.L., Oncina, J., Vidal, E.: A new version of the nearest-neighbor approximating and eliminating search algorithm (AESA) with linear preprocessing time and memory requirements. Pattern Recognition Letters 15(1), 9–17 (1994)
Dohnal, V., Gennaro, C., Savino, P., Zezula, P.: D-Index: distance searching index for metric sata sets. Multimedia Tools and Applications 21(1), 9–33 (2003)
Schmitt, L.M.: Theory of Genetic Algorithms. Theoretical Computer Science 259, 1–61 (2001)
Data available at http://www.net-comber.com/wordurls.html
Ortega, M., Rui, Y., et al.: Supporting ranked Boolean similarity queries in MARS. IEEE Transaction on Knowledge and Data Engineering 10(6), 905–925 (1998)
Data available at http://www.ics.uci.edu/~mlearn/MLRepository.html
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ban, T. (2008). Using Genetic Algorithm to Balance the D-Index Algorithm for Metric Search. In: Ishikawa, M., Doya, K., Miyamoto, H., Yamakawa, T. (eds) Neural Information Processing. ICONIP 2007. Lecture Notes in Computer Science, vol 4985. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69162-4_28
Download citation
DOI: https://doi.org/10.1007/978-3-540-69162-4_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-69159-4
Online ISBN: 978-3-540-69162-4
eBook Packages: Computer ScienceComputer Science (R0)