Developing Parallel Cell-Based Filtering Scheme Under Shared-Nothing Cluster-Based Architecture
A large number of high-dimensional index structures suffer from the so called ’dimensional curse’ problem, i.e., the retrieval performance becomes increasingly degraded as the dimensionality is increased. To solve this problem, the cell-based filtering (CBF) scheme has been proposed, but it shows a linear decrease in performance as the dimensionality is increased. In this paper, we develop a parallel CBF scheme under an SN(Shared Nothing) cluster-based parallel architecture, so as to cope with the linear decrease in retrieval performance. In addition, we devise data insertion, range query and k-NN query processing algorithms which are suitable for the SN parallel architecture. Finally, we show that our parallel CBF scheme achieves good retrieval performance in proportion to the number of servers in the SN architecture and it outperforms a parallel version of the VA-File when the dimensionality is over 10.
KeywordsFeature Vector Range Query Query Point Retrieval Performance Master Node
Unable to display preview. Download preview PDF.
- 1.White, D.A., Jain, R.: Similarity Indexing: Algorithms and Performance. In: Proc. of the SPIE: Storage and Retrieval for Image and Video Databases IV, vol. 2670, pp. 62–75 (1996)Google Scholar
- 3.Berchtold, S., Keim, D.A., Kriegel, H.-P.: The X-tree: An Index Structure for High-Dimensional Data. In: Proceedings of the 22nd VLDB Conference, pp. 28–39 (1996)Google Scholar
- 4.Berchtold, S., Bohm, C., Keim, D., Kriegel, H.-P.: A Cost Model for Nearest Neighbor Search in High-Dimensional Data Space. In: ACM PODS Symposium on Principles of Databases Systems, Tucson, Arizona (1997)Google Scholar
- 5.Weber, R., Schek, H.-J., Blott, S.: A Quantitative Analysis and Performance Study for Similarity-Search Methods in High- Dimensional Spaces. In: Proceedings of 24rd International Conference on Very Large Data Bases, pp. 24–27 (1998)Google Scholar
- 9.Roussopoulos, N., Kelley, S., Vincent, F.: Nearest Neighbor Queries. In: Proc. ACM Int. Conf. on Management of Data(SIGMOD), pp. 71–79 (1995)Google Scholar