PK-Tree: A Spatial Index Structure for High Dimensional Point Data

Wang, Wei; Yang, Jiong; Muntz, Richard

doi:10.1007/978-1-4615-1379-7_20

Wei Wang⁴,
Jiong Yang⁴ &
Richard Muntz⁵

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 579))

183 Accesses
2 Citations

Abstract

In this chapter we present the PK-tree which is an index structure for high dimensional point data. The proposed indexing structure can be viewed as combining aspects of the PR-quad or K-D tree but where unnecessary nodes are eliminated. The unnecessary nodes are typically the result of skew in the point distribution and we show that by eliminating these nodes the performance of the resulting index is robust to skewed data distributions. The index structure is formally defined, efficiently updatable and bounds on the number of nodes and the mean height of the tree can be proved. Bounds on the expected height of the tree can be given under certain mild constraints on the spatial distribution of points. Empirical evidence both on real data sets and generated data sets shows that the PK-tree outperforms the recently proposed spatial indexes based on the R-tree such as the SR-tree and X-tree by a wide margin. It is also significant that the relative performance advantage of the PK-tree grows with the dimensionality of the data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

M. Abrash. BSP Trees. Dr. Dobbs Sourcebook, 20(14), 49–52, May/June 1995.
Google Scholar
N. Beckmann, H.-P. Kriegel, R. Schneider, and Bernhard Seeger. The R^*-tree: an efficient and robust access method for points and rectangles. Proc. ACM SIGMOD Conf. on Management of Data, 322–331, 1990.
Google Scholar
S. Berchtold, D. A. Keim, and H.-P. Kriegel. The X-tree: an index structure for high-dimensional data. Proc. 22nd Int. Conf. on Very Large Data Bases (VLDB), 28–39, 1996.
Google Scholar
P. Ciaccia, M. Patella, and P. Zezula. M-tree: an efficient access method for similarity search in metric spaces. Proc. 23rd Int. Conf. on Very Large Data Bases (VLDB), 426–435, 1997.
Google Scholar
A. Guttman. R-trees: a dynamic index structure for spatial searching. Proc. ACM SIGMOD Conf. on Management of Data,47–57, 1984.
Google Scholar
A. Henrich, H.-W. Six, and P. Widmayer. The LSD tree: spatial access to multi-dimensional point and non-point objects. Proc. 15th Int. Conf. on Very Large Data Bases (VLDB), 45–54, 1989.
Google Scholar
I. Kamel and C. Faloutsos. Hilbert R-tree: an improved R-tree using fractals. Proc. 20th Int. Conf. on Very Large Data Bases (VLDB), 500–509, 1994.
Google Scholar
N. Katayama and S. Satoh. The SR-tree: an index structure for high-dimensional nearest neighbor queries. Proc. ACM SIGMOD Conf. on Management of Data, 369–380, 1997.
Google Scholar
K.-I. Lin, H. V. Jagadish, and C. Faloutsos. The TV-tree: an index structure for high-dimensional data. VLDB Journal, 3(4):517–542, 1994.
Article Google Scholar
R. Motwani. Randomized Algorithms, Cambridge University Press, 1997.
Google Scholar
J. T. Robinson. The K-D-B-tree: a search structure for large multidimensional dynamic indexes. Proc. ACM SIGMOD Conf. on Management of Data, 10–18, 1981.
Google Scholar
H. Samet. The design and analysis of spatial data structures. Addison-Wesley Publishing Company, 1990.
Google Scholar
T. K. Sellis, N. Roussopoulos, and C. Faloutsos. The R+-tree: a dynamic index for multi-dimensional objects. Proc. 13th Int. Conf. on Very Large Data Bases (VLDB), 507–518, 1987.
Google Scholar
W. Wang, J. Yang, and R. Muntz. PK-tree: a dynamic spatial index structure for large data sets. UCLA Computer Science Department Technical Report #970039, 1997.
Google Scholar
W. Wang, J. Yang, and R. Muntz. PK-tree: a spatial index structure for high dimensional point data. UCLA Computer Science Department Technical Report #980032, 1998.
Google Scholar
W. Wang, J. Yang, and R. Muntz. PK-tree: a spatial index structure for high dimensional point data. Proc. Int. Conf. on Foundations of Data Organozation and Algorithms (FODO), 1998.
Google Scholar
J. Yang, W. Wang, and R. Muntz. Yet another spatial indexing structure. UCLA Computer Science Department Technical Report #970040, 1997.
Google Scholar

Download references

Author information

Authors and Affiliations

IBM T.J. Watson Research Center, USA
Wei Wang & Jiong Yang
Department of Computer Science, University of California, Los Angeles, USA
Richard Muntz

Authors

Wei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jiong Yang
View author publications
You can also search for this author in PubMed Google Scholar
Richard Muntz
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Kobe University, Japan
Katsumi Tanaka
University of Southern California, USA
Shahram Ghandeharizadeh
Kyoto University, Japan
Yahiko Kambayashi

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wang, W., Yang, J., Muntz, R. (2000). PK-Tree: A Spatial Index Structure for High Dimensional Point Data. In: Tanaka, K., Ghandeharizadeh, S., Kambayashi, Y. (eds) Information Organization and Databases. The Springer International Series in Engineering and Computer Science, vol 579. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-1379-7_20

Download citation

DOI: https://doi.org/10.1007/978-1-4615-1379-7_20
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5524-3
Online ISBN: 978-1-4615-1379-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics