Selectivity Estimation for Spatial Joins with Geometric Selections
The spatial join is an expensive operation that is commonly used in spatial database systems. To generate efficient query plans for queries involving spatial joins, it is crucial to obtain accurate selectivity estimates for these operations. In this paper we introduce a framework for estimating the selectivity of spatial joins constrained by geometric selections. The centerpiece of the framework is the Euler Histogram, which decomposes the estimation process into estimations on vertices, edges and faces. Based on the characteristics of different datasets, different probabilistic models can be plugged into the framework to provide better estimation results. To demonstrate the effectiveness of this framework, we implement it by incorporating two existing probabilistic models, and compare its performance with the Geometric Histogram [1] and the algorithm recently proposed by Mamoulis and Papadias [2].
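To illustrate what keeping counts on vertices, edges and faces buys, the following is a minimal sketch of a basic Euler histogram used for plain intersection counting. It is not the join-selectivity framework of the paper. The class name `EulerHistogram`, its methods, the unit-cell grid, and the restriction to grid-aligned query windows and rectangles of positive extent are all assumptions made for this example. Each rectangle increments every cell (face), interior grid edge and interior grid vertex it overlaps; by Euler's formula, faces minus edges plus vertices equals 1 for each rectangle that meets the window, so the signed sum counts each such object exactly once.

```python
import math


class EulerHistogram:
    """Hypothetical sketch: counts on faces, interior edges and interior
    vertices of an nx-by-ny grid of unit cells."""

    def __init__(self, nx, ny):
        self.nx, self.ny = nx, ny
        # Doubled indexing: an even index is a cell, an odd index is the
        # grid line between two neighbouring cells.
        self.h = [[0] * (2 * ny - 1) for _ in range(2 * nx - 1)]

    @staticmethod
    def _cell_range(lo, hi, n):
        # First and last cell overlapped by the open interval (lo, hi),
        # clamped to the grid.
        first = max(0, math.floor(lo))
        last = min(n - 1, math.ceil(hi) - 1)
        return first, last

    def insert(self, x1, y1, x2, y2):
        """Add the rectangle (x1, y1)-(x2, y2); assumes x1 < x2 and y1 < y2."""
        cx0, cx1 = self._cell_range(x1, x2, self.nx)
        cy0, cy1 = self._cell_range(y1, y2, self.ny)
        # Every face, edge and vertex between the first and last cell is hit.
        for i in range(2 * cx0, 2 * cx1 + 1):
            for j in range(2 * cy0, 2 * cy1 + 1):
                self.h[i][j] += 1

    def count_intersecting(self, qx0, qy0, qx1, qy1):
        """Objects overlapping the window of cells [qx0..qx1] x [qy0..qy1]."""
        total = 0
        for i in range(2 * qx0, 2 * qx1 + 1):
            for j in range(2 * qy0, 2 * qy1 + 1):
                # Faces and vertices add, edges subtract.
                sign = 1 if (i + j) % 2 == 0 else -1
                total += sign * self.h[i][j]
        return total


eh = EulerHistogram(4, 4)
eh.insert(0.5, 0.5, 2.5, 1.5)   # spans cells x 0..2, y 0..1
eh.insert(3.2, 3.2, 3.8, 3.8)   # inside cell (3, 3)
eh.insert(1.2, 2.1, 1.8, 2.9)   # inside cell (1, 2)

whole_grid = eh.count_intersecting(0, 0, 3, 3)
lower_left = eh.count_intersecting(0, 0, 1, 1)
column_strip = eh.count_intersecting(1, 1, 1, 2)
```

Summing only the cell counts over the whole grid would count the first rectangle six times, once per cell it covers; the edge and vertex terms cancel that overcounting.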
Keywords: Geographical Information System, Grid Granularity, Area Selectivity, Spatial Database, Spatial Object
- 1. Ning An, Zhen-Yu Yang, and Anand Sivasubramaniam. Selectivity estimation for spatial joins. In ICDE 2001, Proceedings of the 17th International Conference on Data Engineering, pages 368–375, April 2001.
- 2. Nikos Mamoulis and Dimitris Papadias. Selectivity estimation of complex spatial queries. In SSTD’01, Proceedings of the 7th International Symposium on Spatial and Temporal Databases, July 2001.
- 3. Philippe Rigaux, Michel Scholl, and Agnès Voisard. Spatial Databases with Applications to GIS, chapter 1.3.1, page 14. Morgan Kaufmann Publishers, 2001.
- 4. R. Beigel and Egemen Tanin. The geometry of browsing. In Proceedings of the Latin American Symposium on Theoretical Informatics, Brazil, pages 331–340, 1998.
- 5. Chengyu Sun, Divyakant Agrawal, and Amr El Abbadi. Exploring spatial datasets with histograms. In ICDE 2002, Proceedings of the 18th International Conference on Data Engineering, February 2002.
- 7. Chengyu Sun, Divyakant Agrawal, and Amr El Abbadi. Selectivity estimation for spatial joins with geometric selections (extended version). Technical report, Computer Science Department, University of California, Santa Barbara, 2002. http://www.cs.ucsb.edu/research/trcs/docs/2002-01.ps.
- 10. Larry C. Munn and Christopher S. Arneson. Draft 1:100,000-scale digital soils map of Wyoming. Technical report, University of Wyoming Agricultural Experiment Station, 1999.
- 11. Michael Stonebraker, James Frew, Kenn Gardels, and Jeff Meredith. The Sequoia 2000 benchmark. In Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, Washington, D.C., May 26–28, 1993, pages 2–11, 1993.
- 12. Wael M. Badawy and Walid G. Aref. On local heuristics to speed up polygon-polygon intersection tests. In ACM-GIS ’99, Proceedings of the 7th International Symposium on Advances in Geographic Information Systems, pages 97–102, 1999.
- 13. Ravi K. Kothuri and Siva Ravada. Efficient processing of large spatial queries using interior approximation. In Proceedings of the 7th International Symposium on Advances in Spatial and Temporal Databases (SSTD 2001), pages 404–421, 2001.