Spatial Queries in the Cloud
Spatial queries in the cloud refer to processing of spatial queries on a distributed and interconnected network of computers that provide computation, storage, and resource management capabilities elastically in large scale. Resources in the cloud can be allocated on demand, and customers only pay for what they use. Cloud offers a number of query processing infrastructure and services ranging from parallel spatial database systems to MapReduce-based systems. Common spatial queries of interest include range queries, joins, and k-nearest neighbor queries.
Support of high-performance queries on large volumes of spatial data becomes increasingly important in many application domains, including geo-spatial problems in numerous fields, location-based services, and emerging scientific applications that are increasingly data and compute intensive. Past research efforts fall into three major directions toward improving spatial query performance: (i) algorithmic...
KeywordsCloud Computing Query Processing Range Query Query Point Spatial Query
- 1.Aji, A, Wang, F, Saltz, JH. Towards building a high performance spatial query system for large scale medical imaging data. In: Proceedings of the 20th international conference on advances in geographic information systems, Redondo Beach. ACM; 2012. p. 309–18.Google Scholar
- 2.Aji, A, Wang F, Vo H, Lee R, Liu Q, Zhang X, Saltz J. Hadoop GIS: a high performance spatial data warehousing system over mapreduce. Proc VLDB Endowment. 2013;6:1009–20.Google Scholar
- 3.Akdogan A, Demiryurek U, Banaei-Kashani F, Shahabi C. Voronoi-based geospatial query processing with mapreduce. In: Cloud computing technology and science (CloudCom), 2010 IEEE second international conference on, Indianapolis. IEEE; 2010. p. 9–16.Google Scholar
- 4.Eldawy A, Mokbel MF. SpatialHadoop: a MapReduce framework for spatial data. In: Proceedings of the IEEE international conference on data engineering (ICDE’15), Seol. IEEE. 2015.Google Scholar
- 5.Lu J, Guting RH. Parallel secondo: boosting database engines with hadoop. In: Parallel and Distributed Systems (ICPADS), 2012 IEEE 18th international conference on, Singapore. IEEE. 2012. p. 738–43.Google Scholar
- 6.Lu W, Shen Y, Chen S, Ooi BC. Efficient processing of k nearest neighbor joins using mapreduce. Proc VLDB Endowment. 2012;5:1016–27.Google Scholar
- 7.Nishimura S, Das S, Agrawal D, Abbadi AE. MD-HBase: a scalable multi-dimensional data infrastructure for location aware services. Mobile Data Management (MDM), 2011 12th IEEE international conference on, Lulea. IEEE; 2011. p. 7–16.Google Scholar
- 8.Ray S, Simion B, Brown AD, Johnson R. A parallel spatial data analysis infrastructure for the cloud. In: Proceedings of the 21st ACM SIGSPATIAL international conference on advances in geographic information systems, Orlando. ACM; 2013. p. 284–93.Google Scholar
- 9.Zhang C, Li F, Jestes J. Efficient parallel kNN joins for large data in MapReduce. In: Proceedings of the 15th international conference on extending database technology, Berlin. ACM; 2012. p. 38–49.Google Scholar