Cluster Computing

, Volume 20, Issue 4, pp 2931–2942 | Cite as

A PR-quadtree based multi-dimensional indexing for complex query in a cloud system

  • Jian-feng LiEmail author
  • Shi-ping Chen
  • Lin-mao Duan
  • Liang Niu


The state-of-the-art indexing mechanisms for distributed cloud data management systems can not support complex queries, such as multi-dimensional query and range query. To solve this problem, we propose a multi-dimensional indexing mechanism named PR-Chord to support complex queries. PR-Chord is composed of the global index named PR-Index and the Chord network. The multi-dimensional space formed by the range of the multi-dimensional data is divided into hyper-rectangle spaces equally. The PR-Index is a hierarchical index structure based on the improved PR quadtree to index these spaces. The complex query is transformed into the query of leaf nodes of PR-Index. We design the algorithms of query, insertion and deletion to support complex queries. Since PR-Index does not store the multi-dimensional data, its maintenance cost is zero. PR-Chord has the advantages of load balancing and simple algorithm. The experiment results demonstrate that PR-Chord has good query efficiency.


Cloud computing Multi-dimensional data Multi-dimensional query Interval query Data index Consistent hash 



This work was supported by the National Natural Science Foundation of China under Grant 61170277,61472256, the Innovation Program of Shanghai Municipal Education Commission (Program Number 12zz137), the first-class discipline construction project of Shanghai under Grant S1201YLXK, Science Development Foundation of University of Shanghai for Science and Technology (16KJFZ035), the Hujiang Foundation of China (A14006), Project of Zhejiang Philosophy and Social Sciences Planning (16NDJC283yb) and Project of Humanities and Social Sciences in Universities of Zhejiang (2014QN015).


  1. 1.
    Ghemawat, S., Gobioff, H., Leung, S.-T.: The google file system. In: Proceedings of the 19th ACM Symposium on Operating Systems Principles, pp. 29–43 (2003)Google Scholar
  2. 2.
    Chang, F., Dean, J., Ghemawat, S., et al.: Bigtable: a distributed storage system for structured data. In: Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementation, pp. 205–218 (2006)Google Scholar
  3. 3.
    Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. In: Proceedings of the 5th USENIX Symposium on Operating Systems Design and Implementation, pp. 137–150 (2004)Google Scholar
  4. 4.
    Osanaiye, O.A., Cai, H., Choo, K., Dehghantanha, A., Xu, Z., Dlodlo, M.E.: Ensemble-based multi-filter feature selection method for DDoS detection in cloud computing. EURASIP J. Wireless Commun. Netw. 2016, 130 (2016)CrossRefGoogle Scholar
  5. 5.
    Liu, J., Tian, Y., Yu, X., Yang, Z., Jia, X., Ma, C., Xu, Z.: A multi-source approach for bug triage. Int. J. Softw. Eng. Knowl. Eng. 26(9–10), 1593–1604 (2016)CrossRefGoogle Scholar
  6. 6.
    DeCandia, G., Hastorun, D., Jampani, M., et al.: Dynamo: amazon’s highly available key-value store. In: Proceedings of the 21st ACM Symposium on Operating Systems Principles, pp. 205–220 (2007)Google Scholar
  7. 7.
    Lakshman, A., Malik, P.: Cassandra: a decentralized structured storage system. ACM SIGOPS Oper. Syst. Rev. 44(2), 35–40 (2010)CrossRefGoogle Scholar
  8. 8.
    Ma, Y., Meng, X.: Research on indexing for cloud data management. J. Softw. 26(1), 145–166 (2015). (in Chinese)Google Scholar
  9. 9.
    Xia, Z., Junzhou, L., Aibo, S., et al.: A multidimensional indexing for complex query in cloud computing. J. Comput. Res. Dev. 50(8), 1592–1603 (2013). (in Chinese)Google Scholar
  10. 10.
    Stoica, I., Morris, R., Karger, D., et al.: Chord: a scalable peer-to-peer lookup service for internet applications. In: Proceedings of 2001 SIGCOMM, vol. 31, pp. 149–160 (2001)Google Scholar
  11. 11.
    Zhao, B., Kubiatowicz, J., Tapestry, J.A.: An infrastructure for fault-tolerant wide-area location and routing. UCB//CSD-01-1141, University of California at Berkeley, California (2001)Google Scholar
  12. 12.
    Ratnasamy, S., Francis, P., Handley, M., et al.: A scalable content-addressable network. In: Proceedings of the 2001 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications, pp. 161–172 (2001)Google Scholar
  13. 13.
    Rowstron, A., Pastry, D.P.: Scalable, distributed object location and routing for large-scale peer-to-peer systems. In: Proceedings of the IFIP/ACM International Conference on Distributed Systems Platforms, Heidelberg, pp. 329–350 (2001)Google Scholar
  14. 14.
    Maymounkov, P., Kademlia, M.D.: A peer-to-peer information system based on the XOR metric. In: IPTPS’01 Revised Papers from the First International Workshop on Peer-to-Peer Systems, pp. 53–65 (2002)Google Scholar
  15. 15.
    Samet, H.: Foundations of Multidimensional and Metric Data Structures, pp. 28–48. Tsinghua University Press, Beijing (2011)Google Scholar
  16. 16.
    Bereczkya, N., Duchb, A., Németha, K., et al.: Quad-kd trees: a general framework for kd trees and quad trees. Theor. Comput. Sci. 616(2), 126–140 (2016)CrossRefGoogle Scholar
  17. 17.
    Wu, S., Jiang, D., Ooi, B.C., et al.: Efficient B-tree based indexing for cloud data processing. In: Proceedings of the VLDB Endowment, pp. 1207–1218 (2010)Google Scholar
  18. 18.
    Wang, J., Wu, S., Gao, H., et al.: Indexing multi-dimensional data in a cloud system. In: Proceedings of the ACM SIGMOD/PODS Conference, pp. 591–602 (2010)Google Scholar
  19. 19.
    Zhang, X., Ai, J., Wang, Z., et al.: An efficient multi-dimensional index for cloud data management. In: Proceedings of the CIKM Workshop on Cloud Data Management, pp. 17–24 (2009)Google Scholar
  20. 20.
    Ding, L., Qiao, B., Wang, G., et al.: An efficient quad-tree based index structure for cloud data management. In: Proceedings of the 12th International Conference on Web-Age Information Management, pp. 238–250 (2010)Google Scholar
  21. 21.
    Nishimura, S., Das, S., Agrawal, D., et al.: MD-HBase: design and implementation of an elastic data infrastructure for cloud-scale location services. Distrib. Parallel Databases 31(2), 289–319 (2013)CrossRefGoogle Scholar
  22. 22.
    Hsu, Y., Pan, Y., Wei, L., et al.: Key formulation schemes for spatial index in cloud data managements. In: Proceedings of the 13th IEEE Conference on Mobile Data Management, pp. 21–26 (2012)Google Scholar
  23. 23.
    Carlini, E., Lulli, A., Ricci, L.: Dragon: multidimensional range queries on distributed aggregation trees. Future Gener. Comput. Syst. 55(2), 101–115 (2016)CrossRefGoogle Scholar
  24. 24.
    Aguilera, M.K., Golab, W., Shah, M.A.: A practical scalable distributed B-tree. In: The Proceedings of the VLDB Endowment (PVLDB), vol. 1, pp. 598–609 (2008)Google Scholar
  25. 25.
    Aguilera, M.K., Merchant, A., Shah, M.A., et al.: Sinfonia: a new paradigm for building scalable distributed systems. In: SOSP’07 Proceedings of Twenty-First ACM SIGOPS Symposium on Operating Systems Principles, pp. 159–174 (2007)Google Scholar
  26. 26.
    Tanin, E., Harwood, A., Samet, H.: Using a distributed quadtree index in peer-to-peer networks. Int. J. Very Large Data Bases 16(2), 165–178 (2007)CrossRefGoogle Scholar
  27. 27.
    Bently, J.L., Stanat, D.F.: Analysis of range searches in quad trees. Inf. Process. Lett. 3(6), 170–173 (1975)CrossRefzbMATHGoogle Scholar
  28. 28.
    Lee, D.T., Wong, C.K.: Worst-case analysis for region and partial region searches in multidimensional binary search trees and balanced quad trees[J]. Acta Inf. 9(1), 23–29 (1977)CrossRefzbMATHMathSciNetGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2017

Authors and Affiliations

  • Jian-feng Li
    • 1
    • 2
    Email author
  • Shi-ping Chen
    • 1
  • Lin-mao Duan
    • 2
  • Liang Niu
    • 2
  1. 1.Business SchoolUniversity of Shanghai for Science & TechnologyShanghaiChina
  2. 2.College of Economics and ManagementChina Jiliang UniversityHangzhouChina

Personalised recommendations