Advertisement

An Efficient In-Memory R-Tree Construction Scheme for Spatio-Temporal Data Stream

  • Ting Zhang
  • Lianghuai YangEmail author
  • Donghai Shen
  • Yulei Fan
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11434)

Abstract

In this paper, we proposed an efficient R-tree construction method by bulk loading over spatial-temporal data stream. The core idea is to partition spatial-temporal data stream into time windows and construct an R-tree for each time window. In each time window, we parallelized space partitioning and data stream reception during R-tree construction; and then we adopted sorting-based bulk loading scheme to optimize R-tree construction, which avoided unnecessary synchronization overhead and accelerated the R-tree construction. In addition, to reduce the sorting cost of R-tree bulk loading, sampling-based space partitioning scheme was introduced. Theoretical analysis and experiments demonstrated the effectiveness of our proposed method.

Keywords

R-tree Data stream In-memory index Big data 

References

  1. 1.
    Guttman, A.: R-trees: a dynamic index structure for spatial searching. In: ACM Sigmod International Conference on Management of Data, pp. 47–57 (1984)Google Scholar
  2. 2.
    Liu, D., Li, Q., Cheng, J.: Indexing on main memory spatial object. Remote Sens. Environ. 11(4), 302–308 (1996)Google Scholar
  3. 3.
    Bentley, J.L.: Multidimensional binary search trees used for associative searching. Commun. ACM 18(9), 509–517 (1975)MathSciNetCrossRefGoogle Scholar
  4. 4.
    Finkel, R.A., Bentley, J.L.: Quad trees a data structure for retrieval on composite keys. Acta Informatica 4(1), 1–9 (1974)CrossRefGoogle Scholar
  5. 5.
    Sellis, T.K., Roussopoulos, N., Faloutsos, C.: The R + -Tree: a dynamic index for multi-dimensional objects. In: Proceedings of the 13th International Conference on Very Large Data Bases, pp. 507–518. Morgan Kaufmann Publishers Inc (1987)Google Scholar
  6. 6.
    Beckmann, N., Kriegel, H.P., Schneider, R., et al.: The R*-tree: an efficient and robust access method for points and rectangles. ACM Sigmod Rec. 19(2), 322–331 (1990)CrossRefGoogle Scholar
  7. 7.
    Roussopoulos, N., Leifker, D.: Direct spatial search on pictorial databases using packed R-trees. ACM Sigmod Rec. 14(4), 17–31 (1985)CrossRefGoogle Scholar
  8. 8.
    Zhang, M., Lu, F., Shen, P., et al.: The evolvement and progress of R-Tree family. Chin. J. Comput. 28(3), 289–300 (2005)Google Scholar
  9. 9.
    Kamel, I., Faloutsos, C.: On packing R-trees. In: Proceedings of the Second International Conference on Information and Knowledge Management, pp. 490–499 (1993)Google Scholar
  10. 10.
    Leutenegger, S.T., Edgington, J., Lopez, M.A.: STR: a simple and efficient algorithm for R-tree packing. In: Proceedings of the 13th International Conference on Data Engineering, pp. 497–506. IEEE Computer Society (1997)Google Scholar
  11. 11.
    García, R.Y.J., López, M.A., Leutenegger, S.T.: A greedy algorithm for bulk loading R-trees. In: Proceedings of the 6th ACM International Symposium on Advances in Geographic Information Systems, pp. 163–164 (1998)Google Scholar
  12. 12.
    Tan, H., Luo, W., Ni, L.M.: CloST: a Hadoop-based storage system for big spatio-temporal data analytics. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 2139–2143 (2012)Google Scholar
  13. 13.
    Cary, A., Sun, Z., Hristidis, V., Rishe, N.: Experiences on processing spatial data with MapReduce. In: Winslett, M. (ed.) SSDBM 2009. LNCS, vol. 5566, pp. 302–319. Springer, Heidelberg (2009).  https://doi.org/10.1007/978-3-642-02279-1_24CrossRefGoogle Scholar
  14. 14.
    Zhong, Y., Fang, J., Zhao, X.: VegaIndexer: a distributed composite index scheme for big spatio-temporal sensor data on cloud. In: Geoscience and Remote Sensing Symposium, 2013 IEEE International, pp. 1713–1716 (2013)Google Scholar
  15. 15.
    Zhong, Y., Zhu, X., Fang, J.: Elastic and effective spatio-temporal query processing scheme on Hadoop. In: Proceedings of the 1st ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data, pp. 33–42 (2012)Google Scholar
  16. 16.
    Zhong, Y., Fang, J., Zhao, X.: A distributed storage scheme for big spatio-temporal data. Chin. High Technol. Lett. 23(12), 1219–1229 (2013)Google Scholar
  17. 17.
    Eldawy, A., Mokbel, M.F., Alharthi, S., et al.: SHAHED: a MapReduce-based system for querying and visualizing spatio-temporal satellite data. In: 2015 IEEE 31st International Conference on Data Engineering, pp. 1585–1596 (2015)Google Scholar
  18. 18.
    Li, X., Zheng, W.: Parallel spatial index algorithm based on Hilbert partition. In: 2013 International Conference on Computational and Information Sciences, pp. 876–879 (2013)Google Scholar
  19. 19.
    Liu, Y.: Research on Key Techniques of High Performance Spatial Query Processing for Large Scale Spatial Data. National University of Defense Technology (2013)Google Scholar
  20. 20.
    Nishimura, S., Das, S., Agrawal, D., et al.: MD-HBase: a scalable multi-dimensional data infrastructure for location aware services. In: Proceedings of the 2011 IEEE 12th International Conference on Mobile Data Management, vol. 01, pp. 7–16. IEEE Computer Society (2011)Google Scholar
  21. 21.
    Wang, S., Maier, D., Ooi, B.C.: Fast and adaptive indexing of multi-dimensional observational data. Proc. VLDB Endowment 9(14), 1683–1694 (2016)CrossRefGoogle Scholar
  22. 22.
    Ma, Y., Rao, J., Hu, W., et al.: An efficient index for massive IOT data in cloud environment. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 2129–2133 (2012)Google Scholar
  23. 23.
    Cai, R., Lu, Z., Wang, L., et al.: DITIR: distributed index for high throughput trajectory insertion and real-time temporal range query. Proc. VLDB Endowment 10(12), 1865–1868 (2017)MathSciNetCrossRefGoogle Scholar

Copyright information

© Springer Nature Switzerland AG 2019

Authors and Affiliations

  • Ting Zhang
    • 1
  • Lianghuai Yang
    • 1
    Email author
  • Donghai Shen
    • 1
  • Yulei Fan
    • 1
  1. 1.School of Computer Science and TechnologyZhejiang University of TechnologyHangzhouChina

Personalised recommendations