Skip to main content

Online Indexing and Distributed Querying Model-View Sensor Data in the Cloud

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8421))

Included in the following conference series:

Abstract

As various kinds of sensors penetrate our daily life (e.g., sensor networks for environmental monitoring), the efficient management of massive amount of sensor data becomes increasingly important at present. Traditional sensor data management systems based on relational database lack scalability to accommodate large-scale sensor data efficiently. Consequently, distributed key-value stores in the cloud is becoming the prime tool to manage sensor data. Meanwhile, model-view sensor data management stores the sensor data in the form of modelled segments. However, currently there is no index and query optimizations upon the modelled segments in the cloud, which results in full table scan in the worst case. In this paper, we propose an innovative model index for sensor data segments in key-value stores (KVM-index). KVM-index consists of two interval indices on the time and sensor value dimensions respectively, each of which has an in-memory search tree and a secondary list materialized in the key-value store. This composite structure enables to update new incoming sensor data segments with constant network I/O. Second, for time (or value)-range and point queries a MapReduce-based approach is designed to process the discrete predicate-related ranges of KVM-index, thereby eliminating computation and communication overheads incurred by accessing irrelevant parts of the index table in conventional MapReduce programs. Finally, we propose a cost based adaptive strategy for the KVM-index-MapReduce framework to process composite queries. As proved by extensive experiments, our approach outperforms in query response time both MapReduce-based processing of the raw sensor data and multiple alternative approaches of model-view sensor data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Sathe, S., Papaioannou, T.G., Jeung, H., Aberer, K.: A survey of model-based sensor data acquisition and management. In: Managing and Mining Sensor Data, Springer (2013)

    Google Scholar 

  2. Thiagarajan, A., Madden, S.: Querying continuous functions in a database system. In: SIGMOD (2008)

    Google Scholar 

  3. Deshpande, A., Madden, S.: Mauvedb: supporting model-based user views in database systems. In: SIGMOD (2006)

    Google Scholar 

  4. Papaioannou, T.G., Riahi, M., Aberer, K.: Towards online multi-model approximation of time series. In: MDM (2011)

    Google Scholar 

  5. Bhattacharya, A., Meka, A., Singh, A.: Mist: Distributed indexing and querying in sensor networks using statistical models. In: VLDB (2007)

    Google Scholar 

  6. Kriegel, H.-P., Pötke, M., Seidl, T.: Managing intervals efficiently in object-relational databases. In: VLDB (2000)

    Google Scholar 

  7. Elmasri, R., Wuu, G.T.J., Kim, Y.-J.: The time index: An access structure for temporal data. In: VLDB (1990)

    Google Scholar 

  8. Sfakianakis, G., Patlakas, I., Ntarmos, N., Triantafillou, P.: Interval indexing and querying on key-value cloud stores. In: ICDE (2013)

    Google Scholar 

  9. Ang, C.H., Tan, K.P.: The interval b-tree. Inf. Process. Lett. 53 (1995)

    Google Scholar 

  10. Arge, L., Vitter, J.S.: Optimal dynamic interval management in external memory. In: 37th Annual Symposium on Foundations of Computer Science (1996)

    Google Scholar 

  11. Shen, H., Ooi, B.C., Lu, H.: The tp-index: A dynamic and efficient indexing mechanism for temporal databases. In: ICDE, pp. 274–281. IEEE (1994)

    Google Scholar 

  12. Kolovson, C.P., Stonebraker, M.: Segment indexes: dynamic indexing techniques for multi-dimensional interval data. SIGMOD Rec. 20 (1991)

    Google Scholar 

  13. Iu, M.-Y., Zwaenepoel, W.: Hadooptosql: a mapreduce query optimizer. In: EuroSys (2010)

    Google Scholar 

  14. Dittrich, J., Quiané-Ruiz, J.-A., Jindal, A., Kargin, Y., Setty, V., Schad, J.: Hadoop++: making a yellow elephant run like a cheetah (without it even noticing). VLDB Endow. 3 (2010)

    Google Scholar 

  15. Dittrich, J., Quiané-Ruiz, J.-A., Richter, S., Schuh, S., Jindal, A., Schad, J.: Only aggressive elephants are fast elephants. VLDB Endow. 5 (2012)

    Google Scholar 

  16. Eltabakh, M.Y., Özcan, F., Sismanis, Y., Haas, P.J., Pirahesh, H., Vondrak, J.: Eagle-eyed elephant: split-oriented indexing in hadoop. In: EDBT (2013)

    Google Scholar 

  17. Guo, T., Yan, Z., Aberer, K.: An adaptive approach for online segmentation of multi-dimensional mobile data. In: Proc. of MobiDE, SIGMOD Workshop (2012)

    Google Scholar 

  18. Ding, H., Trajcevski, G., Scheuermann, P., Wang, X., Keogh, E.: Querying and mining of time series data: experimental comparison of representations and distance measures. VLDB Endowment 1 (2008)

    Google Scholar 

  19. Tian Guo, T.G.P., Aberer, K.: Model-view sensor data management in the cloud. In: Proceedings of the 2013 IEEE International Conference on BigData. IEEE Computer Society (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Guo, T., Papaioannou, T.G., Zhuang, H., Aberer, K. (2014). Online Indexing and Distributed Querying Model-View Sensor Data in the Cloud. In: Bhowmick, S.S., Dyreson, C.E., Jensen, C.S., Lee, M.L., Muliantara, A., Thalheim, B. (eds) Database Systems for Advanced Applications. DASFAA 2014. Lecture Notes in Computer Science, vol 8421. Springer, Cham. https://doi.org/10.1007/978-3-319-05810-8_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05810-8_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05809-2

  • Online ISBN: 978-3-319-05810-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics