Skip to main content

Monitoring Data Reduction in Data Centers: A Correlation-Based Approach

  • Conference paper
  • First Online:
Smart Cities, Green Technologies, and Intelligent Transport Systems (VEHITS 2016, SMARTGREENS 2016)

Abstract

Monitoring data are collected and stored in a wide range of domains, especially in data centers, which integrate myriads of services and massive data. To handle the inevitable challenges brought by increasing volume of monitoring data, this paper proposes a correlation-based reduction method for streaming data that derives quantitative formulas between correlated indicators, and reduces the sampling rate of some indicators by replacing them with formulas predictions. This approach also revises formulas through iterations of the reduction process to find an adaptive solution in dynamic environments of data centers. One highlight of this work is the ability to work on upstream side, i.e., it can reduce volume requirements for data collection of monitoring systems. This work also tests the approach with both simulated and real data, showing that our approach is capable of data reduction in complex data centers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Cappiello, C., Ho, T.T.N., Pernici, B., Plebani, P., Vitali, M.: \({\rm Co}_{2}\)-aware adaptation strategies for cloud applications. IEEE Trans. Cloud Comput. 4(2), 152–165 (2016). doi:10.1109/TCC.2015.2464796

    Article  Google Scholar 

  2. Carvalho, C., Gomes, D.G., Agoulmine, N., De Souza, J.N.: Improving prediction accuracy for wsn data reduction by applying multivariate spatio-temporal correlation. Sensors 11(11), 10010–10037 (2011)

    Article  Google Scholar 

  3. Ding, R., Wang, Q., Dang, Y., Fu, Q., Zhang, H., Zhang, D.: Yading: fast clustering of large-scale time series data. Proc. VLDB Endowment 8(5), 473–484 (2015)

    Article  Google Scholar 

  4. Esling, P., Agon, C.: Time-series data mining. ACM Comput. Surv. 45(1), 12 (2012)

    Article  MATH  Google Scholar 

  5. Hayashi, F.: Econometrics. Princeton University Press, Princeton (2000). http://gso.gbv.de/DB=2.1/CMD?ACT=SRCHA&SRT=YOP&IKT=1016&TRM=ppn+313736715&sourceid=fbwbibsonomy

    MATH  Google Scholar 

  6. Jolliffe, I.: Principal component analysis. Wiley Online Library (2002)

    Google Scholar 

  7. Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Dimensionality reduction for fast similarity search in large time series databases. Knowl. Inf. Syst. 3(3), 263–286 (2001)

    Article  MATH  Google Scholar 

  8. Kung, H., Lin, C.K., Vlah, D.: Cloudsense: Continuous fine-grain cloud monitoring with compressive sensing. In: HotCloud (2011)

    Google Scholar 

  9. Peng, X.: Data reduction in monitored data. In: Loucopoulos, P., Nurcan, S., Weigand, H. (eds.) Proceedings of the CAiSE’2015 Doctoral Consortium at the 27th International Conference on Advanced Information Systems Engineering (CAiSE 2015), Stockholm, Sweden. CEUR Workshop Proceedings, vol. 1415, pp. 39–46, 11–12 June 2015. CEUR-WS.org (2015)

    Google Scholar 

  10. Peng, X., Pernici, B.: Correlation-model-based reduction of monitoring data in data centers. In: Proceedings of the 5th International Conference on Smart Cities and Green ICT Systems, pp. 395–405 (2016)

    Google Scholar 

  11. Reeves, G., Liu, J., Nath, S., Zhao, F.: Managing massive time series streams with multi-scale compressed trickles. Proc. VLDB Endowment 2(1), 97–108 (2009)

    Article  Google Scholar 

  12. Tsamardinos, I., Brown, L.E., Aliferis, C.F.: The max-min hill-climbing bayesian network structure learning algorithm. Mach. Learn. 65(1), 31–78 (2006)

    Article  Google Scholar 

  13. Vitali, M., O’Reilly, U.M., Veeramachaneni, K.: Modeling service execution on data centers for energy efficiency and quality of service monitoring. In: 2013 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 103–108. IEEE (2013)

    Google Scholar 

  14. Vitali, M., Pernici, B., OReilly, U.M.: Learning a goal-oriented model for energy efficient adaptive applications in data centers. Inf. Sci. 319, 152–170 (2015)

    Article  Google Scholar 

  15. Wajid, U., Cappiello, C., Plebani, P., Pernici, B., Mehandjiev, N., Vitali, M., Gienger, M., Kavoussanakis, K., Margery, D., García-Pérez, D., Sampaio, P.: On achieving energy efficiency and reducing co\({}_{\text{2 }}\) footprint in cloud computing. IEEE Trans. Cloud Comput. 4(2), 138–151 (2016). doi:10.1109/TCC.2015.2453988

    Article  Google Scholar 

  16. Zhou, S., Lin, K.J., Na, J., Chuang, C.C., Shih, C.S.: Supporting service adaptation in fault tolerant internet of things. In: 2015 IEEE 8th International Conference on Service-Oriented Computing and Applications (SOCA), pp. 65–72 (2015)

    Google Scholar 

Download references

Acknowledgements

This work has been partially funded by the Italian Project ITS Italy 2020 under the Technological National Clusters program.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xuesong Peng .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Peng, X., Pernici, B. (2017). Monitoring Data Reduction in Data Centers: A Correlation-Based Approach. In: Helfert, M., Klein, C., Donnellan, B., Gusikhin, O. (eds) Smart Cities, Green Technologies, and Intelligent Transport Systems. VEHITS SMARTGREENS 2016 2016. Communications in Computer and Information Science, vol 738. Springer, Cham. https://doi.org/10.1007/978-3-319-63712-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-63712-9_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-63711-2

  • Online ISBN: 978-3-319-63712-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics