Abstract
Monitoring data are collected and stored in a wide range of domains, especially in data centers, which integrate myriad services and massive volumes of data. To handle the challenges brought by the increasing volume of monitoring data, this paper proposes a correlation-based reduction method for streaming data that derives quantitative formulas between correlated indicators and reduces the sampling rate of some indicators by replacing them with formula predictions. The approach also revises the formulas through iterations of the reduction process to find an adaptive solution in the dynamic environments of data centers. One highlight of this work is its ability to operate on the upstream side, i.e., it can reduce the volume of data that monitoring systems need to collect. The approach is tested with both simulated and real data, showing that it is capable of data reduction in complex data centers.
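The idea summarized in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's actual formulas, so everything below is an assumption made for illustration: Pearson correlation as the correlation test, a least-squares line as the "quantitative formula", a refit on recent samples as the "revision" step, and the hypothetical names `pearson`, `fit_linear`, `reduce_indicator` together with their default thresholds.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient; 0.0 if either series is constant."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def fit_linear(x, y):
    """Least-squares line y = slope * x + intercept (assumed formula shape)."""
    mx, my = mean(x), mean(y)
    sxx = sum((a - mx) ** 2 for a in x)
    if sxx == 0:
        return 0.0, my
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sxx
    return slope, my - slope * mx


def reduce_indicator(x, y, train=20, keep_every=5, min_corr=0.9, max_err=0.5):
    """Sample indicator y less often when it is predictable from indicator x.

    Returns (stored, reconstructed): the (index, value) pairs of y that were
    actually collected, and the full series with gaps filled by predictions.
    All parameters are hypothetical tuning knobs, not values from the paper.
    """
    recent = list(zip(x[:train], y[:train]))   # samples available for refits
    stored = list(enumerate(y[:train]))        # training window is kept whole
    reconstructed = list(y[:train])
    use_formula = abs(pearson(x[:train], y[:train])) >= min_corr
    slope, intercept = fit_linear(x[:train], y[:train])

    for i in range(train, len(y)):
        if use_formula and (i - train) % keep_every != 0:
            # Skip collection: replace the sample with the formula prediction.
            reconstructed.append(slope * x[i] + intercept)
            continue
        # Collect a real sample of y at this step.
        stored.append((i, y[i]))
        reconstructed.append(y[i])
        recent = (recent + [(x[i], y[i])])[-train:]
        if use_formula and abs(slope * x[i] + intercept - y[i]) > max_err:
            # Prediction drifted: revise the formula on recent real samples.
            xs, ys = zip(*recent)
            slope, intercept = fit_linear(xs, ys)
    return stored, reconstructed
```

With these defaults, a perfectly linear pair of 100 samples needs only 36 collected values of `y` (the 20-sample training window plus every fifth sample afterwards), while an uncorrelated pair falls back to collecting every sample.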
Acknowledgements
This work has been partially funded by the Italian Project ITS Italy 2020 under the Technological National Clusters program.
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Peng, X., Pernici, B. (2017). Monitoring Data Reduction in Data Centers: A Correlation-Based Approach. In: Helfert, M., Klein, C., Donnellan, B., Gusikhin, O. (eds) Smart Cities, Green Technologies, and Intelligent Transport Systems. VEHITS 2016, SMARTGREENS 2016. Communications in Computer and Information Science, vol 738. Springer, Cham. https://doi.org/10.1007/978-3-319-63712-9_8
DOI: https://doi.org/10.1007/978-3-319-63712-9_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-63711-2
Online ISBN: 978-3-319-63712-9
eBook Packages: Computer Science, Computer Science (R0)