Abstract
In this paper we propose a monitoring system of a data center that is able to infer when the data center is getting into an anomalous behavior by analyzing the power consumption at each server and the data center network traffic. The monitoring system is non-intrusive in the sense that there is no need to install software on the data center servers. The monitoring architecture embeds two Elman Recurrent Networks (RNNs) to predict power consumed by each data center component starting from data center network traffic and viceversa. Results obtained along six mounts of experiments, within a data center, show that the architecture is able to classify anomalous system behaviors and normal ones by analyzing the error between the actual values of power consumption and network traffic and the ones inferred by the two RNNs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Over s.r.l. website, http://www.overtechnologies.com
Encog Machine Learning Framework (2008), http://www.heatonresearch.com/encog/
Aguilera, M.K., Mogul, J.C., Wiener, J.L., Reynolds, P., Muthitacharoen, A.: Performance debugging for distributed systems of black boxes. SIGOPS Oper. Syst. Rev. 37, 74–89 (2003)
Aniello, L., Baldoni, R., Bonomi, S., Lombardi, F., Zelli, A.: An Architecture for Automatic Scaling of Replicated Services. To appear in the Proceedings of the 2nd International Conference on NETworked sYStems (NETYS), vol. 5 (2014)
Baldoni, R., Caruso, M., Cerocchi, A., Ciccotelli, C., Montanari, L., Nicoletti, L.: Correlating power consumption and network traffic for improving data centers resiliency. ArXiv e-prints (May 2014)
Baldoni, R., Lodi, G., Montanari, L., Mariotta, G., Rizzuto, M.: Online black-box failure prediction for mission critical distributed systems. In: Ortmeier, F., Lipaczewski, M. (eds.) SAFECOMP 2012. LNCS, vol. 7612, pp. 185–197. Springer, Heidelberg (2012)
Frank, R.J., Davey, N., Hunt, S.P.: Time series prediction and neural networks. Journal of Intelligent and Robotic Systems 31(1-3), 91–103 (2001)
Heller, B., Seetharaman, S., Mahadevan, P., Yiakoumis, Y., Sharma, P., Banerjee, S., McKeown, N.: Elastictree: Saving energy in data center networks. In: Proceedings of the 7th USENIX Conference on Networked Systems Design and Implementation, NSDI 2010, p. 17. USENIX Association, Berkeley (2010)
Kazandjieva, M., Heller, B., Levis, P., Kozyrakis, C.: Energy dumpster diving. In: SOSP 2009: Proceedings of the ACM SIGOPS 22nd Symposium on Operating Systems Principles. ACM, New York (2009)
Lefurgy, C., Wang, X., Ware, M.: Power capping: A prelude to power shifting. Cluster Computing 11(2), 183–195 (2008)
Park, D.C., El-Sharkawi, M.A., Marks, R.J., Atlas, L.E., Damborg, M.J., et al.: Electric load forecasting using an artificial neural network. IEEE Transactions on Power Systems 6(2), 442–449 (1991)
Riedmiller, M., Braun, H.: A direct adaptive method for faster backpropagation learning: The rprop algorithm. In: IEEE International Conference on Neural Networks, pp. 586–591. IEEE (1993)
Senjyu, T., Takara, H., Uezato, K., Funabashi, T.: One-hour-ahead load forecasting using neural network. IEEE Transactions on Power Systems 17(1), 113–118 (2002)
Tan, Y., Gu, X., Wang, H.: Adaptive system anomaly prediction for large-scale hosting infrastructures. In: Proc. of ACM PODC 2010, pp. 173–182. ACM, New York (2010)
Wang, X., Yao, Y., Wang, X., Lu, K., Cao, Q.: Carpo: Correlation-aware power optimization in data center networks. In: 2012 Proceedings IEEE INFOCOM, pp. 1125–1133 (March 2012)
Wang, X., Chen, M.: Cluster-level feedback power control for performance optimization. In: IEEE 14th International Symposium on High Performance Computer Architecture, HPCA 2008, pp. 101–110 (February 2008)
Wang, X., Wang, Y.: Co-con: Coordinated control of power and application performance for virtualized server clusters. In: 17th International Workshop on Quality of Service, IWQoS, pp. 1–9 (July 2009)
Williams, A.W., Pertet, S.M., Narasimhan, P.: Tiresias: Black-box failure prediction in distributed systems. In: Proceedings of IEEE International Parallel and Distributed Processing Symposium (IPDPS 2007), Los Alamitos, CA, USA (2007)
Zhang, G., Patuwo, B.E., Hu, M.Y.: Forecasting With Artificial Neural Networks: the State of the Art. International Journal of Forecasting 14(1), 35–62 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Baldoni, R., Cerocchi, A., Ciccotelli, C., Donno, A., Lombardi, F., Montanari, L. (2014). Towards a Non-intrusive Recognition of Anomalous System Behavior in Data Centers. In: Bondavalli, A., Ceccarelli, A., Ortmeier, F. (eds) Computer Safety, Reliability, and Security. SAFECOMP 2014. Lecture Notes in Computer Science, vol 8696. Springer, Cham. https://doi.org/10.1007/978-3-319-10557-4_38
Download citation
DOI: https://doi.org/10.1007/978-3-319-10557-4_38
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10556-7
Online ISBN: 978-3-319-10557-4
eBook Packages: Computer ScienceComputer Science (R0)