Abstract
Cloud monitoring activity involves dynamically tracking the Quality of Service (QoS) parameters related to virtualized resources (e.g., VM, storage, network, appliances, etc.), the physical resources they share, the applications running on them and data hosted on them. Applications and resources configuration in cloud computing environment is quite challenging considering a large number of heterogeneous cloud resources. Further, considering the fact that at given point of time, there may be need to change cloud resource configuration (number of VMs, types of VMs, number of appliance instances, etc.) for meet application QoS requirements under uncertainties (resource failure, resource overload, workload spike, etc.). Hence, cloud monitoring tools can assist a cloud providers or application developers in: (i) keeping their resources and applications operating at peak efficiency, (ii) detecting variations in resource and application performance, (iii) accounting the service level agreement violations of certain QoS parameters, and (iv) tracking the leave and join operations of cloud resources due to failures and other dynamic configuration changes. In this paper, we identify and discuss the major research dimensions and design issues related to engineering cloud monitoring tools. We further discuss how the aforementioned research dimensions and design issues are handled by current academic research as well as by commercial monitoring tools.
This is a preview of subscription content,
to check access.








Similar content being viewed by others
References
Mell P, Grance T (2011) The NIST definition of cloud computing (draft). NIST Spec Publ 800:145
Letaifa A, Haji A, Jebalia M, Tabbane S (2010) State of the art and research challenges of new services architecture technologies: virtualization, SOA and cloud computing. Int J Grid Distrib Comput 3
Cong C, Liu J, Zhang Q, Chen H, Cong Z (2010) The characteristics of cloud computing. In: 39th international conference on parallel processing workshops (ICPPW), pp 275–279
Zhang S, Zhang S, Chen X, Huo X (2010) Cloud computing research and development trend. In: 2nd international conference on future networks, ICFN’10, pp 93–97
Ahmed M, Chowdhury ASMR, Ahmed M, Rafee MMH (2012) An advanced survey on cloud computing and state-of-the-art research issues. Int J Comput Sci Issues (IJCSI) 9
Atzori L, Granelli F, Pescapè A (2011) A network-oriented survey and open issues in cloud computing
Shin S, Gu G (2012) CloudWatcher: network security monitoring using openflow in dynamic cloud networks (or: How to provide security monitoring as a service in clouds?). In: 2012 20th IEEE international conference on network protocols (ICNP), pp 1–6
De Chaves SA, Uriarte RB, Westphall CB (2011) Toward an architecture for monitoring private clouds. IEEE Commun Mag 49:130–137
Grobauer B, Walloschek T, Stocker E (2011) Understanding cloud computing vulnerabilities. In: IEEE security and privacy, vol 9, pp 50–57
Moses J, Iyer R, Illikkal R, Srinivasan S, Aisopos K (2011) Shared resource monitoring and throughput optimization in cloud-computing datacenters. In: 2011 IEEE international parallel and distributed processing symposium (IPDPS), pp 1024–1033
Wang L, Kunze M, Tao J, von Laszewski G (2011) Towards building a cloud for scientific applications. Adv Eng Softw 42(9):714–722
Wang L, Chen D, Ma Y, Wang J (2013) Towards enabling cyberinfrastructure as a service in clouds. Comput Electr Eng 39(1):3–14
Wang L, von Laszewski G, Younge AJ, He X, Kunze M, Tao J (2010) Cloud computing: a perspective study. New Gener Comput 28(2):137–146
Begoli E, Horey J (2012) Design principles for effective knowledge discovery from big data. In: Joint working IEEE/IFIP conference on software architecture (WICSA) and European conference on software architecture (ECSA), pp 215–218
Bryant R, Katz RH, Lazowska ED (2008) Big-data computing: creating revolutionary breakthroughs in commerce, science and society
Labrinidis A, Jagadish H (2012) Challenges and opportunities with big data. In: Proceedings of the VLDB endowment, vol 5, pp 2032–2033
Ma Y, Wang L, Liu D, Yuan T, Liu P, Zhang W (2013) Distributed data structure templates for data-intensive remote sensing applications. Concurr Comput Pract Exp 25(12):1784–1797
Zhang W, Wang L, Liu D, Song W, Ma Y, Liu P, Chen Dan (2013) Towards building a multi-datacenter infrastructure for massive remote sensing image processing. Concurr Comput Pract Exp 25(12):1798–1812
Zhang W, Wang L, Ma Y, Liu D (2013) Design and implementation of task scheduling strategies for massive remote sensing data processing across multiple data centers. Pract Exp Softw. doi:10.1002/spe.2229
Twitter and Natural Disasters (2011) Crisis communication lessons from the Japan tsunami. http://www.sciencedaily.com/releases/2011/04/110415154734.htm. Accessed 22 Feb 2014
Nita M-C, Chilipirea C, Dobre C, Pop F (2013) A SLA-based method for big-data transfers with multi-criteria optimization constraints for IaaS. In: 2013 11th roedunet international conference (RoEduNet), pp 1, 6
Zhao M, Figueiredo RJ (2007) Experimental study of virtual machine migration in support of reservation of cluster resources. In: Proceedings of the 2nd international workshop on virtualization technology in distributed computing, p 5
Wang L, Chen D, Zhao J, Tao J (2012) Resource management of distributed virtual machines. IJAHUC 10(2):96–111
Calheiros RN, Ranjan R, Buyya R (2011) Virtual machine provisioning based on analytical performance and qos in cloud computing environments. In: International conference on parallel processing (ICPP), pp 295–304
Kirschnick J, Calero A, Edwards N (2010) Toward an architecture for the automated provisioning of cloud services. IEEE Commun Mag 48:124–131
Ranjan R, Zhao L, Wu X, Liu A, Quiroz A, Parashar M (2010) Peer-to-peer cloud provisioning: service discovery and load-balancing. In: Cloud computing, Springer, pp 195–217
Liu X, Yang Y, Yuan D, Zhang G, Li W, Cao D (2011) A generic QoS framework for cloud workflow systems. In: 2011 IEEE ninth international conference on dependable, autonomic and secure computing (DASC), pp 713–720
Ranjan R, Benatallah B Programming cloud resource orchestration framework: operations and research challenges. In: Technical report. http://arxiv.org/abs/1204.2204. Accessed 22 Feb 2014
Aceto G, Botta A, de Donato W, Pescapè A (2013) Cloud monitoring: a survey. Comput Netw 57:2093–2115
Shao J, Wei H, Wang Q, Mei H (2010) A runtime model based monitoring approach for cloud. In: 2010 IEEE 3rd international conference on cloud computing (CLOUD), pp 313–320
Caron E, Rodero-Merino L, Desprez F, Muresan A (2012) Auto-scaling, load balancing and monitoring in commercial and open-source clouds
Spring J (2011) Monitoring cloud computing by layer, part 1. IEEE Secur Priv 9:66–68
Anand M (2012) Cloud monitor: monitoring applications in cloud. In: Cloud computing in emerging markets (CCEM), 2012 IEEE international conference on communication, networking and broadcasting, pp 1–4
Kutare M, Eisenhauer G, Wang C, Schwan K, Talwar V, Wolf M (2010) Monalytics: online monitoring and analytics for managing large scale data centers. In: Proceedings of the 7th international conference on autonomic computing, pp 141–150
Sundaresan S, de Donato W, Feamster N, Teixeira R, Crawford S, Pescape A (2011) Broadband internet performance: a view from the gateway. In: ACM SIGCOMM computer communication review, pp 134–145
Massonet P, Naqvi S, Ponsard C, Latanicki J, Rochwerger B, Villari M (2011) A monitoring and audit logging architecture for data location compliance in federated cloud infrastructures. In: IEEE international symposium on parallel and distributed processing workshops and PhD forum (IPDPSW), pp 1510–1517
Davis C, Neville S, Fernandez J, Robert J-M, Mchugh J (2008) Structured peer-to-peer overlay networks: ideal botnets command and control infrastructures? In: Computer security—ESORICS 2008, pp 461–480
Monitis (2014) http://portal.monitis.com/. Accessed 22 Feb 2014
RevealCloud (2014) http://copperegg.com/. Accessed 22 Feb 2014
RevealCloud (2014) http://sandhill.com/article. Accessed 22 Feb 2014
LogicMonitor (2014) http://www.logicmonitor.com/why-logicmonitor. Accessed 22 Feb 2014
Nimsoft (2014) http://www.nimsoft.com/solutions/nimsoft-monitor/cloud. Accessed 22 Feb 2014
Nagios (2014) http://www.nagios.com. Accessed 22 Feb 2014
SPAE (2014) http://shalb.com/en/spae/spae_features/. Accessed 22 Feb 2014
SPAE (2014) http://www.rackaid.com/resources/server-monitoring-cloud. Accessed 22 Feb 2014
CloudWatch (2014) http://awsdocs.s3.amazonaws.com/AmazonCloudWatch/latest/acw-dg.pdf. Accessed 22 Feb 2014
OpenNebula (2014) http://opennebula.org/documentation:rel4.0. Accessed 22 Feb 2014
Cloudharmony (2014) http://cloudharmony.com/. Accessed 22 Feb 2014
Azure FC (2014) http://www.techopedia.com/definition/26433/azure-fabric-controller. Accessed 22 Feb 2014
Azure FC (2014) http://snarfed.org/windows_azure_details#Configuration_and_APIs. Accessed 22 Feb 2014
Nathuji R, Kansal A, Ghaffarkhah A (2010) Q-clouds: managing performance interference effects for QoS-aware clouds. In: Proceedings of the 5th European conference on computer systems, pp 237–250
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Alhamazani, K., Ranjan, R., Mitra, K. et al. An overview of the commercial cloud monitoring tools: research dimensions, design issues, and state-of-the-art. Computing 97, 357–377 (2015). https://doi.org/10.1007/s00607-014-0398-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00607-014-0398-5
Keywords
- Cloud monitoring
- Cloud application monitoring
- Cloud resource monitoring
- Cloud application provisioning
- Cloud monitoring metrics
- Quality of service parameters
- Service level agreement