PPAM 2009: Parallel Processing and Applied Mathematics, pp 125-134
A Role-Based Approach to Self-healing in Autonomous Monitoring Systems
Abstract
This paper proposes a new role-based approach to self-healing monitoring. It first reviews existing approaches to monitoring distributed systems that use self-healing features. Starting from autonomous monitoring systems, the discussion moves on to self-healing systems, which should be able to automatically resolve problems that occur in the system under monitoring. The paper gives insight into various aspects of self-healing monitoring systems at the software and hardware levels. It then describes in detail a new agent-based system, AgeMon, which is built around the roles played by different types of agents. Self-healing can be achieved through cooperation among agents, e.g. monitoring agents, rule agents, and database agents. The paper discusses these roles and gives the implementation background.
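As a rough illustration of the cooperation between roles mentioned in the abstract, the following minimal sketch shows three agents passing a measurement along. It is not AgeMon's actual design: the abstract does not specify any interfaces, so every class, method, metric name, and threshold below is a hypothetical choice made for illustration only.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Measurement:
    """One observation of a monitored resource (hypothetical structure)."""
    resource: str
    metric: str
    value: float


@dataclass
class Rule:
    """Condition/action pair; metric names and thresholds are illustrative."""
    metric: str
    threshold: float
    action: str

    def matches(self, m: Measurement) -> bool:
        return m.metric == self.metric and m.value > self.threshold


class DatabaseAgent:
    """Role: keep measurements and chosen healing actions for later inspection."""

    def __init__(self) -> None:
        self.measurements: List[Measurement] = []
        self.actions: List[str] = []

    def store(self, m: Measurement) -> None:
        self.measurements.append(m)

    def log_action(self, action: str) -> None:
        self.actions.append(action)


class RuleAgent:
    """Role: decide which healing action, if any, a measurement calls for."""

    def __init__(self, rules: List[Rule]) -> None:
        self.rules = rules

    def evaluate(self, m: Measurement) -> Optional[str]:
        # The first matching rule wins; a real system might rank or combine rules.
        for rule in self.rules:
            if rule.matches(m):
                return f"{rule.action}:{m.resource}"
        return None


class MonitoringAgent:
    """Role: observe a resource and hand readings to the cooperating agents."""

    def __init__(self, rule_agent: RuleAgent, db_agent: DatabaseAgent) -> None:
        self.rule_agent = rule_agent
        self.db_agent = db_agent

    def report(self, m: Measurement) -> Optional[str]:
        self.db_agent.store(m)
        action = self.rule_agent.evaluate(m)
        if action is not None:
            self.db_agent.log_action(action)
        return action


if __name__ == "__main__":
    db = DatabaseAgent()
    rules = RuleAgent([Rule("cpu_load", 0.9, "restart_service")])
    monitor = MonitoringAgent(rules, db)
    print(monitor.report(Measurement("node-1", "cpu_load", 0.50)))
    print(monitor.report(Measurement("node-1", "cpu_load", 0.97)))
```

In this sketch each role has a single responsibility, so a rule set or storage backend could be swapped without touching the monitoring code. Whether AgeMon divides the work this way is an assumption, not something the abstract states.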
Keywords
Self-healing monitoring adaptive rule-based systems failure detection