A Role-Based Approach to Self-healing in Autonomous Monitoring Systems

  • Włodzimierz Funika
  • Piotr Pȩgiel
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6068)

Abstract

This paper proposes a new role-based approach to self-healing monitoring. The proposal is preceded by an overview of existing approaches to monitoring distributed systems with self-healing features. Starting from a discussion of autonomous monitoring systems, we move on to self-healing systems, which should be able to automatically resolve problems that occur in the system under monitoring. The paper provides insight into various aspects of self-healing monitoring systems at the software and hardware levels. It then gives a detailed description of a new agent-based system, AgeMon, which is built around the roles played by different types of agents. Self-healing can be achieved through the cooperation of agents such as monitoring agents, rule agents, and database agents. The paper discusses these roles and gives the implementation background.

Keywords

Self-healing, monitoring, adaptive, rule-based systems, failure detection

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Włodzimierz Funika (1)
  • Piotr Pȩgiel (1)
  1. Institute of Computer Science, AGH, Kraków, Poland
