Maintenance of Monitoring Systems Throughout Self-healing Mechanisms

  • Clarissa Cassales Marquezan
  • André Panisson
  • Lisandro Zambenedetti Granville
  • Giorgio Nunzi
  • Marcus Brunner
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5273)


Monitoring is essential in modern network management. However, current monitoring systems cannot recover their own faulty internal entities, forcing the network administrator to manually repair the monitoring solution whenever it breaks. In this paper we address this issue by introducing a self-healing monitoring solution, described in the context of a monitoring system for a Network Access Control (NAC) installation. The proposed solution combines the availability provided by P2P-based overlays with self-healing abilities. We also describe a set of experimental evaluations whose results show the tradeoff between the time required to recover the monitoring infrastructure when failures occur and the bandwidth consumed in the process. Based on these experiments, we show that it is possible to improve availability and robustness with minimal human intervention.



Copyright information

© IFIP International Federation for Information Processing 2008

Authors and Affiliations

  • Clarissa Cassales Marquezan (1, 2)
  • André Panisson (1)
  • Lisandro Zambenedetti Granville (1)
  • Giorgio Nunzi (2)
  • Marcus Brunner (2)

  1. Federal University of Rio Grande do Sul, Porto Alegre, Brazil
  2. NEC Europe Network Laboratories, Heidelberg, Germany
