Abstract
New requirements of service-oriented fault management are analyzed and a framework MDFM (Multi-Domain Fault Manager) is proposed in this paper to solve the service fault localization problem in multi-domain context. Different from current solutions, our approach decomposes SLS (Service Level Specification) based on network capability, and monitor service performance in each domain along the end-to-end path. As a result, MDFM can localize the approximate domain rapidly on which the root cause resides, therefore causative region is narrowed down and computation cost for fault analysis is reduced. Faults on both server and client sides are considered in MDFM. A prototype has been implemented to prove the feasibility and efficiency of our service fault management framework.
This work was supported by the National Basic Research Program of China (Grant No. 2003CB314806 and 2006CB701306), the National Natural Science Foundation of China (No. 90204003 and 60472067) and the National 863 Program of China (No.2003AA121220).
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Yemini, S.A., Kliger, S., Mozes, E., Yemini, Y., Ohsie, D.: High speed and robust event correlation. Communications Magazine, IEEE 34(5), 82–90 (1996)
Steinder, M., Sethi, A.S.: The present and future of event correlation: A need for end-to-end service fault localization. In: World Multi-Conf. Systemics, Cybernetics, and Informatics (SCI), Orlando, FL (2001)
Steinder, M., Sethi, A.S.: Probabilistic Fault Localization in Communication Systems Using Belief Networks. IEEE/ACM Transactions on Networking 12(5) (October 2004)
Steinder, M., Sethi, A.S.: Multi-Domain Diagnosis of End-to-End Service Failures in Hierarchically Routed Networks. In: Mitrou, N.M., Kontovasilis, K., Rouskas, G.N., Iliadis, I., Merakos, L. (eds.) NETWORKING 2004. LNCS, vol. 3042, pp. 1036–1046. Springer, Heidelberg (2004)
Hanemann, A., Sailer, M., Schmitz, D.: Assured Service Quality by Improved Fault Management - Service-Oriented Event Correlation. In: Proceedings of the 2nd international conference on Service oriented computing (November 2004)
Kong, Q., Chen, G., Hussain, R.Y.: A Management Framework for Internet Services, Network Operations and Management Symposium. In: NOMS 1998, February 15-20, vol. 1, pp. 21–30. IEEE, Los Alamitos (1998)
Darst, C., Ramanathan, S.: Measurement and Management of Internet Services, Integrated Network Management. In: Proceedings of the Sixth IFIP/IEEE International Symposium on Distributed Management for the Networked Millennium, May 24-28, pp. 125–140 (1999)
Caswell, D., Ramanathan, S.: Using Service Models for Management of Internet Services. IEEE Journal on Selected Areas in Communications 18(5), 686–701 (2000)
Bronstein, A., Das, J., et al.: Self-Aware Services: Using Bayesian Networks for Detecting Anomalies in Internet-based Services, Technical Report HPL-2001-23 (R.1), HP Laboratories Palo Alto (2001), www.hpl.hp.com/techreports/2001/HPL-2001-23R1.ps
IBM Redbook, Business Service Management Best Practices, http://IBM.com/redbooks
Hauck, R., Radisic, I.: Service Oriented Application Management-Do Current Techniques Meet The Requirements? In: New Developments in Distributed Applications and Interoperable Systems: 3rd IFIP International Working Conference (DAIS 2001) (2001)
Nichols, K., Carpenter, B.: Definition of Differentiated Services Per Domain Behaviors and Rules for their Specification, RFC 3086 (April 2001)
Huang, X., Lin, Y., Wang, W., Que, X., Cheng, S., Jiao, L., Cui, Y.: QoSjava: An Open and Scalable Architecture Decoupling QoS Requirements from QoS Techniques, draft-bupt-qosjava-arch-02.txt, http://www.ietf.org/internet-drafts/draft-bupt-qosjava-arch-02.txt
Huang, X., Lin, Y., Wang, W., Cheng, S.: PDB-Based SLS Decomposition in Heterogeneous IP Network. In: Proceedings of 2004 IEEE International Workshop on IP Operations & Management (2004)
Xiao, J., Cui, Y., Wang, W., Cheng, S.: A Service Level Specification (SLS) Monitoring System in Multiple Services IP Network, High technology Letters, ISSN 1002-0470, published by Executive Office of the Journal, Institute of Scientific and Technical Information of China (to appear)
Iperf, University of Illinois, http://dast.nlanr.net/Projects/Iperf/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 IFIP International Federation for Information Processing
About this paper
Cite this paper
Huang, X., Zou, S., Wang, W., Cheng, S. (2005). MDFM: Multi-domain Fault Management for Internet Services. In: Dalmau Royo, J., Hasegawa, G. (eds) Management of Multimedia Networks and Services. MMNS 2005. Lecture Notes in Computer Science, vol 3754. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11572831_11
Download citation
DOI: https://doi.org/10.1007/11572831_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29641-6
Online ISBN: 978-3-540-32090-6
eBook Packages: Computer ScienceComputer Science (R0)