Abstract
This paper investigates an approach to improving the scalability and feasibility of probabilistic fault localization in communication systems by exploiting the domain semantics of computer networks. The proposed technique divides the computational effort and system knowledge among multiple, hierarchically organized managers. Each manager performs fault localization in the domain it manages and requires only the knowledge of its own domain. Since failures propagate among domains, domain managers cooperate with each other to find a consensus explanation of the observed disorder. We show through simulation that the proposed approach increases the effectiveness of probabilistic diagnosis and makes it feasible in networks of considerable size 1.
Chapter PDF
Similar content being viewed by others
References
M. Steinder and A. S. Sethi, “Non-deterministic fault localization in communication systems using belief networks,” IEEE/ACM Transactions on Networking, 2004, (to appear).
M. Steinder and A. S. Sethi, “Non-deterministic event-driven fault diagnosis through incremental hypothesis updating,” in Integrated Network Management VIII, G. Goldszmidt and J. Schoenwaelder, Eds., Colorado Springs, CO, Mar. 2003.
G. Jakobson and M. D. Weissman, “Alarm correlation,” IEEE Network, vol. 7, no. 6, pp. 52–59, Nov. 1993.
I. Katzela and M. Schwartz, “Schemes for fault identification in communication networks,” IEEE/ACM Transactions on Networking, vol. 3, no. 6, pp. 733–764, 1995.
S. A. Yemini, S. Kliger, E. Mozes, Y. Yemini, and D. Ohsie, “High speed and robust event correlation,” IEEE Communications Magazine, vol. 34, no. 5, pp. 82–90, 1996.
J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, Morgan Kaufmann Publishers, 1988.
A. T. Bouloutas, S. B. Calo, A. Finkel, and I. Katzela, “Distributed fault identification in telecommunication networks,” J. of Network and Systems Management, vol. 3, no. 3, 1995.
P. Hasselmeyer, “An infrastructure for the management of dynamic service networks,” IEEE Communications Magazine, vol. 41, no. 4, pp. 120–126, 2003.
M. Steinder and A. S. Sethi, “Multi-domain diagnosis of end-to-end service failures in hierarchically routed networks,” Tech. Rep. 2003–10, CIS, University of Delaware, 2003.
W. Aiello, F. Chung, and L. Lu, “A random graph model for massive graphs,” in ACM Symposium on Theory of Computing, Portland, OR, May 2000, pp. 171–180.
H.A. Barabasi and R. Albert, “Emergence of scaling in random networks,” Science, pp. 509–512, Oct. 1999.
K. Calvert, M. Doar, and E. Zegura, “Modeling Internet topology,” IEEE Transactions on Communications, pp. 160–163, Dec. 1997.
C. Jin, Q. Chen, and S. Jamin, “Inet: Internet topology generator,” Tech. Rep. CSE-TR443–00, EECS, University of Michigan, 2000.
A. Medina, A. Lakhina, I. Matta, and J. Byers, “BRITE:universal topology generation from a user’s perspective” Tech. Rep. BUCS-TR-2001–003, CS Dept, Boston Univ., 2001.
T. Bu and D. Towsley, “On distringuishing between Internet power law topology generators,” in Proc. of IEEE INFOCOM, New York, NY, Jun. 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Steinder, M., Sethi, A.S. (2004). Multi-domain Diagnosis of End-to-End Service Failures in Hierarchically Routed Networks. In: Mitrou, N., Kontovasilis, K., Rouskas, G.N., Iliadis, I., Merakos, L. (eds) Networking 2004. NETWORKING 2004. Lecture Notes in Computer Science, vol 3042. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24693-0_85
Download citation
DOI: https://doi.org/10.1007/978-3-540-24693-0_85
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21959-0
Online ISBN: 978-3-540-24693-0
eBook Packages: Springer Book Archive