Abstract
The dependency of society and business on telecommunication services is commonly recognized, but is this conception taken into account in our effort towards smarter and more autonomous networks? The objective of the paper is to discuss some dependability issues in the context of these networks and to pinpoint some challenges. As an introduction, a brief review of dependability concepts is given. Next the following issues are discussed: a) strategies for providing a survivable transport network; b) fault-tolerant network nodes vs. fault-tolerant functionality on a distributed platform; c) software faults and their consequences like error propagation and network wide failure modes.
The updated original online version for this book can be found at DOI: 10.1007/978-0-387-35581-8_35
Chapter PDF
Similar content being viewed by others
Key words
References
Finn Arve Aagesen, Bjarne E. Helvik, Vilas Wuwongse, Hein Meling, Rolv Bræk and Ulrik Johansen, “Towards a Plug and Play Architecture for Telecommunications”, This Issue.
Syed R. Ali, “Analysis of Total Outage Data for Stored Program Control Switching Systems”, IEEE Journ. on Selected Areas in Communications, Vol. SAC-4, No. 7, pp. 1044–1046, Oct. 1986.
Syed R. Ali, “Digital Switching Systems; System Reliability and Analysis” McGraw-Hill, 1997.
Beyltjens, van Houldt, “System 12, Switching System Maintenance”, Electrical Communication, Vol. 59, No. 1 /2, pp. 80–88, 1985.
Kenneth P. Birman, Robbed van Renesse (eds.), “Reliable Distributed computing with the Isis Toolkit”, IEEE Computer Society Press, 1994.
W.C. Carter, “A time for reflection”, Proc. 12th International Symposium on Fault-Tolerant Computing“ Santa Monica California (FTCS-12), p. 41, June 1982.
Brian A. Coan, Will E. Leland, Mario P. Vecchi, Abel Weinrib, Liang T. Wu, “Using Distributed Topology Update and Preplanned Configurations to Achieve Trunk Network Survivability”, IEEE Transactions on Reliability, Vol. 40, No. 4, pp. 404–416, October 1991.
R.W. Downing, J..S. Novak, L.S. Tuomenoksa, “No. 1 ESS Maintenance Plan”, The Bell System Technical Journal, Vol. 43, No. 5, Part 1, pp. 1961–2019, Sept. 1964.
Karen Fitzgerald, “Vulnerability exposed in AT&T’s 9-hour glitch”, The Institute, Vol. 114, No. 3, Pages 1 and 6, March 1990.
Jim Gray, “A Census of Tandem System Availability between 1985 and 1990”, IEEE Trans. on Reliability, Vol. 39, No. 4, pp. 409–418, Oct. 1990.
Bjarne E. Helvik, Anders Rygh Swensen, “Modelling of Clustering Effects In Point Processes. An Application to Failures in SPC-Systems”, Scandinavian Journal of Statistics, Vol. 14, pp. 57–66, 1987.
Bjarne E. Helvik: “Modelling the Influence of Unreliable Software in Distributed Computer Systems”, Proc. 18th International Symposium on Fault-Tolerant Computing (FTCS18), pp. 136–141, June 1988.
Bjarne E. Helvik, “The Error Propagation Phenomenon; An introduction”, Telektronikk, Vol. 93, No. 1, pp. 109–117, 1997
Bjarne E. Helvik, Sven Arne Gylterud, “Identification of Operational Modes of Distributed Systems by Clustering Analysis”, Telektronikk, Vol. 93, No. 1, pp. 118–127, 1997.
Svein-Olaf Hvasshovd, Oystein Torbjornsen, Svein Erik Bratsberg, Per Holager, “The ClustRa Telecom Database: High Availability, High Throughput, and Real-Time Response”, Proceedings of 21th International Conference on Very Large Data Bases (VLDB’95), pp. 469–477, September 11–15, 1995, Zurich, Switzerland.
ITU-T, “Terms and definitions related to quality of service and network performance including dependability”, Rec. E. 800, August 1994.
Pankaj Jalote, “Fault Tolerance in Distributed Systems”, Prentice Hall, 1994.
Barry W. Johnson, “Design and Analysis of Fault Tolerant Digital Systems”, Addison-Wesley, 1989.
K.R. Krishnan, R.D. Doverspike, C. D. Pack, “Unified Models of Survivability for Multi-Technology Networks”, Proc. ITC-14 (eds: Labetoulle and Roberts), pp. 655–666, Antibes Juan-les-Pins, France, Elsevier, June 1994.
K.R. Krishnan, R.D. Doverspike, C. D. Pack, “Improved Survivability with Multi-layer Dynamic Routing”, IEEE Communication Magazine, Vol. 33, No. 7, pp. 62–680, July 1995.
D. Richard Kuhn, “Sources of Failure in the Public Switched Telephone Network”, Computer, Vol. 30, No. 4, pp. 31–36, April 1997.
Craig Labovitz, Abha Ahuja, Farnam Jahanian, “Experimental Study of Internet Stability and Backbone Failures”, Proc. 29th International Symposium on Fault-Tolerant Computing (FTCS-29), pp. 278–285, June 1999.
L. Lamport, “Time, Clocks and the Ordering of Events in Distributed Systems”, Communication of the ACM, Vol. 21, No. 7, pp. 558–565, July 1978.
Jean-Claude Laprie (Ed), “Dependability: Basic Concepts and Associated Terminology”, Dependable Computing and Fault Tolerant Systems. Vol-5; Springer, 1992.
P.A.W. Lewis, “A Branching Poisson Process Model for the Analysis of Computer Failure Patterns”, Journ. of The Royal Statis. Soc. Ser. B, Vol. 26, pp. 493–503, 1964.
Michael R. Lyu (ed.), “Handbook of Software Reliability Engineering”, McGraw-Hill/IEEE Comp. Soc. Press, 1996.
Silvano Maffeis, Douglas C. Smidt, “Construction of Reliable Distributed Communication Systems with CORBA”, IEEE Communication Magazine, Vol. 35, No. 2, pp. 56–60, Feb. 1997.
John J. Metzner, “Reliable Data Communications”, Academic Press, 1997.
L. Nederlof et. al. “End-to-end Survivable Broadband Networks” IEEE Communication Magazine, Vol. 33, No. 9, pp. 63–70, Sept. 1995.
Louise E. Moser, P. M. Melliar-Smith, Deborah A. Agarwal, Ravi K. Budhia, Colleen A. Lingley-Papadopoulos, “Totem: A fault-Tolerant Multicast Group Communication System”, Communications of the ACM, Vol. 39, No. 4, pp. 54–63, April 1996.
Louise E. Moser, Roger J. Martin, “Fault Tolerance for CORBA” (Joint initial fault tolerance RFP submission by Eternal Systems and Sun Microsystems), ftp://ftp.omg.org/pub/docs/orbos/98–10–08, October 19, 1998.
Bengt Ossfelt, Ingmar Jonsson, “Recovery and Diagnostics in the Central Control of the AXE Switching System”, IEEE Trans. on Comp., pp. 482–491, June 1980.
Vern Paxson, “End-to-End Routing Behaviour in the Internet, IEEE/ACM Transactions on Networking, Vol. 5, No. 5, pp. 601–615, Oct. 1997.
David Powell, “Distributed Fault Tolerance: Lessons from Delta-4”, IEEE Micro, February 1994, pp. 36–47.
David Powell (Guest Editor), “Group communication”, Communications of the ACM, Vol. 39, No. 4, pp. 50–97, April 1996.
Robbert van Renesse, Kenneth P. Birman, Silvano Maffeis, “Horns: A flexible Group Communication System”, Communications of the ACM, Vol. 39, No. 4, pp. 76–83, April 1996.
Daniel P. Siewiorek, Robert S. Swarz, “Reliable Computer Systems; Design and Evaluation”, Digital Press, 2nd edition, 1992.
Chris Smith, “Fault Tolerant CORBA” (Fault tolerance joint initial submission by Ericsson, IONA, and Nortel supported by Alcatel), ftp://ftp.omg.org/pub/docs/orbos/98–10–10, October 20, 1998.
Jeffrey R. Spirn, “Fault Tolerance RFP: Initial Submission” (Oracle), ftp://ftp.omg.org/pub/docs/orbos/98–10–13, October 20, 1998.
Paul Veitch, Dave Johnson, “ATM network Resilience”, IEEE Network, Vol. 11, No. 5, pp. 26–33, Sept./Oct. 1997.
Shalini Yajnik (ed.), “Fault Tolerant CORBA using Entity Redundancy” (Joint initial fault tolerance RFP submission: Highlander Communications, L.C., Inprise Corporation, Lockheed Martin Corporation, Lucent Technologies, TIBCO Inc., and supported by Academia Sinica, Taiwan), ftp://ftp.omg.org/pub/docs/orbos/98–10–09, October 20, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 IFIP International Federation for Information Processing
About this chapter
Cite this chapter
Helvik, B.E. (2000). Dependability Issues in Smart Networks. In: Yongchareon, T., Aagesen, F.A., Wuwongse, V. (eds) Intelligence in Networks. SMARTNET 1999. IFIP Advances in Information and Communication Technology, vol 32. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-35581-8_6
Download citation
DOI: https://doi.org/10.1007/978-0-387-35581-8_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-1022-9
Online ISBN: 978-0-387-35581-8
eBook Packages: Springer Book Archive