Abstract
Reliability and fault-tolerance issues are important in the study of interconnection networks used in large multiprocessor systems because of the large number of components involved. In this paper we study these issues with respect to multistage networks which are typically built forN inputs andN outputs using 2 × 2 switching elements and log2 N stages. In such networks, the failure of a switching element or connecting link destroys the communication capability between one or more pair(s) of source and destination terminals. Many techniques exist for designing multistage networks that tolerate switch and/or link failures without losing connectivity. Several approaches for achieving fault-tolerance in multistage interconnection networks are described in this paper. The techniques vary from providing redundant components in the network to making multiple passes through the faulty network. Quantitative measures are introduced for analysis of the reliability of these networks in terms of the component reliabilities. Several examples are given to illustrate the techniques.
Similar content being viewed by others
References
Abraham J A 1979IEEE Trans. Reliab. R-23: 58–61
Adams G B III, Siegel H J 1982IEEE Trans. Comput. C-31: 443–454
Agrawal D P 1983IEEE Trans. Comput. C-32: 637–648
Anderson G A, Jensen E D 1975ACM Comput. Surv. 7: 197–213
Barnes G H, Brown R M, Kato M, Kuck D J, Slotnik D L, Stokes R A 1968IEEE Trans. Comput. C-17: 746–757
Batcher K E 1976Proceedings of the 1976 International Conference on Parallel Processing (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 65–71
Benes V E 1965Mathematical theory of connecting networks and telephone traffic (New York: Academic Press)
Cherkassky V, Opper E, Malek M 1984Proceedings of the 14th Annual International Symposium of Fault-Tolerant Computing (Silver Spring,Md:Ieee Comput. Soc. Press)
Ciminiera L, Serra A 1982Proceedings of the International Conference in Parallel Processing (Silver Spring,Md:Ieee Comput. Soc. Press)
Computer 1985 18 (no. 6)
Feng T Y 1981Computer 14: 12–27
Goke L R, Lipovski G J 1973Proceedings of the First Annual Symposium on Computer Architecture (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 21–28
Grnarov A, Kleinrock L, Gerla M 1980A New Algorithm for Symbolic Reliability Analysis of Computer Communication Networks, Proceedings of the Pacific Telecommunications Conference
Harris J A, Smith D R 1977Proceedings of the Fourth Symposium on Computer Architecture (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 41–48
Hariri S, Raghavendra C S 1986Proceedings of the IEEE INFOCOM 86 (Silver Spring,Md:Ieee Comput. Soc. Press)
Hwang K 1984Supercomputers: Design and applications, Tutorial (Silver Spring,Md:Ieee Comput. Soc. Press)
Kung H T 1982Computer 15: 37–46
Lang T, Valero M, Alegre I 1982IEEE Trans. Comput. C-31: 1227–1234
Lawrie D H 1975IEEE Trans. Comput. C-24: 1145–1155
Liu M T 1978 inAdvances in computers (New York: Academic Press) 17: 163–221
McMillen R J, Siegel H J 1982IEEE Trans. Comput. C-31: 1202–1214
Padmanabhan K, Lawrie D H 1983aProceedings of the International Conference on Parallel Processing (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 71–75
Padmanabhan K, Lawrie D H 1983bIEEE Trans. Comput. C-32: 1099–1108
Parker D S 1980IEEE Trans. Comput. C-29: 213–222
Parker D S, Raghavendra C S 1984IEEE Trans. Comput. C-33: pp. 367–373
Patel J H 1981IEEE Trans. Comput. C-30: 771–780
Pease M C 1977IEEE Trans. Comput. C-26: 458–473
Pfister G F, Brantley W C, George D A, Harvey S L, Kleinfelder W J, McAuliffe K P, Melton E A, Norton V A, Weiss J 1985Proceedings of the International Conference on Parallel Processing (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 764–771
Raghavendra C S, Varma A 1984Proceedings of the Real-Time Systems Symposium, Austin, Texas (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 153–164
Reddy S M, Kumar V P 1984Proceedings of the International Conference on Parallel Processing (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 155–164
Satyanarayana A, Hagstrom J N 1981IEEE Trans. Reliab. R-30: 325–334
Seitz C L 1985Commun. ACM 28: 22–33
Shen J P, Hayes J P 1984IEEE Trans. Comput. C-33: 241–248
Stone H S 1971IEEE Trans. Comput. C-20: 153–161
Tanimoto S L 1983Proceedings of the 10th Annual International Symposium on Computer Architecture (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 372–378
Varma A, Raghavendra C S 1985Proceedings of the International Conference on Parallel Processing (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 328–333
Varma A, Raghavendra C S 1986aProceedings of the 6th International Conference on Distributed Computing Systems, Cambridge, Massachusetts (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 54–61
Varma A, Raghavendra C S 1986bDigest of Papers, the 16th Annual Symposium on Fault-Tolerant Computing, Vienna, Austria (Silver Spring,Md:Ieee Comput. Soc. Press) pp. 104–109
Wittie L D 1981IEEE Trans. Comput. C-30: 264–273
Wu C L, Feng T-Y 1980IEEE Trans. Comput. C-29: 694–702
Author information
Authors and Affiliations
Additional information
This research is supported by thensf Presidential Young Investigator Award No.dci-8452003, a grant from AT&T Information Systems, and a grant fromtrw.
Rights and permissions
About this article
Cite this article
Raghavendra, C.S., Varma, A. Reliability and fault-tolerance in multistage interconnection networks. Sadhana 11, 111–128 (1987). https://doi.org/10.1007/BF02811314
Issue Date:
DOI: https://doi.org/10.1007/BF02811314