Abstract
Distribution and fault-tolerance are tightly related. Should a single element of a distributed system fail, users expect at worst a slight degradation of the service that is offered; distributed systems must thus at least have some built-in fault-tolerance. On the other hand, most fault-tolerant systems can, at some level or another, be seen as a distributed system due to their redundant processing resources. Distributed fault-tolerance is used here to refer to that class of techniques suitable for ensuring fault-tolerance in an architecture consisting of a set of processing elements (called nodes or stations) interconnected by a message-passing communication network (figure 1). The distributed fault-tolerance techniques discussed here are focussed towards distributed systems in which the communication network consists of one or more local area networks. In particular, the existence of high-bandwidth broadcast channels allowing efficient multicast communication is assumed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1991 ECSC — EEC — EAEC, Brussels — Luxembourg
About this chapter
Cite this chapter
Powell, D. (1991). Distributed Fault-Tolerance. In: Powell, D. (eds) Delta-4: A Generic Architecture for Dependable Distributed Computing. Research Reports ESPRIT, vol 1. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-84696-0_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-84696-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-54985-7
Online ISBN: 978-3-642-84696-0
eBook Packages: Springer Book Archive