Distributed Fault-Tolerance

Powell, David

doi:10.1007/978-3-642-84696-0_6

Part of the book series: Research Reports ESPRIT (volume 1)

Abstract

Distribution and fault-tolerance are tightly related. Should a single element of a distributed system fail, users expect at worst a slight degradation of the service that is offered; distributed systems must thus have at least some built-in fault-tolerance. On the other hand, most fault-tolerant systems can, at some level or another, be seen as distributed systems due to their redundant processing resources. Distributed fault-tolerance is used here to refer to the class of techniques suitable for ensuring fault-tolerance in an architecture consisting of a set of processing elements (called nodes or stations) interconnected by a message-passing communication network (figure 1). The distributed fault-tolerance techniques discussed here are aimed at distributed systems in which the communication network consists of one or more local area networks. In particular, the existence of high-bandwidth broadcast channels allowing efficient multicast communication is assumed.

Author information

Authors and Affiliations

LAAS-CNRS, 7, avenue du Colonel Roche, F-31077, Toulouse, France
David Powell

Editor information

Editors and Affiliations

LAAS-CNRS, 7, avenue du Colonel Roche, F-31077, Toulouse, France
David Powell

About this chapter

Cite this chapter

Powell, D. (1991). Distributed Fault-Tolerance. In: Powell, D. (eds) Delta-4: A Generic Architecture for Dependable Distributed Computing. Research Reports ESPRIT, vol 1. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-84696-0_6

DOI: https://doi.org/10.1007/978-3-642-84696-0_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-54985-7
Online ISBN: 978-3-642-84696-0
eBook Packages: Springer Book Archive
