A Theory for Observational Fault Tolerance

Francalanza, Adrian; Hennessy, Matthew

doi:10.1007/11690634_2

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 3921)

Included in the following conference series:

International Conference on Foundations of Software Science and Computation Structures

Abstract

In general, faults cannot be prevented; instead, they need to be tolerated to guarantee certain degrees of software dependability. We develop a theory of fault tolerance for a distributed pi-calculus, in which locations act as units of failure and redundancy is distributed across independently failing locations. We give formal definitions of fault-tolerant programs in our calculus, based on the well-studied notion of contextual equivalence. We then develop bisimulation proof techniques for verifying fault-tolerance properties of distributed programs and show that they are sound with respect to our definitions of fault tolerance.

Author information

Authors and Affiliations

University of Malta, Msida, MSD 06, Malta
Adrian Francalanza
University of Sussex, Brighton, BN1 9RH, England
Matthew Hennessy

Editor information

Editors and Affiliations

School of Computer Science, Reykjavik University, Kringlan 1, 103, Reykjavík, Iceland
Luca Aceto
Department of Computer Science, Reykjavík University, Kringlan 1, IS-103, Reykjavík, Iceland
Anna Ingólfsdóttir

About this paper

Cite this paper

Francalanza, A., Hennessy, M. (2006). A Theory for Observational Fault Tolerance. In: Aceto, L., Ingólfsdóttir, A. (eds) Foundations of Software Science and Computation Structures. FoSSaCS 2006. Lecture Notes in Computer Science, vol 3921. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11690634_2

DOI: https://doi.org/10.1007/11690634_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33045-5
Online ISBN: 978-3-540-33046-2
eBook Packages: Computer Science, Computer Science (R0)
