Abstract
The concept of object can be employed to achieve tolerance to hardware faults in distributed systems. Replication by introducing several copies for each object allows a continuous service even in case of failure. In particular, the paper describes an object model, PROM, which exploits replication by defining several passive back-up copies for any object. The system automatically recovers any failure of a copy in execution by activating a spare copy and restarting it from a previous checkpoint.
The aim of the paper is the analysis of the effective support for PROM. This support is organized in structured levels on a distributed architecture. The services that the support should include to guarantee the desired replication model are described.
Similar content being viewed by others
References
G.T. Almeset al., The EDEN System: A Technical Review,IEEE Trans. on S.E., SE-11: 1 (1985).
ANSA Reference Manual, ESPRIT Technical Week Edition, Release 00.03 (Draft), June 1987.
J.F. Bartlett, A NonStop Kernel, Proc. 8th Symp. on Operating System Principles, December 1981.
K. P. Birman, Replication and Fault-Tolerance in the ISIS System, Tech. Rep. 85-668, Cornell University, March 1985.
K.P. Birman,et al., Implementing Fault-Tolerant Distributed Objects,IEEE Trans. Soft. Eng., SE-11: n6 (1985).
K.P. Birman and T. A. Joseph, Reliable Communication in an Unreliable Environment, ACM TOCS, Feb. 1987.
A. Borg,et al., A Message System Supporting Fault Tolerance, Proc. 9th Symp. on Operating System Principles, October 1983.
A.J. Borr, Transaction Monitoring in Encompass: Reliable Distributed Transaction Processing, Proc. Int. Conf. on Very Large Databases, September 1981.
A. Corradi and L. Leonardi, An Environment based on Parallel Objects: PO, IEEE Phoenix Conf. on Computers and Communications, Scottsdale, Feb. 1987.
A. Corradi, L. Leonardi, and C. Salati, Availability by Replication in Embedded Distributed Systems: The CONCORDIA Project, SARSS'87, Nov. 1987.
Bull, INESC, MCS Functional Specifications, DELTA-4 ESPRIT Technical Rept., Aug. 1986.
DELTA-4—Overall System Specifications, DELTA-4 ESPRIT Technical Rept., D. Powell ed., Dec. 1988.
A. Goldberg and J. Robson. SMALLTALK-80: the Language and Its Implementation, Addison Wesley, 1983.
J. Gray, Why Do Computers Stop and What Can Be Done about It?, 5th Symposium on Reliability in Distributed Systems and Databases Systems, Jan. 1986.
C. Hewitt, and H. Baker, Laws for Communicating Parallel Processes, IFIP Conf. Proc., North-Holland, 1977.
C.A.R. Hoare, Monitors: an Operating System Structuring Concept,CACM.,17: 10 (1974).
A.K. Jones, A Conceptual Tool for Structuring Software, in:Operating Systems, An Advanced Course, R. Bayeret al. eds., Springer-Verlag. N.60, 1987.
B. Lampson, Atomic Transactions, in:Distributed Systems, Springer-Verlag, 1981.
T.J. LeBlanc, and R.P. Cook. An Analysis of Language Models for High-Performance Communication in Local-Area Networks,ACM Sigplan,18:6 (1983).
A. Lister, The Problem of Nested Monitor Calls,Operating System Review,11: 3 (1977).
B. Liskov, The Argus Language and System, Lecture Notes for the Advanced Course on Distributed Systems-Methods and Tools for Specification, Institut fuer Informatik der Tech. Univ. Munchen, April 1984.
B. Liskov,et al., Communication in the Mercury System, Proc. of the 21th Annual Hawaii Int. Conf. on System Sciences, Jan. 1988.
R. Rashid and G. Robertson, Accent: A Communication Oriented Network Operating System Kernel, 8th ACM SOSP, Dec. 1981.
F. P Schneider, Byzantine General in Action: Implementing Fail-Stop Processors,ACM Trans. on Computer Systems,2: 2 (1984).
S. K. Shrivastava and F. Panzieri, The design of a Reliable Remote Procedure Call Mechanism;IEEE Trans. on Computers, C-31: 7 (1982).
C. Weinstock, SIFT: System Design and Implementation, 10th Symposium on Fault Tolerant Computing, Oct. 1980.
N. Wirth, Programming in MODULA-2 (Third, corrected edition), Springer-Verlag, 1985.
A. Yonezawa and M. Tokoro (eds.), Object-Oriented Concurrent Programming, MIT Press, 1987.
H. Zimmermannet al., CHORUS: A Communication and Processing Architecture for Distributed Systems, INRIA Tech. Report, N.328, Sept. 1984.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Corradi, A., Leonardi, L. PROM: A support for robust replication in a distributed object environment. J. of Compt. Sci. & Technol. 5, 139–155 (1990). https://doi.org/10.1007/BF02943420
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF02943420