A Fault Tolerant System Using Collaborative Agents

  • Sebnem Bora
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3949)


Replication of data or processes is an effective way to provide enhanced performance, high availability and fault tolerance in distributed systems. For instance, in systems based on the client-server model, a server may serve many clients and because of heavy loads, the server cannot respond to the requests on time. In such a case, replicating data or servers may improve performance. Moreover, data and processes can be replicated to protect against failures. However, this is a very complex procedure. In this paper, I propose a method, to make systems fault tolerant based on replication, by way of exploiting the use of collaborative agents. This method is also used to improve fault tolerance in multi-agent systems.


Multiagent System Mobile Agent Failure Detector Sequencer Agent Fault Tolerant System 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
  2. 2.
  3. 3.
    Cristian, F., et al.: Fault-Tolerance in the Advanced Automation System. In: 20th International Conference on Fault-Tolerant Computing, Newcastle upon Tyne, England (1990)Google Scholar
  4. 4.
    Mishra, S.: Consul: A Communication Substrate for Fault-Tolerant Distributed Programs. Ph.D thesis, Dept. of Computer Science, Univ. of Arizona, Tuscon, Arizona (1992)Google Scholar
  5. 5.
    Schneider, F.: Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial. ACM Computing Surveys 22(4), 299–319 (1990)CrossRefGoogle Scholar
  6. 6.
    Elmootazbellah, N., et al.: Replicated Distributed Processes. In: Proceedings of the Twenty-Second International Symposium on Fault Tolerant Computing (FTCS-22), pp. 18–27 (1992)Google Scholar
  7. 7.
    Shokri, E., et al.: An Approach for Adaptive Fault-Tolerance in Object-Oriented Open Distributed Systems. In: Workshop on Object-Oriented Reliable Distributed Systems (1997)Google Scholar
  8. 8.
    Ren, J., Cukier, M., Rubel, P., Sanders, W., Karr, D.: Building Dependable Distributed Applications using AQuA. In: Proceeding of the 4th IEEE International Symp. On High Assurance Systems Engineering, pp. 189–196 (1999)Google Scholar
  9. 9.
    Tanenbaum, A.S., van Steen, M.: Distributed Systems: Principles and Paradigms. Prentice-Hall, Englewood Cliffs (2002)MATHGoogle Scholar
  10. 10.
    Chockler, G.V., Keidar, I., Vitenberg, R.: Group Communication Specifications: A Comprehensive Study. ACM Computing Surveys 33(4), 1–43 (2001)CrossRefGoogle Scholar
  11. 11.
    Grasshopper Programmer’s Guide,

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Sebnem Bora
    • 1
  1. 1.Department of Computer EngineeringEge UniversityIzmirTurkey

Personalised recommendations