Beyond 1-Safety and 2-Safety for Replicated Databases: Group-Safety

  • Matthias Wiesmann
  • André Schiper
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2992)

Abstract

In this paper, we study the safety guarantees of group communication-based database replication techniques. We show that there is a model mismatch between group communication and database, and because of this, classical group communication systems cannot be used to build 2-safe database replication. We propose a new group communication primitive called end-to-end atomic broadcast that solves the problem, i.e., can be used to implement 2-safe database replication. We also introduce a new safety criterion, called group-safety, that has advantages both over 1-safety and 2-safety. Experimental results show the gain of efficiency of group-safety over lazy replication, which ensures only 1-safety.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Gray, J.N., Helland, P., O’Neil, P., Shasha, D.: The dangers of replication and a solution. In: Proceedings of the 1996 International Conference on Management of Data, Montreal, Canada, pp. 173–182. ACM-SIGMOD (1996)Google Scholar
  2. 2.
    Schiper, A., Raynal, M.: From group communication to transactions in distributed systems. Communications of the ACM 39, 84–87 (1996)CrossRefGoogle Scholar
  3. 3.
    Agrawal, D., Alonso, G., El Abbadi, A., Stanoi, I.: Exploiting atomic broadcast in replicated databases. Technical report, Department of Computer Science, University of California, Santa Barbara, California USA (1996)Google Scholar
  4. 4.
    Kemme, B., Alonso, G.: Don’t be lazy, be consistent: Postgres-R, a new way to implement database replication. In: Proceedings of the 26th International Conference on Very Large Databases (VLDB), Cairo, Egypt (2000)Google Scholar
  5. 5.
    Wiesmann, M.: Group Communications and Database Replication: Techniques, Issues and Performance. PhD thesis, École Polytechnique Fédérale de Lausanne, Switzerland (2002)Google Scholar
  6. 6.
    Holliday, J., Agrawal, D., Abbadi, A.E.: Using multicast communication to reduce deadlocks in replicated databases. In: Proceedings of the 19th Symposium on Reliable Distributed Systems SRDS 2000, Nürnberg, Germany, pp. 196–205. IEEE Computer Society, Los Alamitos (2000)CrossRefGoogle Scholar
  7. 7.
    Wiesmann, M., Pedone, F., Schiper, A., Kemme, B., Alonso, G.: Database replication techniques: a three parameter classification. In: Proceedings of 19th Symposium on Reliable Distributed Systems (SRDS 2000), Nürnberg, Germany, pp. 206–215. IEEE Computer Society, Los Alamitos (2000)CrossRefGoogle Scholar
  8. 8.
    Keidar, I.: A highly available paradigm for consistent object replication. Master’s thesis, The Hebrew University of Jerusalem, Jerusalem, Israel (1994); Also technical report CS 1994 (1994)Google Scholar
  9. 9.
    Stanoi, I., Agrawal, D., Abbadi, A.E.: Using broadcast primitives in replicated databases (abstract). In: Proceedings of the 16th Annual Symposium on Principles of Distributed Computing, Santa Barbara, California USA, p. 283. ACM, New York (1997)CrossRefGoogle Scholar
  10. 10.
    Pedone, F., Guerraoui, R., Schiper, A.: The database state machine approach. Technical Report SSC/1999/008, École Polytechnique Fédérale de Lausanne, Switzerland (1999)Google Scholar
  11. 11.
    Amir, Y., Tutu, C.: From total order to database replication. Technical Report CNDS-2001-6, Departement of Computer Science John Hopkins University, Baltimore, MD 21218 USA (2001)Google Scholar
  12. 12.
    Gray, J.N., Reuter, A.: Transaction Processing: concepts and techniques. In: Data Management Systems, Morgan Kaufmann Publishers, Inc., San Mateo (1993)Google Scholar
  13. 13.
    Frølund, S., Guerraoui, R.: Implementing e-transactions with asynchronous replication. IEEE Transactions on Parallel and Distributed Systems 12 (2001)Google Scholar
  14. 14.
    Gärtner, F.C.: A gentle introduction to failure detectors and related problems. Technical Report TUD-BS-2001-01, Darmstadt University of Technology, Department of Computer Science (2001)Google Scholar
  15. 15.
    Birman, K.P., Joseph, T.A.: Exploiting virtual synchrony in distributed systems. In: Proceedings of the 11th ACM Symposium on OS Principles, Austin, TX, USA, pp. 123–138. ACM SIGOPS, ACM (1987)Google Scholar
  16. 16.
    Malloth, C.P., Felber, P., Schiper, A., Wilhelm, U.: Phoenix: A toolkit for building fault-tolerant distributed applications in large scale. In: Workshop on Parallel and Distributed Platforms in Industrial Products, San Antonio, Texas, USA, IEEE, Los Alamitos (1995)Google Scholar
  17. 17.
    van Renesse, R., Birman, K.P., Maffeis, S.: Horus: A flexible group communication system. Communications of the ACM 39, 76–83 (1996)CrossRefGoogle Scholar
  18. 18.
    Hiltunen, M.A., Schlichting, R.D.: The cactus approach to building configurable middleware services. In: Proceedings of the Workshop on Dependable System Middleware and Group Communication (DSMGC 2000), Nürnberg, Germany (2000)Google Scholar
  19. 19.
    Birman, K., Constable, R., Hayden, M., Kreitz, C., Rodeh, O., van Renesse, R., Vogels, W.: The Horus and Ensemble projects: Accomplishments and limitations. In: Proceedings of the DARPA Information Survivability Conference & Exposition (DISCEX 2000), Hilton Head, South Carolina USA (2000) Google Scholar
  20. 20.
    Miranda, H., Pinto, A., Rodrigues, L.: Appia: A flexible protocol kernel supporting multiple coordinated channels. In: Proceedings of the 21st International Conference on Distributed Computing Systems (ICDCS 2001), Phoenix, Arizona, USA, pp. 707–710. IEEE Computer Society, Los Alamitos (2001)CrossRefGoogle Scholar
  21. 21.
    Aguilera, M.K., Chen, W., Toueg, S.: Failure detection and consensus in the crash recovery model. Distributed Computing 13, 99–125 (2000)CrossRefGoogle Scholar
  22. 22.
    Gafni, E., Lamport, L.: Disk paxos. Technical Report SRC 163, Compaq Systems Research Center, 130, Lytton Avenue Palo Alto, California 94301 USA (2000)Google Scholar
  23. 23.
    Rodrigues, L., Raynal, M.: Atomic broadcast in asynchronous crash-recovery distributed systems. In: Proceedings of the 20th International Conference on Distributed Systems (ICDCS 2000), Taipei, Taiwan (ROC), pp. 288–295. IEEE Computer Society, Los Alamitos (2000)CrossRefGoogle Scholar
  24. 24.
    Chandra, T.D., Toueg, S.: Unreliable failure detectors for reliable distributed systems. Communications of the ACM 43, 225–267 (1996)MATHMathSciNetGoogle Scholar
  25. 25.
    Lynch, N.: Distributed Algorithms. Morgan Kaufmann, San Francisco (1996)MATHGoogle Scholar
  26. 26.
    Saltzer, J.H., Reed, D.P., Clark, D.D.: End-to-end arguments in system design. ACM Transactions on Computer Systems 2, 277–288 (1984)CrossRefGoogle Scholar
  27. 27.
    Holliday, J.: Replicated database recovery using multicast communications. In: Proceedings of the Symposium on Network Computing and Applications (NCA 2001), Cambridge, MA, USA, pp. 104–107. IEEE, Los Alamitos (2001)CrossRefGoogle Scholar
  28. 28.
    Cheriton, D.R., Skeen, D.: Understanding the limitations of causally and totally ordered communication. In: Liskov, B. (ed.) Proceedings of the 14th Symposium on Operating Systems Principles, Asheville, North Carolina, vol. 27, pp. 44–57. ACM Press, New York (1993)Google Scholar
  29. 29.
    Keidar, I., Dolev, D.: Totally ordered broadcast in the face of network partitions. In: Avresky, D. (ed.) Dependable Network Computing, Kluwer Academic Publications, Dordrecht (2000)Google Scholar
  30. 30.
    Davidson, S.B., Garcia-Molina, H., Skeen, D.: Consistency in partitioned networks. ACM Computing Surveys 17, 341–370 (1985)CrossRefGoogle Scholar
  31. 31.
    Fu, A.W., Cheung, D.W.: A transaction replication scheme for a replicated database with node autonomy. In: Proceedings of the International Conference on Very Large Databases, Santiago, Chile (1994)Google Scholar
  32. 32.
    Kemme, B., Alonso, G.: A suite of database replication protocols based on group communication primitives. In: Proceedings of the 18th International Conference on Distributed Computing Systems (ICDCS 1998), Amsterdam, The Netherlands (1998)Google Scholar
  33. 33.
    Kemme, B., Pedone, F., Alonso, G., Schiper, A.: Processing transactions over optimistic atomic broadcast protocols. In: Proceedings of the International Conference on Distributed Computing Systems, Austin, Texas (1999)Google Scholar
  34. 34.
    Holliday, J., Agrawal, D., Abbadi, A.E.: The performance of database replication with group multicast. In: Proceedings of International Symposium on Fault Tolerant Computing (FTCS29), pp. 158–165. IEEE Computer Society, Los Alamitos (1999)Google Scholar
  35. 35.
    Babao§lu, Ö., Toueg, S.: Understanding non-blocking atomic commitement. Technical Report UBLCS-93-2, Laboratory for Computer Science, University of Bologna, 5 Piazza di Porta S. Donato, 40127 Bologna, Italy (1993)Google Scholar
  36. 36.
    Keidar, I., Dolev, D.: Increasing the resilience of distributed and replicated database systems. Journal of Computer and System Sciences (JCSS) 57, 309–324 (1998)MATHCrossRefMathSciNetGoogle Scholar
  37. 37.
    Jiménez-Paris, R., Patiño-Martínez, M., Alonso, G., Arévalo, S.: A low-latency non-blocking commit service. In: Welch, J.L. (ed.) DISC 2001. LNCS, vol. 2180, pp. 93–107. Springer, Heidelberg (2001)CrossRefGoogle Scholar
  38. 38.
    Wiesmann, M., Pedone, F., Schiper, A., Kemme, B., Alonso, G.: Understanding replication in databases and distributed systems. In: Proceedings of 20th International Conference on Distributed Computing Systems (ICDCS 2000), Taipei, Taiwan, R.O.C., IEEE Computer Society, Los Alamitos (2000)Google Scholar
  39. 39.
    Kemme, B., Bartoli, A., Babao§lu, Ö.: Online reconfiguration in replicated databases based on group communication. In: Proceedings of the Internationnal Conference on Dependable Systems and Networks (DSN2001), Göteborg, Sweden (2001)Google Scholar
  40. 40.
    Amir, Y.: Replication using group communication over a partitioned network. PhD thesis, Hebrew University of Jerusalem, Israel (1995)Google Scholar
  41. 41.
    Ezhilchelvan, P.D., Shrivastava, S.K.: Enhancing replica management services to cope with group failures. In: Krakowiak, S., Shrivastava, S.K. (eds.) BROADCAST 1999. LNCS, vol. 1752, pp. 79–103. Springer, Heidelberg (2000)CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Matthias Wiesmann
    • 1
  • André Schiper
    • 1
  1. 1.École Polytechnique Fédérale de Lausanne (EPFL)LausanneCH

Personalised recommendations