Skip to main content
Log in

Preventive Replication in a Database Cluster

  • Published:
Distributed and Parallel Databases Aims and scope Submit manuscript

Abstract

In a database cluster, preventive replication can provide strong consistency without the limitations of synchronous replication. In this paper, we present a full solution for preventive replication that supports multi-master and partial configurations, where databases are partially replicated at different nodes. To increase transaction throughput, we propose an optimization that eliminates delay at the expense of a few transaction aborts and we introduce concurrent replica refreshment. We describe large-scale experimentation of our algorithm based on our RepDB* prototype (http://www.sciences.univ-nantes.fr./lina/ATLAS/RepDB) over a cluster of 64 nodes running the PostgreSQL DBMS. Our experimental results using the TPC-C Benchmark show that the proposed approach yields excellent scale-up and speed-up.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. T. Anderson, Y. Breitbart, H. Korth, and A. Wool, “Replication, consistency, and practicality: Are these mutually exclusive”? in SIGMOD Conference, 1998, pp. 484–495.

  2. C. Coulon, G. Gaumer, E. Pacitti, and P. Valduriez, The RepDB* prototype: Preventive Replication in a Database Cluster, Base de Données Avancées, Montpellier, France, 2004.

  3. C. Coulon, E. Pacitti, and P. Valduriez, “Scaling up the preventive replication of autonomous databases in cluster systems,” in Int. Conf. on High Performance Computing for Computational Science (VecPar 2004), Valencia, Spain, 2004, Lecture Notes in Computer Science 3402, Springer 2005, pp. 174–188.

  4. C. Coulon, E. Pacitti, and P. Valduriez, “Consistency management for partial replication in a high Performance database cluster,” in IEEE Int. Conf. on Parallel and Distributed Systems (ICPADS 2005), Fukuoka, Japan, 2005.

  5. S. Gançarski, H. Naacke, E. Pacitti, and P. Valduriez, “Parallel processing with autonomous databases in a cluster system,” in Int. Conf. on Cooperative Information Systems (CoopIS), 2002.

  6. J. Gray, P. Helland, P. O’ Neil, and D. Shasha, “The danger of replication and a solution,” in ACM SIGMOD Int. Conf. on Management of Data, Montreal, 1996.

  7. V. Hadzilacos and S. Toueg, Fault-Tolerant Broadcasts and Related Problems. Distributed Systems, 2nd edition, S. Mullender (Ed.), Addison-Wesley, 1993.

  8. R. Jiménez-Peris, M. Patiño-Martínez, B. Kemme, and G. Alonso, “Improving the scalability of fault-tolerant database clusters: early results,” in Int. Conf. on distributed Computing Systems (ICDCS), 2002.

  9. B. Kemme and G. Alonso, “Don't be lazy be consistent: Postgres-R, a new way to implement database replication,” in Int. Conf. on Very Large Databases (VLDB), 2000.

  10. Y. Lin, B. Kemme, M. Patino-Martinez, and R. Jimenez-Peris, “Middleware based data replication providing snapshot isolation,” in ACM SIGMOD Int. Conf. on Management of Data. Baltimore, USA, June 2005.

  11. T. Özsu and P. Valduriez, Principles of Distributed Database Systems, 2nd edition, Prentice Hall, 1999.

  12. E. Pacitti, P. Minet, and E. Simon, “Replica Consistency in Lazy Master Replicated Databases,” Distributed and Parallel Databases, Kluwer Academic, vol. 9, no. 3, 2001.

  13. E. Pacitti, T. Özsu, C. Coulon, “Preventive Multi-Master Replication in a cluster of autonomous databases,” in Euro-Par Int. Conf., 2003.

  14. E. Pacitti and P. Valduriez, “Replicated Databases: Concepts, architectures and techniques,” Network and Information Systems Journal, Hermès, vol. 1, no. 3, 1998.

  15. Paris Project: http://www.irisa.fr/paris/General/cluster.htm.

  16. F. Pedonne, “A schiper: optimistic atomic broadcast,” in Distributed Information Systems Conf. (DISC), 1998.

  17. C. Plattner and G. Alonso, “Ganymed: scalable replication for transactional web applications,” in Proc. of the 5th International Middleware Conference, Toronto, Canada, 2004.

  18. F. Raab, TPC-C—The Standard Benchmark for Online transaction Processing (OLTP). The Benchmark Handbook for Database and Transaction Systems, 2nd edition, Morgan Kaufmann, 1993.

  19. RepDB*: Data Management Component for Replicating Autonomous Databases in a Cluster System (released as open source software under GPL). http://www.sciences.univ-nantes.fr/lina/ATLAS/RepDB.

  20. A. Sousa, F. Pedone, R. Oliveira, and F. Moura, “partial replication in the database state machine,” in IEEE Int. Symposium on Network Computing and Applications (NCA), 2001.

  21. A. Sousa, J. Pereira, F. Moura, and R. Oliveira, “Optimistic total order in wide area networks,” in Proc. 21st IEEE Symposium on Reliable Distributed Systems, 2002, pp. 190–199.

  22. K. Tindell and J. Clark, “Holistic schedulability analysis for distributed hard real-time systems,” Micro-processors and Microprogramming, vol. 40, 1994.

  23. P. Valduriez, “Parallel database systems: Open problems and new issues,” Int. Journal on Distributed and Parallel Databases, Kluwer Academic, vol. 1, no. 2, 1993.

  24. S. Wu and B. Kemme, “Postgres-R(SI): Combining replica control with concurrency control based on snapshot isolation,” in IEEE Int. Conference on Data Engineering (ICDE), 2005.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Esther Pacitti.

Additional information

Work partially funded by the MDP2P project of the ACI “Masses de Donniées” of the French Ministry of Research.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Pacitti, E., Coulon, C., Valduriez, P. et al. Preventive Replication in a Database Cluster. Distrib Parallel Databases 18, 223–251 (2005). https://doi.org/10.1007/s10619-005-4257-4

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10619-005-4257-4

Keywords

Navigation