Distributed and Parallel Databases

, Volume 18, Issue 3, pp 223–251 | Cite as

Preventive Replication in a Database Cluster

  • Esther Pacitti
  • Cédric Coulon
  • Patrick Valduriez
  • M. Tamer Özsu


In a database cluster, preventive replication can provide strong consistency without the limitations of synchronous replication. In this paper, we present a full solution for preventive replication that supports multi-master and partial configurations, where databases are partially replicated at different nodes. To increase transaction throughput, we propose an optimization that eliminates delay at the expense of a few transaction aborts and we introduce concurrent replica refreshment. We describe large-scale experimentation of our algorithm based on our RepDB* prototype ( over a cluster of 64 nodes running the PostgreSQL DBMS. Our experimental results using the TPC-C Benchmark show that the proposed approach yields excellent scale-up and speed-up.


database cluster partial replication preventive replication strong consistency TPC-C benchmarking 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    T. Anderson, Y. Breitbart, H. Korth, and A. Wool, “Replication, consistency, and practicality: Are these mutually exclusive”? in SIGMOD Conference, 1998, pp. 484–495.Google Scholar
  2. 2.
    C. Coulon, G. Gaumer, E. Pacitti, and P. Valduriez, The RepDB* prototype: Preventive Replication in a Database Cluster, Base de Données Avancées, Montpellier, France, 2004.Google Scholar
  3. 3.
    C. Coulon, E. Pacitti, and P. Valduriez, “Scaling up the preventive replication of autonomous databases in cluster systems,” in Int. Conf. on High Performance Computing for Computational Science (VecPar 2004), Valencia, Spain, 2004, Lecture Notes in Computer Science 3402, Springer 2005, pp. 174–188.Google Scholar
  4. 4.
    C. Coulon, E. Pacitti, and P. Valduriez, “Consistency management for partial replication in a high Performance database cluster,” in IEEE Int. Conf. on Parallel and Distributed Systems (ICPADS 2005), Fukuoka, Japan, 2005.Google Scholar
  5. 5.
    S. Gançarski, H. Naacke, E. Pacitti, and P. Valduriez, “Parallel processing with autonomous databases in a cluster system,” in Int. Conf. on Cooperative Information Systems (CoopIS), 2002.Google Scholar
  6. 6.
    J. Gray, P. Helland, P. O’ Neil, and D. Shasha, “The danger of replication and a solution,” in ACM SIGMOD Int. Conf. on Management of Data, Montreal, 1996.Google Scholar
  7. 7.
    V. Hadzilacos and S. Toueg, Fault-Tolerant Broadcasts and Related Problems. Distributed Systems, 2nd edition, S. Mullender (Ed.), Addison-Wesley, 1993.Google Scholar
  8. 8.
    R. Jiménez-Peris, M. Patiño-Martínez, B. Kemme, and G. Alonso, “Improving the scalability of fault-tolerant database clusters: early results,” in Int. Conf. on distributed Computing Systems (ICDCS), 2002.Google Scholar
  9. 9.
    B. Kemme and G. Alonso, “Don't be lazy be consistent: Postgres-R, a new way to implement database replication,” in Int. Conf. on Very Large Databases (VLDB), 2000.Google Scholar
  10. 10.
    Y. Lin, B. Kemme, M. Patino-Martinez, and R. Jimenez-Peris, “Middleware based data replication providing snapshot isolation,” in ACM SIGMOD Int. Conf. on Management of Data. Baltimore, USA, June 2005.Google Scholar
  11. 11.
    T. Özsu and P. Valduriez, Principles of Distributed Database Systems, 2nd edition, Prentice Hall, 1999.Google Scholar
  12. 12.
    E. Pacitti, P. Minet, and E. Simon, “Replica Consistency in Lazy Master Replicated Databases,” Distributed and Parallel Databases, Kluwer Academic, vol. 9, no. 3, 2001.Google Scholar
  13. 13.
    E. Pacitti, T. Özsu, C. Coulon, “Preventive Multi-Master Replication in a cluster of autonomous databases,” in Euro-Par Int. Conf., 2003.Google Scholar
  14. 14.
    E. Pacitti and P. Valduriez, “Replicated Databases: Concepts, architectures and techniques,” Network and Information Systems Journal, Hermès, vol. 1, no. 3, 1998.Google Scholar
  15. 15.
  16. 16.
    F. Pedonne, “A schiper: optimistic atomic broadcast,” in Distributed Information Systems Conf. (DISC), 1998.Google Scholar
  17. 17.
    C. Plattner and G. Alonso, “Ganymed: scalable replication for transactional web applications,” in Proc. of the 5th International Middleware Conference, Toronto, Canada, 2004.Google Scholar
  18. 18.
    F. Raab, TPC-C—The Standard Benchmark for Online transaction Processing (OLTP). The Benchmark Handbook for Database and Transaction Systems, 2nd edition, Morgan Kaufmann, 1993.Google Scholar
  19. 19.
    RepDB*: Data Management Component for Replicating Autonomous Databases in a Cluster System (released as open source software under GPL).
  20. 20.
    A. Sousa, F. Pedone, R. Oliveira, and F. Moura, “partial replication in the database state machine,” in IEEE Int. Symposium on Network Computing and Applications (NCA), 2001.Google Scholar
  21. 21.
    A. Sousa, J. Pereira, F. Moura, and R. Oliveira, “Optimistic total order in wide area networks,” in Proc. 21st IEEE Symposium on Reliable Distributed Systems, 2002, pp. 190–199.Google Scholar
  22. 22.
    K. Tindell and J. Clark, “Holistic schedulability analysis for distributed hard real-time systems,” Micro-processors and Microprogramming, vol. 40, 1994.Google Scholar
  23. 23.
    P. Valduriez, “Parallel database systems: Open problems and new issues,” Int. Journal on Distributed and Parallel Databases, Kluwer Academic, vol. 1, no. 2, 1993.Google Scholar
  24. 24.
    S. Wu and B. Kemme, “Postgres-R(SI): Combining replica control with concurrency control based on snapshot isolation,” in IEEE Int. Conference on Data Engineering (ICDE), 2005.Google Scholar

Copyright information

© Springer Science + Business Media, Inc 2005

Authors and Affiliations

  • Esther Pacitti
    • 1
  • Cédric Coulon
    • 1
  • Patrick Valduriez
    • 1
  • M. Tamer Özsu
    • 2
  1. 1.INRIA and LINAUniversity of NantesFrance
  2. 2.University of WaterlooCanada

Personalised recommendations