Preventive Replication in a Database Cluster
- 92 Downloads
In a database cluster, preventive replication can provide strong consistency without the limitations of synchronous replication. In this paper, we present a full solution for preventive replication that supports multi-master and partial configurations, where databases are partially replicated at different nodes. To increase transaction throughput, we propose an optimization that eliminates delay at the expense of a few transaction aborts and we introduce concurrent replica refreshment. We describe large-scale experimentation of our algorithm based on our RepDB* prototype (http://www.sciences.univ-nantes.fr./lina/ATLAS/RepDB) over a cluster of 64 nodes running the PostgreSQL DBMS. Our experimental results using the TPC-C Benchmark show that the proposed approach yields excellent scale-up and speed-up.
Keywordsdatabase cluster partial replication preventive replication strong consistency TPC-C benchmarking
Unable to display preview. Download preview PDF.
- 1.T. Anderson, Y. Breitbart, H. Korth, and A. Wool, “Replication, consistency, and practicality: Are these mutually exclusive”? in SIGMOD Conference, 1998, pp. 484–495.Google Scholar
- 2.C. Coulon, G. Gaumer, E. Pacitti, and P. Valduriez, The RepDB* prototype: Preventive Replication in a Database Cluster, Base de Données Avancées, Montpellier, France, 2004.Google Scholar
- 3.C. Coulon, E. Pacitti, and P. Valduriez, “Scaling up the preventive replication of autonomous databases in cluster systems,” in Int. Conf. on High Performance Computing for Computational Science (VecPar 2004), Valencia, Spain, 2004, Lecture Notes in Computer Science 3402, Springer 2005, pp. 174–188.Google Scholar
- 4.C. Coulon, E. Pacitti, and P. Valduriez, “Consistency management for partial replication in a high Performance database cluster,” in IEEE Int. Conf. on Parallel and Distributed Systems (ICPADS 2005), Fukuoka, Japan, 2005.Google Scholar
- 5.S. Gançarski, H. Naacke, E. Pacitti, and P. Valduriez, “Parallel processing with autonomous databases in a cluster system,” in Int. Conf. on Cooperative Information Systems (CoopIS), 2002.Google Scholar
- 6.J. Gray, P. Helland, P. O’ Neil, and D. Shasha, “The danger of replication and a solution,” in ACM SIGMOD Int. Conf. on Management of Data, Montreal, 1996.Google Scholar
- 7.V. Hadzilacos and S. Toueg, Fault-Tolerant Broadcasts and Related Problems. Distributed Systems, 2nd edition, S. Mullender (Ed.), Addison-Wesley, 1993.Google Scholar
- 8.R. Jiménez-Peris, M. Patiño-Martínez, B. Kemme, and G. Alonso, “Improving the scalability of fault-tolerant database clusters: early results,” in Int. Conf. on distributed Computing Systems (ICDCS), 2002.Google Scholar
- 9.B. Kemme and G. Alonso, “Don't be lazy be consistent: Postgres-R, a new way to implement database replication,” in Int. Conf. on Very Large Databases (VLDB), 2000.Google Scholar
- 10.Y. Lin, B. Kemme, M. Patino-Martinez, and R. Jimenez-Peris, “Middleware based data replication providing snapshot isolation,” in ACM SIGMOD Int. Conf. on Management of Data. Baltimore, USA, June 2005.Google Scholar
- 11.T. Özsu and P. Valduriez, Principles of Distributed Database Systems, 2nd edition, Prentice Hall, 1999.Google Scholar
- 12.E. Pacitti, P. Minet, and E. Simon, “Replica Consistency in Lazy Master Replicated Databases,” Distributed and Parallel Databases, Kluwer Academic, vol. 9, no. 3, 2001.Google Scholar
- 13.E. Pacitti, T. Özsu, C. Coulon, “Preventive Multi-Master Replication in a cluster of autonomous databases,” in Euro-Par Int. Conf., 2003.Google Scholar
- 14.E. Pacitti and P. Valduriez, “Replicated Databases: Concepts, architectures and techniques,” Network and Information Systems Journal, Hermès, vol. 1, no. 3, 1998.Google Scholar
- 15.Paris Project: http://www.irisa.fr/paris/General/cluster.htm.
- 16.F. Pedonne, “A schiper: optimistic atomic broadcast,” in Distributed Information Systems Conf. (DISC), 1998.Google Scholar
- 17.C. Plattner and G. Alonso, “Ganymed: scalable replication for transactional web applications,” in Proc. of the 5th International Middleware Conference, Toronto, Canada, 2004.Google Scholar
- 18.F. Raab, TPC-C—The Standard Benchmark for Online transaction Processing (OLTP). The Benchmark Handbook for Database and Transaction Systems, 2nd edition, Morgan Kaufmann, 1993.Google Scholar
- 19.RepDB*: Data Management Component for Replicating Autonomous Databases in a Cluster System (released as open source software under GPL). http://www.sciences.univ-nantes.fr/lina/ATLAS/RepDB.
- 20.A. Sousa, F. Pedone, R. Oliveira, and F. Moura, “partial replication in the database state machine,” in IEEE Int. Symposium on Network Computing and Applications (NCA), 2001.Google Scholar
- 21.A. Sousa, J. Pereira, F. Moura, and R. Oliveira, “Optimistic total order in wide area networks,” in Proc. 21st IEEE Symposium on Reliable Distributed Systems, 2002, pp. 190–199.Google Scholar
- 22.K. Tindell and J. Clark, “Holistic schedulability analysis for distributed hard real-time systems,” Micro-processors and Microprogramming, vol. 40, 1994.Google Scholar
- 23.P. Valduriez, “Parallel database systems: Open problems and new issues,” Int. Journal on Distributed and Parallel Databases, Kluwer Academic, vol. 1, no. 2, 1993.Google Scholar
- 24.S. Wu and B. Kemme, “Postgres-R(SI): Combining replica control with concurrency control based on snapshot isolation,” in IEEE Int. Conference on Data Engineering (ICDE), 2005.Google Scholar