Increasing the Expressiveness of Analytical Performance Models for Replicated Databases

  • Matthias Nicola
  • Matthias Jarke
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1540)

Abstract

The vast number of design options in replicated databases requires efficient analytical performance evaluations so that the considerable overhead of simulations or measurements can be focused on a few promising options. A review of existing analytical models in terms of their modeling assumptions, replication schemata considered, and network properties captured, shows that data replication and intersite communication as well as workload patterns should be modeled more accurately. Based on this analysis, we define a new modeling approach named 2RC (2-dimensional replication model with integrated communication). We derive a complete analytical queueing model for 2RC and demonstrate that it is of higher expressiveness than existing models. 2RC also yields a novel bottleneck analysis and permits to evaluate the trade-off between throughput and availability.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. [1]
    G. Alonso: Partial Database Replication and Group Communication Primitives, Proceedings of the 2nd European Research Seminar on Advances in Distributed Systems (ERSADS’97), March 1997.Google Scholar
  2. [2]
    R. Alonso, D. Barbara, H. Garcia-Molina: Data Caching Issues in an Information Retrieval System, ACM Transactions on Database Systems, Vol. 15,No. 3, pp. 359–384, 1990.CrossRefGoogle Scholar
  3. [3]
    T. Anderson, Y. Breitbart, H. Korth, A. Wool: Replication, Consistency, and Practicality: Are These Mutually Exclusive, ACM SIGMOD International Conference on Management of Data, pp484–495, June 1998.Google Scholar
  4. [4]
    F. Bacelli, E.G. Coffmann Jr.: A database replication analysis using an M/M/m queue with service interruptions, Performance Evaluation Review, Vol. 11,No. 4, pp. 102–107, 1983.CrossRefGoogle Scholar
  5. [5]
    Sujata Banerjee, Victor O K Li, Chihping Wang: Performance analysis of the send-ondemand: A distributed database concurrency control protocol for high-speed networks, Computer Communications, Vol. 17,No. 3, pp. 189–204, March 1994.CrossRefGoogle Scholar
  6. [6]
    A.B. Bondi, V. Jin: A performance model of a design for a minimally replicated distributed database for database driven telecommunication services, Journal on Distributed and Parallel Databases, Vol. 4,No. 4, pp. 295–317, October 1996.CrossRefGoogle Scholar
  7. [7]
    Michael J. Carey, Miron Livny: Distributed Concurrency Control Performance: A Study of Algorithms, Distribution, and Replication, Proceedings of the 14th International Conference on Very Large Databases, pp. 13–25, 1988.Google Scholar
  8. [8]
    S. Ceri, M.A.H. Houtsma, A.M. Keller, P. Samarati: A Classification of Update Methods for Replicated Databases, Technical Report STAN-CS-91-1392, Stanford University, October 1991.Google Scholar
  9. [9]
    Shu-Wie Chen, Calton Pu: A Structural Classification of Integrated Replica Control Mechanisms, Technical Report CUCS-006-92, Columbia University, NY, 1992.Google Scholar
  10. [10]
    B Ciciani, D.M Dias, P.S. Yu: Analysis of Replication in Distributed Database Systems, IEEE Transactions on Knowledge and Data Engineering, Vol. 2,No. 2, pp. 247–261, June 1990.CrossRefGoogle Scholar
  11. [11]
    E.G. Coffmann, Erol Gelenbe, Brigitte Plateau: Optimization of the number of copies in a distributed system, IEEE Transactions on Software Engineering, Vol. 7, pp. 78–84, January 1981.CrossRefGoogle Scholar
  12. [12]
    R. Gallersdorfer, K. Klabunde, A. Stolz, M. Essmajor: Intelligent Networks as a Data Intensive Application-Final Project Report, Technical Report AIB-96-14, University of Aachen, 1996.Google Scholar
  13. [13]
    Rainer Gallersdorfer, Matthias Nicola: Improving Performance in Replicated Databases through Relaxed Coherency, Proceedings of the 21th International Conference on Very Large Databases, pp. 445–456, Sept 1995.Google Scholar
  14. [14]
    H. Garcia-Molina: Performance of the Update Algorithms for Replicated Data in a Distributed Database, Ph.D. Dissertation, revised version, Computer Science Department, Stanford University, North Holland, 1982.Google Scholar
  15. [15]
    Jim Gray, P. Helland, P. O’Neil, D. Shasha: The dangers of replication and a solution, SIGMOD Record, Vol. 25,No. 2, pp. 173–182, June 1996.CrossRefGoogle Scholar
  16. [16]
    S.Y. Hwang, K.S. Lee, Y.H. Chin: Data Replication in a Distributed System: A Performance Study, 7th International Conference on Database and Expert Systems Applications, pp. 708–717 1996.Google Scholar
  17. [17]
    B. Kemme, G. Alonso: A Suite of Database Replication Protocols based on Group Communication Primitives, Proceedings of the 18th International Conference on Distributed Computing Systems, May 1998.Google Scholar
  18. [18]
    C.S. Keum, E.K. Hong, W.Y. Kim, K.Y. Whang: Performance Evaluation of Replica Control Algorithms in a Locally Distributed Database System, Proceedings of the 4th International Conference on Database Systems for Advanced Database Applications, pp. 388–396, April 1995.Google Scholar
  19. [19]
    L. Kleinrock: Queueing Systems, Volume I: Theory, John Wiley & Sons, 1975.Google Scholar
  20. [20]
    Kin K. Leung: An Update Algorithm for Replicated Signalling Databases in Wireless and Advanced Intelligent Networks, IEEE Transactions on Computers, Vol. 46,No. 3, pp. 362–367, March 1997.CrossRefGoogle Scholar
  21. [21]
    D. Liang, S. K. Tripathi: Performance Analysis of Long-Lived Transaction Processing Systems with Rollbacks and Aborts, IEEE Transactions on Knowledge and Data Engineering, Vol. 8,No. 5, pp. 802–815, 1996.CrossRefGoogle Scholar
  22. [22]
    M.C. Little, D.L. McCue: The Replica Management System: A Scheme for Flexible and Dynamic Replication, Proceedings of the 2nd Workshop on Configurable Distributed Systems, 1994.Google Scholar
  23. [23]
    J. Mc Dermott, R. Mukkamala: Performance Analysis of Transaction Management Algorithms for the SINTRA Replicated Architecture Database Systems, IFIP Transactions (Comp. Science & Technology), Vol. A-47, pp. 215–234, 1994.Google Scholar
  24. [24]
    Randolph D. Nelson, Balakrishna R. Iyer: Analysis of a Replicated Database“, Performance Evaluation, Vol. 5, pp. 133–148, 1985.CrossRefGoogle Scholar
  25. [25]
    Esther Pacitti, Erics Simon: Update Propagation Strategies to Improve Freshness of Data in Lazy Master Schemes, Technical Report No. 3233, INRIA Rocquencourt, France, August 1997.Google Scholar
  26. [26]
    J.F. Ren, Y. Takahashi, T. Hasegawa: Analysis of impact of network delay on multiversion timestamp algorithms in DDBS, Performance Evaluation, pp. 21–50, July 1996.Google Scholar
  27. [27]
    D. Saha, S. Rangarajan, S. K. Tripathi: An Analysis of the Average Message Overhead in Replica Control Protocols, IEEE Transactions on Parallel and Distributed Systems, Vol. 7,No. 10, pp. 1026–1034, Oct 1996.CrossRefGoogle Scholar
  28. [28]
    S.H. Son, N. Haghighi: Performance Evaluation of Multiversion Database Systems, Proceedings of the 6th International Conference on Data Engineering, pp. 129–136, 1990.Google Scholar
  29. [29]
    S.H. Son, F. Zhang: Real-Time Replication Control for Distributed Database Systems: Algorithms and Their Performance“ 4th International Conference on Database Systems for Advanced Database Applications, pp. 214–221 1995.Google Scholar
  30. [30]
    M. Stonebraker. Concurrency Control and Consistency of Multiple Copies of Data in Distributed Ingres, IEEE Transactions on Software Engineering, 5(3):188–194, 1979.CrossRefGoogle Scholar
  31. [31]
    P. Triantafillou: Independent Recovery in Large-Scale Distributed Systems, IEEE Transactions on Software Engineering, Vol. 22,No. 11, pp. 812–826, November 1996.CrossRefGoogle Scholar
  32. [32]
    O. Wolfson, S. Jajodia, Y. Huang: An Adaptive Data Replication Algorithm, ACM Transactions on Database Systems, Vol. 22,No. 2, pp. 255–314, 1997.CrossRefGoogle Scholar
  33. [33]
    Shaoyu Zhou, M.H. Williams, H. Taylor: Practical Throughput Estimation for Parallel Databases, Software Engineering Journal, Vol. 11,No. 4, pp. 255–263, July 1996.CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1999

Authors and Affiliations

  • Matthias Nicola
    • 1
  • Matthias Jarke
    • 1
  1. 1.Informatik V (Information Systems)Technical University of AachenAachenGermany

Personalised recommendations