Stochastic Modeling of Composite Web Services for Closed-Form Analysis of Their Performance and Reliability Bottlenecks

  • N. Sato
  • K. S. Trivedi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4749)


Web services providers often commit service-level agreements (SLAs) with their customers for guaranteeing the quality of the services. These SLAs are related not just to functional attributes of the services but to performance and reliability attributes as well. When combining several services into a composite service, it is non-trivial to determine, prior to service deployment, performance and reliability values of the composite service appropriately. Moreover, once the service is deployed, it is often the case that during operation it fails to meet its SLA and needs to detect what has gone wrong (i.e., performance/reliabilty bottlenecks).

To resolve these, we develop a continuous-time Markov chain (CTMC) formulation of composite services with failures. By explicitly including failure states into the CTMC representation of a service, we can compute accurately both its performance and reliability using the single CTMC. We can also detect its performance and reliability bottlenecks by applying the formal sensitivity analysis technique. We demonstrate our approach by choosing a representative example of composite Web services and providing a set of closed-form formulas for its bottleneck detection.


  1. 1.
    Blake, J., Reibman, A., Trivedi, K.: Sensitivity analysis of reliability and performability measures for multiprocessor systems. In: ACM SIGMETRICS, pp. 177–186. ACM Press, New York (1988)Google Scholar
  2. 2.
    Bolch, G., Greiner, S., de Meer, H., Trivedi, K.: Queueing networks and Markov chains: modeling and performance evaluation with computer science applications, 2nd edn. Wiley-Interscience, New York, NY, USA (2006)zbMATHGoogle Scholar
  3. 3.
    Chimento, P., Trivedi, K.: The completion time of programs on processors subject to failure and repair. IEEE Trans. Comput. 42(10), 1184–1194 (1993)CrossRefGoogle Scholar
  4. 4.
    Chowdhary, P., Bhaskaran, K., Caswell, N.S., Chang, H., Chao, T., Chen, S., Dikun, M., Lei, H., Jeng, J., Kapoor, S., Lang, C.A., Mihaila, G., Stanoi, I., Zeng, L.: Model driven development for business performance management. IBM Systems Journal 45(3) (2006)Google Scholar
  5. 5.
    Fox, J.: An R and S-Plus Companion to Applied Regression. Sage Publications, Thousand Oaks (2002)Google Scholar
  6. 6.
    Garg, S., Kintala, C., Yajnik, S., Huang, Y., Trivedi, K.: Performance and reliability evaluation of passive replication schemes in application level fault tolerance. In: the 29th Annual International Symposium on Fault-Tolerant Computing, p. 322 (1999)Google Scholar
  7. 7.
    Goseva-Popstojanova, K., Trivedi, K.: Architecture-based Approach to Reliability Assessment of Software Systems. Performance Evaluation 45(2/3), 179–204 (2001)zbMATHCrossRefGoogle Scholar
  8. 8.
    Goyal, A., Lavenberg, S., Trivedi, K.: Probabilistic Modeling of Computer System Availability. Annals of Operations Research 8, 285–306 (1987)CrossRefGoogle Scholar
  9. 9.
    Heidelberger, P., Trivedi, K.: Analytic Queueing Models for Programs with Internal Concurrency. IEEE Transactions on Computers 32(1), 73–82 (1983)zbMATHCrossRefGoogle Scholar
  10. 10.
    Hirel, C., Sahner, R., Zang, X., Trivedi, K.: Reliability and Performability Modeling using SHARPE 2000. In: Haverkort, B., Bohnenkamp, H.C., Smith, C.U. (eds.) TOOLS 2000. LNCS, vol. 1786, Springer, Heidelberg (2000)Google Scholar
  11. 11.
    Littlewood, B.: A reliability model for systems with markov structure. Applied Statistics 24(2), 172–177 (1975)CrossRefMathSciNetGoogle Scholar
  12. 12.
    Meeker, W., Escobar, L.: Statistical Methods for Reliability Data. John Wiley & Sons, West Sussex, England (1998)zbMATHGoogle Scholar
  13. 13.
    OASIS: Specification: Business Process Execution Language for Web Services (1.1) (2004)Google Scholar
  14. 14.
    Sharma, V., Trivedi, K.: Reliability and performance of component based software systems with restarts, retries, reboots and repairs. In: International Symposium on Software Reliability Engineering (2006)Google Scholar
  15. 15.
    Tobias, P., Trindade, D.: Applied Reliability, 2nd edn. Kluwer, Dordrecht (1995)Google Scholar
  16. 16.
    Towsley, D., Browne, J., Chandy, K.: Models for Parallel Processing within Programs: Application to CPU:I/O and I/O:I/O Overlap. CACM 21(10), 821–831 (1978)zbMATHGoogle Scholar
  17. 17.
    Trivedi, K.: SPNP User’s Manual Version 6.0. Duke University (September 1999)Google Scholar
  18. 18.
    Trivedi, K.: Probability and Statistics with Reliability, Queuing, and Computer Science Applications. John Wiley & Sons, West Sussex, England (2001)Google Scholar
  19. 19.
    Wang, D., Fricks, R., Trivedi, K.: Dealing with Non-Exponential Distributions in Dependability Models. In: Performance Evaluation and Perspectives, pp. 273–302 (2003)Google Scholar
  20. 20.
    Wang, D., Trivedi, K.: Modeling User-Perceived Service Availability. In: Malek, M., Nett, E., Suri, N. (eds.) ISAS 2005. LNCS, vol. 3694, pp. 107–122. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  21. 21.
    Wang, W., Choi, H., Trivedi, K.: Analysis of Conditional MTTF of Fault-Tolerant Systems. Microelectronics and Reliability 38(3), 393–401 (1998)CrossRefGoogle Scholar
  22. 22.
    Whitt, W.: The queueing network analyzer. Bell System Technical Journal 62(9), 2779–2815 (1983)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • N. Sato
    • 1
  • K. S. Trivedi
    • 2
  1. 1.IBM Research 
  2. 2.Duke University 

Personalised recommendations