Skip to main content

Making Services Fault Tolerant

  • Conference paper
Service Availability (ISAS 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4328))

Included in the following conference series:

Abstract

With ever growing use of Internet, Web services become increasingly popular and their growth rate surpasses even the most optimistic predictions. Services are self-descriptive, self-contained, platform-independent and openly-available components that interact over the network. They are written strictly according to open specifications and/or standards and provide important and often critical functions for many business-to-business systems. Failures causing either service downtime or producing invalid results in such systems may range from a mere inconvenience to significant monetary penalties or even loss of human lives. In applications where sensing and control of machines and other devices take place via services, making the services highly dependable is one of main critical goals. Currently, there is no experimental investigation to evaluate the reliability and availability of Web services systems. In this paper, we identify parameters impacting the Web services dependability, describe the methods of dependability enhancement by redundancy in space and redundancy in time and perform a series of experiments to evaluate the availability of Web services. To increase the availability of the Web service, we use several replication schemes and compare them with a single service. The Web services are coordinated by a replication manager. The replication algorithm and the detailed system configuration are described in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bilorusets, R., Bosworth, A., et al.: Web Services Reliable Messaging Protocol WS-ReliableMessaging, EA, Microsoft, IBM and TIBCO Software (March 2004), http://msdn.microsoft.com/library/enus/dnglobspec/html/ws-reliablemessaging.asp

  2. Looker, N., Munro, M.: WS-FTM: A Fault Tolerance Mechanism for Web Services, University of Durham, Technical Report, March 19 (2005)

    Google Scholar 

  3. Liang, D., Fang, C., Chen, C.: FT-SOAP: A Fault-tolerant Web Service, Institute of Information Science, Academia Sinica, Technical Report (2003)

    Google Scholar 

  4. Liang, D., Fang, C., Yuan, S.: A Fault-Tolerant Object Service on CORBA. Journal of Systems and Software 48, 197–211 (1999)

    Article  Google Scholar 

  5. Townend, P., Groth, P., Looker, N., Xu, J.: Ft-grid: A fault-tolerance system for e-science. In: Proc. of the UK OST e-Science Fourth All Hands Meeting (AHM 2005) (September 2005)

    Google Scholar 

  6. Merideth, M., Iyengar, A., Mikalsen, T., Tai, S., Rouvellou, I., Narasimhan, P.: Thema: Byzantine-Fault-Tolerant Middleware for Web-Service Application. In: Proc. of IEEE Symposium on Reliable Distributed Systems, Orlando, FL (October 2005)

    Google Scholar 

  7. Erradi, A., Maheshwari, P.: A broker-based approach for improving Web services reliability. In: Proc. of IEEE International Conference on Web Services, July 11-15, vol. 1, pp. 355–362 (2005)

    Google Scholar 

  8. Tsai, W., Cao, Z., Chen, Y.Y, Paul, R.: Web services-based collaborative and cooperative computing. In: Proc. of Autonomous Decentralized Systems, April 4-8, pp. 552–556 (2005)

    Google Scholar 

  9. Leu, D., Bastani, F., Leiss, E.: The effect of statically and dynamically replicated components on system reliability. IEEE Transactions on Reliability 39(2), 209–216 (1990)

    Article  MATH  Google Scholar 

  10. Kim, B.: Reliability analysis of real-time controllers with dual-modular temporal redundancy. In: Proc. of the Sixth International Conference on Real-Time Computing Systems and Applications (RTCSA 1999), December 13-15, pp. 364–371 (1999)

    Google Scholar 

  11. Shen, K., Xie, M.: On the increase of system reliability by parallel redundancy. IEEE Transactions on Reliability 39(5), 607–611 (1990)

    Article  MATH  Google Scholar 

  12. Avizienis, A., Chen, L.: On the implementation of N-version programming for software fault-tolerance during program execution. In: Proc. of First International Computer Software and Applications Conference, pp. 149–155 (1977)

    Google Scholar 

  13. Avizienis, A., Kelly, J.: Fault Tolerance by Design Diversity: Concepts and Experiments. IEEE Transactions on Computer, 67–80 (August 1984)

    Google Scholar 

  14. Lyu, M.R., Avizienis, A.: Assuring Design Diversity in N-Version Software: A Design Paradigm for N-Version Programming. In: Pham, H. (ed.) Fault-Tolerant Software Systems: Techniques and Applications. IEEE Computer Society Press Technology Series, pp. 45–54. IEEE Computer Society Press, Los Alamitos (1992)

    Google Scholar 

  15. Lala, J., Harper, R.: Architectural principles for safety-critical real-time applications. Proc. of the IEEE 82(1), 25–40 (1994)

    Article  Google Scholar 

  16. Riter, R.: Modeling and Testing a Critical Fault-Tolerant Multi-Process System. In: Proc. the 25th International Symposium on Fault-Tolerant Computing, pp. 516–521 (1995)

    Google Scholar 

  17. Lyu, M., Mendiratta, V.: Software Fault Tolerance in a Clustered Architecture: Techniques and Reliability Modeling. In: Proc. of 1999 IEEE Aerospace Conference, Snowmass, Colorado, March 6-13, vol. 5, pp. 141–150 (1999)

    Google Scholar 

  18. Sayal, M., Breitbart, Y., Scheuermann, P., Vingralek, R.: Selection algorithms for replicated web servers. In: Proc. of Workshop on Internet Server Performance 1998, Madison, WI (June 1998)

    Google Scholar 

  19. Looker, N., Munro, M., Xu, J.: Simulating Errors in Web Services. International Journal of Simulation: Systems, Science and Technology 5, 29–38 (2004)

    Google Scholar 

  20. Yan, Y., Liang, Y., Du, X.: Controlling remote instruments using Web services for online experiment systems. In: Proc. of IEEE International Conference on Web Services (ICWS 2005), July 11-15 (2005)

    Google Scholar 

  21. Yan, Y., Liang, Y., Du, X.: Distributed and collaborative environment for online experiment system using Web services. In: Proc. the Ninth International Conference on Computer Supported Cooperative Work in Design 2005, May 24-26, vol. 1, pp. 265–270 (2005)

    Google Scholar 

  22. Looker, N., Xu, J.: Assessing the Dependability of SOAP-RPC-Based Web Services by Fault Injection. In: Proc. of the 9th IEEE International Workshop on Object-oriented Real-time Dependable Systems, pp. 163–170 (2003)

    Google Scholar 

  23. Goseva-Popstojanova, K., Trivedi, K.: Failure correlation in software reliability models. IEEE Transactions on Reliability 49(1), 37–48 (2000)

    Article  Google Scholar 

  24. Guen, H., Marie, R., Thelin, T.: Reliability estimation for statistical usage testing using Markov chains. In: Proc. of the 15th International Symposium on Software Reliability Engineering (ISSRE 2004), November 2-5, pp. 54–65 (2004)

    Google Scholar 

  25. Sahner, R., Trivedi, K., Puliafito, A.: Performance and Reliability Analysis of Computer Systems. An Example-BasedApproach Using the SHARPE Software Package. Kluwer, Boston (1996)

    Google Scholar 

  26. Lyu, M.: Handbook of Software Reliability Engineering. IEEE Computer Society Press and McGraw-Hill Book Company

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chan, P.P.W., Lyu, M.R., Malek, M. (2006). Making Services Fault Tolerant. In: Penkler, D., Reitenspiess, M., Tam, F. (eds) Service Availability. ISAS 2006. Lecture Notes in Computer Science, vol 4328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11955498_4

Download citation

  • DOI: https://doi.org/10.1007/11955498_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68724-5

  • Online ISBN: 978-3-540-68725-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics