Abstract
With the ever-growing use of the Internet, Web services have become increasingly popular, and their growth rate surpasses even the most optimistic predictions. Services are self-descriptive, self-contained, platform-independent, and openly available components that interact over the network. They are written strictly according to open specifications and/or standards and provide important, often critical, functions for many business-to-business systems. Failures in such systems, whether they cause service downtime or produce invalid results, may range from a mere inconvenience to significant monetary penalties or even loss of human lives. In applications where sensing and control of machines and other devices take place via services, making the services highly dependable is one of the main critical goals. Currently, there is no experimental investigation that evaluates the reliability and availability of Web services systems. In this paper, we identify parameters that affect Web services dependability, describe methods of enhancing dependability through redundancy in space and redundancy in time, and perform a series of experiments to evaluate the availability of Web services. To increase the availability of a Web service, we use several replication schemes and compare them with a single service. The Web services are coordinated by a replication manager. The replication algorithm and the detailed system configuration are described in this paper.
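To make the two forms of redundancy named in the abstract concrete, the following minimal Python sketch combines them: each replica is retried a few times (redundancy in time) before the caller moves on to the next replica (redundancy in space). It is an illustration only and not the replication algorithm of the paper, which is not reproduced on this page. The names `invoke_with_redundancy`, `AllReplicasFailed`, and `parallel_availability` are hypothetical, and the availability formula assumes that replicas fail independently and that one working replica is enough.

```python
from typing import Callable, Sequence


class AllReplicasFailed(Exception):
    """Raised when no replica returns a result (hypothetical name)."""


def invoke_with_redundancy(replicas: Sequence[Callable[[str], str]],
                           request: str,
                           retries_per_replica: int = 2) -> str:
    """Try replicas in order (redundancy in space), retrying each one
    before moving on to the next (redundancy in time)."""
    errors = []
    for index, replica in enumerate(replicas):
        # One initial attempt plus the configured number of retries.
        for attempt in range(1 + retries_per_replica):
            try:
                return replica(request)
            except Exception as exc:
                # Assumption: any exception counts as a service failure.
                errors.append((index, attempt, exc))
    raise AllReplicasFailed(errors)


def parallel_availability(single_availability: float, n: int) -> float:
    """Availability of n replicas when any one of them suffices.

    Assumes independent failures, which real deployments may violate.
    """
    return 1.0 - (1.0 - single_availability) ** n
```

Under the independence assumption, three replicas that are each available 90% of the time give `parallel_availability(0.9, 3)`, which is approximately 0.999. Correlated failures, such as a shared network link, would lower that figure.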
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chan, P.P.W., Lyu, M.R., Malek, M. (2006). Making Services Fault Tolerant. In: Penkler, D., Reitenspiess, M., Tam, F. (eds) Service Availability. ISAS 2006. Lecture Notes in Computer Science, vol 4328. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11955498_4
DOI: https://doi.org/10.1007/11955498_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68724-5
Online ISBN: 978-3-540-68725-2
eBook Packages: Computer Science, Computer Science (R0)