The Session Based Fault Tolerance Algorithm of Platform EGO Web Service Gateway
Although grid computing has adopted Web services technology to deal with platforms heterogeneity and to enhance service and application interoperability, it is still a challenge to build web service applications with high reliability and availability to meet the requirements of grid communities. The paper discusses the design of Platform EGO WSG with high reliability. To support a huge user base and reduce the response time, WSGs work in cluster model and the loads are dynamic balanced among them. Besides, a lightweight notification mechanism is implemented to provide better interoperability between WSG and WSCs. Moreover, we designed a session-based a-synchronized recovery algorithm to achieve WSG fault tolerance, which has short freezing time and is able to isolate the recovery process for each WSC. This approach can rebuild the service sessions and the notification mechanism after restart, to handle Notification failure, and WSG failure report, etc.
KeywordsGrid web service gateway fault tolerance session load balance
Unable to display preview. Download preview PDF.
- 1.I. Foster (2006) Globus Toolkit Version 4: Software for Service-Oriented Systems. IFIP International Conference on Network and Parallel Computing, Springer-Verlag LNCS 3779, pp 2–13.Google Scholar
- 2.Aghdaie, N., Tamir, Y. (2002) Implementation and Evaluation ofTransparent Fault-Tolerant Web Service with Kernel-Level Support.Proceedings of the IEEE International Conference on ComputerCommunications and Networks, pp 63–68.Google Scholar
- 3.Dialani, V., Miles, S., Moreau, et al. (2002) Transparent FaultTolerance for Web Services Based Architectures. Proceedings of 8thInternational Euro-Par Conference on Parallel Processing, Paderborn,Germany Proceedings. Volume 2400.Google Scholar
- 4.G. Teixeira Santos, L. Cheuk Lung, C. Montez (2005) FTWeb: A FaultTolerant Infrastructure for Web Services. Proceedings of the 2005Ninth IEEE International EDOC Enterprise Computing Conference(EDOC’05).Google Scholar
- 5.Alvisi, L. Bressoud, T.C. El-Khashab, and et al. (2001) Wrappingserver-side TCP to mask connection failures. Proceedings of INFOCOM2001.Twentieth Annual Joint Conference of the IEEE Computer andCommunications Societies.Google Scholar
- 6.P. Townend and J. Xu (2004) “Replication-based Fault Tolerance in a Grid Environment”, in Proceedings of U.K. e-Science 3rd All-HandsMeeting, Simon J. Cox Eds., Nottingham Conference Center, U.K., 31st August - 3rd September, 2004, ISBN 1-904425-21-6..Google Scholar
- 7.Condor Team, University of Wisconsin-Madison (2002)Condor Version 6.8.2 Manual, http://www.cs.wisc.edu/condor/manual/v6.8/ref.html. Accessed January 2006.
- 8.A. Nadalin IBM, C. Kaler Microsoft, R. Monzillo Sun, et al. (2006)wss-v1.1-os-UsernameTokenProfile. http://docs.oasis-open.org/wss/v1.1/. Accessed May 2006.
- 9.S. Graham, P. Niblett, D. Chappell et al (2004) Publish-SubscribeNotification for Web services, 1.0. http://www-128.ibm.com/developerworks/library/ws-pubsub/WS -PubSub.pdf Accessed May 2006.