Abstract
This paper proposes an efficient algorithm for load-sharing and fault-tolerance in Internet-based clustering systems. The algorithm creates a global scheduler based on the Weighted Factoring algorithm. And it applies an adaptive granularity strategy and the refined fixed granularity algorithm for better performance. It may also execute a partial job several times for fault-tolerance. For the simulation, the matrix multiplication using PVM is used in a Internet-based clustering system. Compared to other algorithms such as Send, GSS and Weighted Factoring, the proposed algorithm results in an improvement of performance by 55%, 63% and 20%, respectively. Also, this paper shows that it can process the fault-tolerance.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Goo, B.-G.: Refined fixed granularity algorithm on Networks of Workstations. KIPSÂ 8(2) (2001)
Hummel, S.F., Schmidt, J., Uma, R.N., Wein, J.: Load-Sharing in Heterogeneous Systems via Weighted Factoring. In: SPAA (1997)
Kee, Y., Ha, S.: A Robust Dynamic Load-Balancing Scheme for Data Parallel Application on Message Passing Architecture. In: PDPTA 1998, vol. II, pp. 974–980 (1998)
Kim, J.-S., Shim, Y.-C.: Space-Sharing Scheduling Schemes for NOW with Heterogeneous Computing Power. KISSÂ 27(7) (2000)
Piotrowski, A., Dandamudi, S.: A Comparative Study of Load Sharing on Networks of Workstations. In: Proc. Int. Conf. Parallel and Distributed computing system, New Orleans (October 1997)
Shao, G.: Adaptive Scheduling of Master/Worker Applications on Distributed Computational Resources, Ph.D. thesis, UCSD (June 2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Choi, IB., Lee, JD. (2004). An Efficient Load-Sharing and Fault-Tolerance Algorithm in Internet-Based Clustering Systems. In: Bubak, M., van Albada, G.D., Sloot, P.M.A., Dongarra, J. (eds) Computational Science - ICCS 2004. ICCS 2004. Lecture Notes in Computer Science, vol 3036. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24685-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-24685-5_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22114-2
Online ISBN: 978-3-540-24685-5
eBook Packages: Springer Book Archive