On synchronisation in fault-tolerant data and compute intensive programs over a network of workstations

Smith, J.

doi:10.1007/BFb0002847

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1300)

Included in the following conference series:

European Conference on Parallel Processing

Abstract

An application structured as a fault-tolerant bag of tasks adapts easily to changing resources. To be represented by a single bag of tasks, however, a computation must decompose into purely independent tasks. The work summarised here investigates the performance of structuring approaches applicable where this ideal is not possible, partly through analysis and partly through measurements of a realistic fault-tolerant computation.
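
As an illustration of the bag-of-tasks idea mentioned in the abstract, the following is a minimal single-process sketch, not the author's implementation. It assumes that a task leaves the bag for good only once its result has been recorded, so a task whose worker crashes simply returns to the bag and is retried. The names `run_bag_of_tasks`, `compute` and `fails_on` are hypothetical, and the crash is simulated rather than real.

```python
from collections import deque


def run_bag_of_tasks(tasks, compute, fails_on=()):
    """Process independent tasks from a shared bag, retrying after simulated crashes.

    tasks    -- iterable of (task_id, payload) pairs; tasks must be independent
    compute  -- pure function mapping a payload to a result
    fails_on -- task ids whose first attempt simulates a worker crash
                (hypothetical parameter, used only for illustration)
    """
    bag = deque(tasks)
    pending_failures = set(fails_on)
    results = {}
    attempts = 0
    while bag:
        task_id, payload = bag.popleft()
        attempts += 1
        if task_id in pending_failures:
            # Simulated crash: no result is recorded, so the task goes
            # back into the bag for a later attempt.
            pending_failures.discard(task_id)
            bag.append((task_id, payload))
            continue
        # Recording the result is what finally retires the task.
        results[task_id] = compute(payload)
    return results, attempts


# Five independent tasks; the first attempt at task 2 is lost and retried.
results, attempts = run_bag_of_tasks(
    [(i, i) for i in range(5)], lambda x: x * x, fails_on={2}
)
print(results, attempts)
```

Because the tasks share no state, the order in which they complete does not matter. That property is exactly what is lost when a computation needs synchronisation between tasks, which is the case the paper investigates.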

Author information

Authors and Affiliations

Department of Computing Science, The University of Newcastle upon Tyne, NE1 7RU, Newcastle upon Tyne, UK
J. Smith

Editor information

Christian Lengauer, Martin Griebl, Sergei Gorlatch

About this paper

Cite this paper

Smith, J. (1997). On synchronisation in fault-tolerant data and compute intensive programs over a network of workstations. In: Lengauer, C., Griebl, M., Gorlatch, S. (eds) Euro-Par'97 Parallel Processing. Euro-Par 1997. Lecture Notes in Computer Science, vol 1300. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0002847

DOI: https://doi.org/10.1007/BFb0002847
Published: 26 September 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63440-9
Online ISBN: 978-3-540-69549-3
eBook Packages: Springer Book Archive
