A complexity theory of efficient parallel algorithms

Kruskal, Clyde P.; Rudolph, Larry; Snir, Marc

doi:10.1007/3-540-19488-6_126

Clyde P. Kruskal¹,
Larry Rudolph² &
Marc Snir³

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 317))

Included in the following conference series:

International Colloquium on Automata, Languages, and Programming

277 Accesses
13 Citations

Abstract

Theoretical research on parallel algorithms has focused on NC theory. This motivates the development of parallel algorithms that are extremely fast, but possibly wasteful in their use of processors. Such algorithms seem of limited interest for real applications currently run on parallel computers. This paper explores an alternative approach that emphasizes the efficiency of parallel algorithms. We define a complexity class PE of problems that can be solved by parallel algorithms that are efficient (the speedup is proportional to the number of processors used) and polynomially faster than sequential algorithms. Other complexity classes are also defined, in terms of time and efficiency: A class that has a slightly weaker efficiency requirement than PE, and a class that is a natural generalization of NC. We investigate the relationship between various models of parallel computation, using a newly defined concept of efficient simulation. This includes new models that reflect asynchrony and high communication latency in parallel computers. We prove that the class PE is invariant across the shared memory models (PRAM's) and fully connected message passing machines. These results show that our definitions are robust. Many open problems motivated by our approach are listed.

Extended abstract

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

A. Aggarwal and A. K. Chandra, Communication complexity of PRAM's. 15th ICALP (1988).
Google Scholar
H. Alt, T. Hagerup, K. Mehlhorn, and F. P. Preparata, Deterministic simulation of idealized parallel computers on more realistic ones. SIAM Journal of Computing 16 (1987) 808–835.
Google Scholar
A. V. Aho, J. E. Hopcroft, and J. D. Ullman, The Design and Analysis of Computer Algorithms. Addison-Wesley (1974).
Google Scholar
R. J. Anderson and G. L. Miller, Optical communication for pointer based algorithms. Manuscript.
Google Scholar
M. Blum, A machine-independent theory of the complexity of recursive functions. JACM 14 (1967) 322–336.
Google Scholar
S. A. Cook, C. Dwork, and R. Reischuk, Upper and lower time bounds for parallel random access machines without simultaneous writes. SIAM Journal of Computing 15 (1986) 87–97.
Google Scholar
L. Carter and M. Wegman, New hash functions and their use in authentication and set equality. Journal of Computer and System Sciences 22 (1981) 265–279.
Google Scholar
P. W. Dymond and W. L. Ruzzo, Parallel random access machines with owned global memory and deterministic context free language recognition. 13th ICALP (1986) 95–104.
Google Scholar
S. Fortune and J. Wyllie, Parallelism in random access machines. 10th ACM STOC (1978) 114–118.
Google Scholar
A. Gottlieb, R. Grishman, C. P. Kruskal, K. P. McAuliffe, L. Rudolph, and M. Snir, The NYU Ultracomputer — designing an MIMD parallel machine. IEEE Trans. on Computers TC-32 (1983) 175–189.
Google Scholar
A. Gottlieb and C. P. Kruskal, Complexity results for permuting data and other computations on parallel processors. JACM 31 (1984) 193–209.
Google Scholar
L. Goldschlager, A unified approach to models of synchronous parallel machines. 10th ACM STOC (1978), 89–94.
Google Scholar
L. Kucera, Parallel computation and conflicts in memory accesses. IPL 14 (1982) 93–96.
Google Scholar
C. P. Kruskal, T. Madej, and L. Rudolph, Parallel prefix on fully connected direct connection machine. ICPP (1986) 278–283.
Google Scholar
C. P. Kruskal, Algorithms for replace-add based paracomputers. ICPP (1982) 219-223.
Google Scholar
C. P. Kruskal, Searching, merging, and sorting in parallel computation. IEEE Trans. on Computers TC-32 (1983) 942–946.
Google Scholar
C. P. Kruskal, L. Rudolph, and M. Snir, Efficient synchronization in multiprocessors with shared memory. 6th PODC (1986) 218–228.
Google Scholar
C. P. Kruskal, L. Rudolph, and M. Snir, A complexity theory of efficient parallel algorithms, Technical Report, IBM T. J. Watson Research Center, 1988.
Google Scholar
A. K. Karlin and E. Upfal, Parallel hashing — an efficient implementation of shared memory. 18th ACM STOC (1986) 160–168.
Google Scholar
R. M. Karp, E. Upfal, and A. Widgerson, Are search and decision problems computationally equivalent? 17th ACM STOC (1985) 464–475.
Google Scholar
G. Lev, N. Pippenger, and L. G. Valiant, A fast parallel algorithm for routing in permutation networks. IEEE Trans. on Computers TC-30 (1981) 93–100.
Google Scholar
K. Mehlhorn and U. Vishkin, Randomized and deterministic simulations of PRAMs by parallel machines with restricted granularity of parallel memories. Acta Informatica 21 (1984) 339–374.
Google Scholar
A. G. Ranade, How to emulate shared memory. 28th FOCS (1987) 185–194.
Google Scholar
J. H. Reif, Depth-first search is inherently sequential. IPL 20 (1985) 229–234.
Google Scholar
L. Rudolph, Software structures for ultraparallel computing. Ph.D. Thesis, NYU, 1982.
Google Scholar
J. Schwartz, Ultracomputers. ACM TOPLAS 2 (1980) 484–521.
Google Scholar
M. Snir, On parallel searching. SIAM Journal of Computing 14 (1985) 688–708.
Google Scholar
M. Snir, Asynchronous parallel computations. Manuscript.
Google Scholar
U. Vishkin, Implementation of simultaneous memory address access in models that forbid it. Journal of Algorithms 4 (1983) 45–50.
Google Scholar
J. S. Vitter and R. A. Simons, New classes for parallel complexity: a study of unification and other complete problems for P. IEEE Trans. on Computers TC-35 (1986) 403–418.
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Department, Inst. for Advanced Computer Studies, Univ. of Maryland, 20742, College Park, Maryland, USA
Clyde P. Kruskal
Dept. of Computer Science, The Hebrew Univ. of Jerusalem, 91904, Jerusalem, Israel
Larry Rudolph
IBM T. J. Watson Res. Center, P.O. Box 218, 10598, Yorktown Heights, NY, USA
Marc Snir

Authors

Clyde P. Kruskal
View author publications
You can also search for this author in PubMed Google Scholar
Larry Rudolph
View author publications
You can also search for this author in PubMed Google Scholar
Marc Snir
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Timo Lepistö Arto Salomaa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kruskal, C.P., Rudolph, L., Snir, M. (1988). A complexity theory of efficient parallel algorithms. In: Lepistö, T., Salomaa, A. (eds) Automata, Languages and Programming. ICALP 1988. Lecture Notes in Computer Science, vol 317. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-19488-6_126

Download citation

DOI: https://doi.org/10.1007/3-540-19488-6_126
Published: 31 May 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-19488-0
Online ISBN: 978-3-540-39291-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics