Speculative Parallelization of Partially Parallel Loops

Dang, Francis H.; Rauchwerger, Lawrence

doi:10.1007/3-540-40889-4_22

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1915)

Included in the following conference series:

International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers

Abstract

Current parallelizing compilers cannot identify a significant fraction of parallelizable loops because they have complex or statically insufficiently defined access patterns. We have previously proposed a framework for their identification. We speculatively executed a loop as a doall, and applied a fully parallel data dependence test to determine if it had any cross-processor dependences; if the test failed, then the loop was re-executed serially. While this method exploits doall parallelism well, it can cause slowdowns for loops with even one cross-processor flow dependence because we have to re-execute sequentially. Moreover, the existing, partial parallelism of loops is not exploited. In this paper we propose a generalization of our speculative doall parallelization technique, named Recursive LRPD test, that can extract and exploit the maximum available parallelism of any loop and that limits potential slowdowns to the overhead of the run-time dependence test itself, i.e., removes the time lost due to incorrect parallel execution. The asymptotic time-complexity is, for fully serial loops, equal to the sequential execution time. We present the base algorithm and an analysis of the different heuristics for its practical application. Some preliminary experimental results on loops from Track will show the performance of this new technique.
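The abstract does not spell out the algorithm, so the following is only a rough sketch of the idea it describes, written as a sequential Python simulation. It assumes that iterations are split into contiguous blocks (one per processor), that each block runs against a private write buffer, and that after each speculative stage the blocks preceding the first block with a cross-processor flow dependence are committed while the remainder is speculated again. The names `run_block`, `speculative_loop`, and `make_body` are hypothetical and do not come from the paper.

```python
def run_block(body, iterations, shared):
    """Run one block speculatively; return its private writes and exposed reads."""
    private = {}           # locations written by this block (privatized)
    exposed_reads = set()  # locations read before this block wrote them

    class View:
        def __getitem__(self, k):
            if k in private:
                return private[k]
            exposed_reads.add(k)
            return shared[k]

        def __setitem__(self, k, v):
            private[k] = v

    view = View()
    for i in iterations:
        body(i, view)
    return private, exposed_reads


def speculative_loop(body, n, shared, num_procs):
    """Execute iterations 0..n-1 in repeated speculative stages (simulated).

    Returns the number of stages used. Assumption: a block fails the test
    when it reads a location written by an earlier block of the same stage.
    """
    start, stages = 0, 0
    while start < n:
        stages += 1
        chunk = (n - start + num_procs - 1) // num_procs  # ceiling division
        blocks = [range(s, min(s + chunk, n)) for s in range(start, n, chunk)]
        # Conceptually these blocks run in parallel; here they run one by one
        # against the same pre-stage state, with no commits in between.
        results = [run_block(body, b, shared) for b in blocks]

        written = set()
        start = n  # assume every block passes until a dependence is found
        for block, (private, reads) in zip(blocks, results):
            if reads & written:       # cross-processor flow dependence
                start = block.start   # re-speculate from this block onward
                break
            for k, v in private.items():  # commit in iteration order
                shared[k] = v
            written.update(private)
    return stages


def make_body(src):
    """Hypothetical loop body: a[i] = a[src[i]] + 1, with src known only at run time."""
    def body(i, a):
        a[i] = a[src[i]] + 1
    return body


if __name__ == "__main__":
    a = [0] * 8
    stages = speculative_loop(make_body([0, 1, 2, 0, 4, 5, 3, 7]), 8, a, 4)
    print(a, stages)
```

In this sketch the first block of every stage can never fail, because no earlier block of that stage has written anything, so each stage commits at least one block and the loop always terminates. A fully parallel loop finishes in a single stage, while a loop with dependences discards only the work from the first failing block onward rather than the whole speculative run.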

Research supported in part by NSF CAREER Award CCR-9734471, NSF Grant ACI-9872126, NSF Grant EIA-9975018, DOE ASCI ASAP Level 2 Grant B347886, and a Hewlett-Packard Equipment Grant.

Author information

Authors and Affiliations

Dept. of Computer Science, Texas A&M University, College Station, TX, 77843-3112
Francis H. Dang & Lawrence Rauchwerger

Editor information

Editors and Affiliations

Department of Computer Science, University of Rochester, Rochester, NY, 14627-0226, USA
Sandhya Dwarkadas

About this paper

Cite this paper

Dang, F.H., Rauchwerger, L. (2000). Speculative Parallelization of Partially Parallel Loops. In: Dwarkadas, S. (eds) Languages, Compilers, and Run-Time Systems for Scalable Computers. LCR 2000. Lecture Notes in Computer Science, vol 1915. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40889-4_22

DOI: https://doi.org/10.1007/3-540-40889-4_22
Published: 26 July 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41185-7
Online ISBN: 978-3-540-40889-5
eBook Packages: Springer Book Archive