Program optimization for concurrent multithreaded architectures

Tsait, Jenn-Yuan; Jiang, Zhenzhen; Yew, Pen-Chung

doi:10.1007/BFb0032689

Jenn-Yuan Tsait¹,
Zhenzhen Jiang² &
Pen-Chung Yew²

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1366))

Included in the following conference series:

International Workshop on Languages and Compilers for Parallel Computing

98 Accesses

Abstract

This paper presents some compiler and program transformation techniques for concurrent multithreaded architectures, in particular the superthreaded architecture [9], which adopts a thread pipelining execution model that allows threads with data dependences and control dependences to be executed in parallel. In this paper, we identify several important program analysis and transformation techniques that allow the superthreaded architecture to exploit more parallelism in programs with less run-time overhead. We evaluate the performance of the superthreaded architecture and the effectiveness of the program transformation techniques by manually compiling several benchmark programs and running them through a trace-driven, cycle-by-cycle superthreaded processor simulator. The simulation results show that a superthreaded processor can achieve promising speedups for most of the benchmark programs with the proposed program transformation techniques applied.

This work is supported in part by the National Science Foundation under Grant No. MIP 9610379; by the U.S. Army Intelligence Center and Fort Huachuca under Contract DABT63-95-C-0127 and ARPA order No. D 346; and by a gift from Intel Corporation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ding-Kai Chen and Pen-Chung Yew. Statement reordering for doacross loops. In Proceedings of International Conference on Parallel Processing, volume Vol. II, August 1994.
Google Scholar
Pradeep K. Dubey, Kevin O'Brien, Kathryn O'Brien, and Charles Barton. Singleprogram speculative multithreading (SPSM) architecture: Compiler-assisted finegrained multithreading. In Proceedings of the IFIP WG 10.3 Working Conference on Parallel Architectures and Compilation Techniques, PACT '95, pages 109–121, June 27–29, 1995.
Google Scholar
Marco Fillo, Stephen W. Keckler, William J. Dally, Nicholas P. Carter, Andrew Chang, Yevgeny Gurevich, and Whay S. Lee. The m-machine multicomputer. In Proceedings of the 28th Annual International Symposium on Microarchitecture, pages 146–156, November 29-December 1, 1995.
Google Scholar
Manoj Franklin and Gurindar S. Sohi. The expandable split window paradigm for exploiting fine-grained parallelism. In Proceedings of the 19th Annual International Symposium on Computer Architecture, pages 58–67, May 19–21, 1992.
Google Scholar
Hiroaki Hirata, Kozo Kimura, Satoshi Nagamine, Yoshiyuki Mochizuki, Akio Nishimura, Yoshimori Nakase, and Teiji Nishizawa. An elementary processor architecture with simultaneous instruction issuing from multiple threads. In Proceedings of the 19th Annual International Symposium on Computer Architecture, pages 136–145, May 19–21, 1992.
Google Scholar
Zhiyuan Li. Array privatization for parallel execution of loops. In Proceedings of the 6th ACM International Conference on Supercomputing, pages 313–322, July 1992.
Google Scholar
M. D. Smith. Tracing with pixie. Technical report, Stanford University, Stanford, California 94305, November 1991. Technical Report CSL-TR-91-497.
Google Scholar
Gurindar S. Sohi, Scott E. Breach, and T. N. Vijaykumar. Multiscalar processors. In Proceedings of the 22nd Annual International Symposium on Computer Architecture, pages 414–425, June 22–24, 1995.
Google Scholar
Jenn-Yuan Tsai and Pen-Chung Yew. The superthreaded architecture: Thread pipelining with run-time data dependence checking and control speculation. In Proceedings of the 1996 Conference on Parallel Architectures and Compilation Techniques, PACT '96, pages 35–46, October 20–23, 1996.
Google Scholar
Dean M. Tullsen, Susan J. Eggers, and Henry M. Levy. Simultaneous multithreading: Maximizing on-chip parallelism. In Proceedings of the 22nd Annual International Symposium on Computer Architecture, pages 392–403, June 22–24, 1995.
Google Scholar
Michael J. Wolfe. Optimizing supercompilers for supercomputers. Technical report, University of Illinois, October 1982. Technical Report UIUCDCS-R-82-1105.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Illinois, 61801, Urbana, IL
Jenn-Yuan Tsait
Department of Computer Science, University of Minnesota, 55455, Minneapolis, MN
Zhenzhen Jiang & Pen-Chung Yew

Authors

Jenn-Yuan Tsait
View author publications
You can also search for this author in PubMed Google Scholar
Zhenzhen Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Pen-Chung Yew
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Zhiyuan Li Pen-Chung Yew Siddharta Chatterjee Chua-Huang Huang P. Sadayappan David Sehr

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tsait, JY., Jiang, Z., Yew, PC. (1998). Program optimization for concurrent multithreaded architectures. In: Li, Z., Yew, PC., Chatterjee, S., Huang, CH., Sadayappan, P., Sehr, D. (eds) Languages and Compilers for Parallel Computing. LCPC 1997. Lecture Notes in Computer Science, vol 1366. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0032689

Download citation

DOI: https://doi.org/10.1007/BFb0032689
Published: 09 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64472-9
Online ISBN: 978-3-540-69788-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics