The MIT Alewife Machine: A Large-Scale Distributed-Memory Multiprocessor

Agarwal, Anant; Chaiken, David; Johnson, Kirk; Kranz, David; Kubiatowicz, John; Kurihara, Kiyoshi; Lim, Beng-Hong; Maa, Gino; Nussbaum, Dan

doi:10.1007/978-1-4615-3604-8_13

Anant Agarwal³,
David Chaiken³,
Kirk Johnson³,
David Kranz³,
John Kubiatowicz³,
Kiyoshi Kurihara³,
Beng-Hong Lim³,
Gino Maa³ &
…
Dan Nussbaum³

106 Accesses
37 Citations

Abstract

The Alewife multiprocessor project focuses on the architecture and design of a large-scale parallel machine. The machine uses a low dimension direct interconnection network to provide scalable communication bandwidth, while allowing the exploitation of locality. Despite its distributed memory architecture, Alewife allows efficient shared memory programming through a multilayered approach to locality management. A new scalable cache coherence scheme called Limit LESS directories allows the use of caches for reducing communication latency and network bandwidth requirements. Alewife also employs run-time and compile-time methods for partitioning and placement of data and processes to enhance communication locality. While the above methods attempt to minimize communication latency, remote communication with distant processors cannot be completely avoided. Alewife’s processor, Sparcle, is designed to tolerate these latencies by rapidly switching between threads of computation. This paper describes the Alewife architecture and concentrates on the novel hardware features of the machine including LimitLESS directories and the rapid context switching processor.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Parallel and Distributed Computing

SCore

Message Passing or Shared Memory: Evaluating the Delegation Abstraction for Multicores

References

Sarita V. Adve and Mark D. Hill. Weak Ordering—A New Definition. In Proceedings 17th Annual International Symposium on Computer Architecture, June 1990.
Google Scholar
Anant Agarwal. Limits on Interconnection Network Performance. IEEE Transactions on Parallel and Distributed Systems, 1991. To appear.
Google Scholar
Anant Agarwal. Performance Tradeoffs in Multithreaded Processors. September 1989. MIT VLSI Memo 89-566, Laboratory for Computer Science. Submitted for publication.
Google Scholar
Anant Agarwal, Beng-Hong Lim, David A. Kranz, and John Kubiatowicz. APRIL: A Processor Architecture for Multiprocessing. In Proceedings 17th Annual International Symposium on Computer Architecture, pages 104–114, June 1990.
Google Scholar
Anant Agarwal, Richard Simoni, John Hennessy, and Mark Horowitz. An Evaluation of Directory Schemes for Cache Coherence. In Proceedings of the 15th International Symposium on Computer Architecture, IEEE, New York, June 1988.
Google Scholar
Lucien M. Censier and Paul Feautrier. A New Solution to Coherence Problems in Multicache Systems. IEEE Transactions on Computers, C-27(12):1112–1118, December 1978.
Article Google Scholar
David Chaiken, Craig Fields, Kiyoshi Kurihara, and Anant Agarwal. Directory-Based Cache-Coherence in Large-Scale Multiprocessors. IEEE Computer, June 1990.
Google Scholar
David Chaiken, John Kubiatowicz, and Anant Agarwal. LimitLESS Directories: A Scalable Cache Coherence Scheme. In Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (AS-PLOS IV). To appear., ACM, April 1991.
Google Scholar
Mathews Cherian. A Study of Backoff Barrier Synchronization in Shared-Memory Multiprocessors. Technical Report, S.M. Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, May 1989.
Google Scholar
D. R. Cheriton, H. A. Goosen,, and P. D. Boyle. ParaDIGM: A Highly Scalable Shared-Memory Multi-computer Architecture. IEEE Computer. To appear.
Google Scholar
William J. Dally. A VLSI Architecture for Concurrent Data Structures. Kluwer Academic Publishers, 1987.
Google Scholar
Michel Dubois, Christoph Scheurich, and Faye A. Briggs. Synchronization, coherence, and event ordering in multiprocessors. IEEE Computer, 9–21, February 1988.
Google Scholar
David A. Kranz et al. ORBIT: An Optimizing Compiler for Scheme, In Proceedings of SIGPLAN’ 86, Symposium on Compiler Construction, June 1986.
Google Scholar
Daniel Gajski, David Kuck, Duncan Lawrie, and Ahmed Saleh. Cedar — A Large Scale Multiprocessor. In International Conference on Parallel Processing, pages 524–529, August 1983.
Google Scholar
James R. Goodman. Using Cache Memory to Reduce Processor-Memory Traffic. In Proceedings of the 10th Annual Symposium on Computer Architecture, pages 124–131, IEEE, New York, June 1983.
Google Scholar
James R. Goodman and Philip J. Woest. The Wisconsin Multicube: A New Large Scale Cache-Coherent Multiprocessor. In Proceedings of the 15th Annual International Symposium on Computer Architecture, pages 422–431, Hawaii, June 1988.
Google Scholar
A. Gottlieb, R. Grishman, C. P. Kruskal, K. P. McAuliffe, L. Rudolph, and M. Snir. The NYU Ultracomputer — Designing a MIMD Shared-Memory Parallel Machine. IEEE Transactions on Computers, C-32(2):175–189, February 1983.
Article Google Scholar
R.H. Halstead and T. Fujita. MASA: A Multithreaded Processor Architecture for Parallel Symbolic Computing. In Proceedings of the 15th Annual International Symposium on Computer Architecture, pages 443–451, IEEE, New York, June 1988.
Chapter Google Scholar
W. D. Hillis. The Connection Machine. The MIT Press, Cambridge, MA, 1985.
Google Scholar
David V. James, Anthony T. Laundrie, Stein Gjessing, and Gurindar S. Sohi. Distributed-Directory Scheme: Scalable Coherent Interface. IEEE Computer, 74–77, June 1990.
Google Scholar
Parviz Kermani and Leonard Kleinrock. Virtual Cut-Through: A New Computer Communication Switching Technique. Computer Networks, 3:267–286, October 1979.
MathSciNet MATH Google Scholar
David A. Kranz. ORBIT: An Optimizing Compiler for Scheme. PhD thesis, Yale University, February 1988. Technical Report YALEU/DCS/RR-632.
Google Scholar
David A. Kranz, R. Halstead, and E. Mohr. Mul-T: A High-Performance Parallel Lisp. In Proceedings of SIGPLAN’ 89, Symposium on Programming Languages Design and Implementation, June 1989.
Google Scholar
James T. Kuehn and Burton J. Smith. The HORIZON Supercomputing System: Architecture and Software. In Proceedings of Supercomputing’ 88, November 1988.
Google Scholar
Kiyoshi Kurihara. Performance Evaluation of Large-Scale Multiprocessors. Technical Report, S.M. Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, September 1990.
Google Scholar
D. Lenoski, J. Laudon, K. Gharachorloo, A. Gupta, and J. Hennessy. The Directory-Based Cache Coherence Protocol for the DASH Multiprocessor. In Proceedings 17th Annual International Symposium on Computer Architecture, pages 49–58, June 1990.
Google Scholar
Gino Maa. The WAIF Intermediate Graphical Form. Oct. 1990. Alewife Memo.
Google Scholar
Eric Mohr, David A. Kranz, and Robert H. Halstead. Lazy task creation: a technique for increasing the granularity of parallel programs. In Proceedings of Symposium on Lisp and Functional Programming, June 1990.
Google Scholar
Dan Nussbaum and Anant Agarwal. Scalability of Parallel Machines. Communications of the ACM, March 1990. To appear.
Google Scholar
Brian W. O’Krafka and A. Richard Newton. An Empirical Evaluation of Two Memory-Efficient Directory Methods. In Proceedings 17th Annual International Symposium on Computer Architecture, June 1990.
Google Scholar
G. M. Papadopoulos and D.E. Culler. Monsoon: An Explicit Token-Store Architecture. In Proceedings 17th Annual International Symposium on Computer Architecture, June 1990.
Google Scholar
G. F. Pfister et al. The IBM Research Parallel Processor Prototype (RP3): Introduction and Architecture. In Proceedings ICPP, pages 764–771, August 1985.
Google Scholar
G. N. S. Prasanna. Structure Driven Multiprocessor Compilation of Numeric Problems. PhD thesis, Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 1990.
Google Scholar
Charles L. Seitz. Concurrent VLSI Architectures. IEEE Transactions on Computers, C-33(12):1247–1265, December 1984.
Article Google Scholar
B.J. Smith. Architecture and Applications of the HEP Multiprocessor Computer System. SPIE, 298:241–248, 1981.
Google Scholar
SPARC Architecture Manual. 1988. SUN Microsystems, Mountain View, California.
Google Scholar
C. K. Tang. Cache Design in the Tightly Coupled Multiprocessor System. In AFIPS Conference Proceedings, National Computer Conference, NY, NY, pages 749–753, June 1976.
Google Scholar
Charles P. Thacker and Lawrence C. Stewart. Firefly: a Multiprocessor Workstation. In Proceedings of ASPLOS II, pages 164–172, October 1987.
Google Scholar
Wolf-Dietrich Weber and Anoop Gupta. Analysis of Cache Invalidation Patterns in Multiprocessors. In Third International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS III), April 1989.
Google Scholar
Wolf-Dietrich Weber and Anoop Gupta. Exploring the Benefits of Multiple Hardware Contexts in a Multiprocessor Architecture: Preliminary Results. In Proceedings 16th Annual International Symposium on Computer Architecture, IEEE, New York, June 1989.
Google Scholar
Andrew Wilson. Hierarchical Cache/Bus Architecture for Shared Memory Multiprocessors. In Proceedings of the 14th Annual International Symposium on Computer Architecture, pages 244–252, June 1987.
Google Scholar

Download references

Author information

Authors and Affiliations

Laboratory for Computer Science, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA
Anant Agarwal, David Chaiken, Kirk Johnson, David Kranz, John Kubiatowicz, Kiyoshi Kurihara, Beng-Hong Lim, Gino Maa & Dan Nussbaum

Authors

Anant Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
David Chaiken
View author publications
You can also search for this author in PubMed Google Scholar
Kirk Johnson
View author publications
You can also search for this author in PubMed Google Scholar
David Kranz
View author publications
You can also search for this author in PubMed Google Scholar
John Kubiatowicz
View author publications
You can also search for this author in PubMed Google Scholar
Kiyoshi Kurihara
View author publications
You can also search for this author in PubMed Google Scholar
Beng-Hong Lim
View author publications
You can also search for this author in PubMed Google Scholar
Gino Maa
View author publications
You can also search for this author in PubMed Google Scholar
Dan Nussbaum
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Southern California, USA
Michel Dubois
Sequent Computer Systems, USA
Shreekant Thakkar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Agarwal, A. et al. (1992). The MIT Alewife Machine: A Large-Scale Distributed-Memory Multiprocessor. In: Dubois, M., Thakkar, S. (eds) Scalable Shared Memory Multiprocessors. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-3604-8_13

Download citation

DOI: https://doi.org/10.1007/978-1-4615-3604-8_13
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-6601-0
Online ISBN: 978-1-4615-3604-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

The MIT Alewife Machine: A Large-Scale Distributed-Memory Multiprocessor

Abstract

Access this chapter

Preview

Similar content being viewed by others

Parallel and Distributed Computing

SCore

Message Passing or Shared Memory: Evaluating the Delegation Abstraction for Multicores

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

The MIT Alewife Machine: A Large-Scale Distributed-Memory Multiprocessor

Abstract

Access this chapter

Preview

Similar content being viewed by others

Parallel and Distributed Computing

SCore

Message Passing or Shared Memory: Evaluating the Delegation Abstraction for Multicores

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation