Two fundamental issues in multiprocessing

Arvind; Iannucci, Robert A.

doi:10.1007/3-540-18923-8_15

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 295)

Included in the following conference series:

International Seminar of the German Aerospace Research Establishment

Abstract

A general-purpose multiprocessor should be scalable, i.e., it should show higher performance when more hardware resources are added to the machine. Architects of such multiprocessors must address the loss in processor efficiency due to two fundamental issues: long memory latencies and waits due to synchronization events. It is argued that a well-designed processor can overcome these losses provided there is sufficient parallelism in the program being executed. The detrimental effect of long latency can be reduced by instruction pipelining; however, the restriction to a single thread of computation in von Neumann processors severely limits their ability to have more than a few instructions in the pipeline. Furthermore, techniques to reduce the memory latency tend to increase the cost of task switching. The cost of synchronization events in von Neumann machines makes decomposing a program into very small tasks counter-productive. Dataflow machines, on the other hand, treat each instruction as a task and, by paying a small synchronization cost for each instruction executed, offer the ultimate flexibility in scheduling instructions to reduce processor idle time.
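The interplay between latency, available parallelism, and task-switching cost described in the abstract can be illustrated with a rough first-order model. This sketch is not taken from the paper: the function `utilization`, its parameters, and the cycle counts are all hypothetical. It assumes every thread computes for a fixed number of cycles, then waits a fixed number of cycles on memory, and that switching threads costs a fixed number of cycles.

```python
def utilization(threads, run_length, latency, switch_cost=0.0):
    """Fraction of cycles spent on useful work under a first-order model.

    Assumptions (illustrative, not from the paper): every thread runs
    `run_length` cycles, then waits `latency` cycles on memory, and each
    switch to another thread costs `switch_cost` cycles.
    """
    if threads < 1 or run_length <= 0:
        raise ValueError("need at least one thread and a positive run length")
    # Linear regime: too few threads to cover the latency, so the processor idles.
    linear = threads * run_length / (run_length + switch_cost + latency)
    # Saturated regime: latency is fully hidden; only switch overhead remains.
    saturated = run_length / (run_length + switch_cost)
    return min(linear, saturated)


if __name__ == "__main__":
    for n in (1, 2, 5, 10, 20):
        free = utilization(n, run_length=10, latency=90)
        costly = utilization(n, run_length=10, latency=90, switch_cost=10)
        print(f"threads={n:2d}  free switch: {free:.2f}  costly switch: {costly:.2f}")
```

Under these assumed numbers, a single thread keeps the processor busy only 10% of the time, ten threads with free switching hide the latency completely, and a 10-cycle switch cost caps utilization at 50% no matter how many threads are available.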

This report describes research done at the Laboratory for Computer Science of the Massachusetts Institute of Technology. Funding for the Laboratory is provided in part by the Advanced Research Projects Agency of the Department of Defense under Office of Naval Research contracts N00014-83-K-0125 and N00014-84-K-0099. The second author is employed by the International Business Machines Corporation.

Author information

Authors and Affiliations

Laboratory for Computer Science, Massachusetts Institute of Technology, 02139, Cambridge, Massachusetts, USA
Arvind & Robert A. Iannucci

Editor information

Rüdiger Dierstein, Dieter Müller-Wichards, Hans-Martin Wacker

About this paper

Cite this paper

Arvind, Iannucci, R.A. (1988). Two fundamental issues in multiprocessing. In: Dierstein, R., Müller-Wichards, D., Wacker, HM. (eds) Parallel Computing in Science and Engineering. DFVLR-Seminar 1987. Lecture Notes in Computer Science, vol 295. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-18923-8_15

DOI: https://doi.org/10.1007/3-540-18923-8_15
Published: 05 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-18923-7
Online ISBN: 978-3-540-38848-7
eBook Packages: Springer Book Archive
