CRAY-2 Memory Organization and Interprocessor Memory Contention
Conference paper
Abstract
This paper describes a simulation study of interprocessor memory contention for a shared memory, vector multiprocessor like the CRAY-2. When programs execute together on such a system, each program’s performance, relative to its performance on a single dedicated processor, degrades because of contention among processors for shared memory. From the results of the simulation study, the paper proposes analytic forms for the asymptotic steady state behavior of throughput, time delay, and efficiency as functions of hardware parameters. The results suggest criteria for evaluating hardware designs and an index of quality for comparing different designs.
Keywords
System Time Delay System Throughput Clock Period Memory Bank Request Rate
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Preview
Unable to display preview. Download preview PDF.
References
- [1]D. H. Bailey, Vector computer memory bank contention, IEEE Trans. Computers, C-36 (1987), pp. 293–298.CrossRefGoogle Scholar
- [2]F. Baskett and A. J. Smith, Interference in multiprocessor computer systems with interleaved memory, Commun. Assoc. Comput. Mach., 19 (1976), pp. 327–334.MATHGoogle Scholar
- [3]D. P. Bhandarkar, Analysis of memory interference in multiprocessors, IEEE Trans. Computers, C-24 (1975), pp. 897–908.CrossRefGoogle Scholar
- [4]G. Birkhoff, Hydrodynamics, Princeton University Press, Princeton, New Jersey, 2nd ed., 1960.MATHGoogle Scholar
- [5]L. Brand, The pi theorem of dimensional analysis, Arch. Rat. Mech. Anal., 1 (1957), pp. 35–45.MathSciNetMATHCrossRefGoogle Scholar
- [6]P. W. Bridgman, Dimensional Analysis, Yale University Press, New Haven, 2nd ed., 1931.Google Scholar
- [7]F. A. Briggs and E. S. Davidson, Organization of semiconductor memories for parallel-pipelined processors, IEEE Trans. Computers, C-26 (1977), pp. 162–169.CrossRefGoogle Scholar
- [8]I. Y. Bucher and D. A. Calahan, Access conflicts in multiprocessor memories: Queuing models and simulation studies, Proc. 1990 International Conf. on Supercomputing, (1990), pp. 428–438.Google Scholar
- [9]I. Y. Bucher and D. A. Calahan, Models of access delays in multiprocessor memories. IEEE Trans. Par. Distr. Computing, to appear, 1992.Google Scholar
- [10]I. Y. Bucher and M. L. Simmons, Measurement of memory access contentions in multiple vector processor systems, Proc. Supercomputing ’91, (1991), pp. 806–817.Google Scholar
- [11]E. Buckingham, On physically similar systems: Illustrations of the use of dimensional equations, Physical Review, 4 (1914), pp. 345–376.CrossRefGoogle Scholar
- [12]P. Budnik and D. J. Kuck, The organization and use of parallel memories, IEEE Trans. Computers, C-20 (1971), pp. 1566–1569.CrossRefGoogle Scholar
- [13]G. J. Burnett and E. G. Coffman Jr., Analysis of interleaved memory systems using blockage buffers, Commun. Assoc. Comput. Mach., 18 (1975), pp. 91–95.MATHGoogle Scholar
- [14]D. A. Calahan, Characterization of memory conflict loading on the CRAY-2, Proc. 1988 Int. Conf. Parallel Processing, I (1988), pp. 299–302.Google Scholar
- [15]D. A. Calahan and D. H. Bailey, Measurement and analysis of memory conflicts in vector multiprocessors, in Performance Evaluation of Supercomputers, J. L. Martin, ed., Elsevier, 1988, pp. 83–106.Google Scholar
- [16]D. Y. Chang, D. J. Kuck, and D. H. Lawrie, On the effective bandwidth of parallel memories, IEEE Trans. Computers, C-26 (1977), pp. 480–489.CrossRefGoogle Scholar
- [17]T. Cheung and J. E. Smith, An analysis of the CRAY X-MP memory system, Proc. 1984 Int. Conf. Parallel Processing, (1984), pp. 613–622.Google Scholar
- [18]T. Cheung and J. E. Smith, A simulation study of the CRAY X-MP memory system, IEEE Trans. Computers, C-35 (1986), pp. 613–622.CrossRefGoogle Scholar
- [19]E. G. Coffman, Jr., G. J. Burnett, and R. A. Snowdon, On the performance of interleaved memories with multiple-word bandwidths, IEEE Trans. Computers, C-20 (1971), pp. 1570–1573.CrossRefGoogle Scholar
- [20]S. Drobot, On the foundations of dimensional analysis, Studia Mathematica, 14 (1954), pp. 84–99.MathSciNetGoogle Scholar
- [21]I. Flores, Derivation of a waiting-time factor for a multiple-bank memory, J. Assoc. Comp. Mach., 11 (1964), pp. 265–282.MathSciNetMATHCrossRefGoogle Scholar
- [22]C. M. Focken, Dimensional Methods and Their Applications, Edward Arnold and Co., London, 1953.Google Scholar
- [23]C. Fricker, On memory contention problems in vector multiprocessors, Tech. Rep. 1034, INRIA, 1989.Google Scholar
- [24]C. H. Hoogendoorn, A general model for memory interference in multiprocessors, IEEE Trans. Computers, C-26 (1977), pp. 998–1005.CrossRefGoogle Scholar
- [25]L. Kleinrock, Queueing Systems, vol. 1, John Wiley, New York, 1975.Google Scholar
- [26]D. E. Knuth and G. S. Rao, Activity in an interleaved memory, IEEE Trans. Computers, C-24 (1975), pp. 943–944.CrossRefGoogle Scholar
- [27]R. W. Numrich, CRAY-2 common memory, Tech. Rep. HN-2043, Cray Research, Inc., 1988.Google Scholar
- [28]W. Oed and O. Lange, On the effective bandwidth of interleaved memories invector processor systems, IEEE Trans. Computers, C-34 (1985), pp. 949–957.Google Scholar
- [29]W. Oed and O. Lange, On the effective bandwidth of interleaved memories in vector processor systems, Proc. 1985 Int. Conf. Parallel Processing, (1985), pp. 33–40.Google Scholar
- [30]W. Oed and O. Lange, Modelling, measurement, and simulation of memory interference in the CRAY X-MP, Parallel Computing, 3 (1986), pp. 343–358.CrossRefGoogle Scholar
- [31]K. Pawlikowski, Steady-state simulation of queueing processes: A survey of problems and solutions, ACM Computing Surveys, 22 (1990), pp. 123–170.CrossRefGoogle Scholar
- [32]C. V. Ramamoorthy and B. W. Wah, An optimal algorithm for scheduling requests on interleaved memories for a pipelined processor, IEEE Trans. Computers, C-30 (1981), pp. 787–800.CrossRefGoogle Scholar
- [33]B. R. Rau, Program behavior and the performance of interleaved memories, IEEE Trans. Computers, C-28 (1979), pp. 191–199.CrossRefGoogle Scholar
- [34]C. V. Ravi, On the bandwidth and interference in interleaved memory systems, IEEE Trans. Computers, C-21 (1972), pp. 899–901.CrossRefGoogle Scholar
- [35]K. A. Robbins and S. Robbins, Bus conflicts for logical memory banks on a CRAY Y-MP type processor system, Proc. 1991 Int. Conf. Parallel Processing, I (1991), pp. 21–24.Google Scholar
- [36]K. V. Sastry and R. Y. Kain, On the performance of certain multiprocessor computer organizations, IEEE Trans. Computers, C-24 (1975), pp. 1066–1074.CrossRefGoogle Scholar
- [37]C. E. Skinner and J. R. Asher, Effects of storage contention on system performance, IBM Syst. J., 8 (1969), pp. 319–333.CrossRefGoogle Scholar
- [38]J. E. Smith, W. C. Hsu, and C. Hsiung, Future general purpose supercomputer architectures, Proc. Supercomputing ’90, (1990), pp. 796–804.Google Scholar
- [39]B. Speelpenning and J. Nievergelt, A simple model of processor-resource utilization in networks of communicating modules, IEEE Trans. Computers, C-28 (1979), pp. 927–929.CrossRefGoogle Scholar
- [40]H. S. Stone, Parallel processing with the perfect shuffle, IEEE Trans. Computers, C-20 (1971), pp. 153–161.CrossRefGoogle Scholar
- [41]P. Tang and R. H. Mendez, Memory conflicts and machine performance, Proc. Supercomputing ’89, (1989), pp. 826–831.Google Scholar
- [42]D. W. L. Yen, J. H. Patel, and E. S. Davidson, Memory interference in synchronous multiprocessor systems, IEEE Trans. Computers, C-31 (1982), pp. 1116–1121.CrossRefGoogle Scholar
Copyright information
© Springer-Verlag New York, Inc. 1993