Cluster Computing, Volume 5, Issue 4, pp 353–363

A Software Suite for High-Performance Communications on Clusters of SMPs

  • P. Geoffray
  • C. Pham
  • B. Tourancheau


A cluster, as opposed to a parallel computer, is a set of separate workstations interconnected by a high-speed network. The performance achievable on a cluster depends heavily on the performance of the lowest communication layers. In this paper, we address the special case where the cluster contains multi-processor machines. These shared-memory multi-processor desktop machines (SMPs), with 2 or 4 processors, are becoming very popular and offer a high performance/price ratio. We present a software suite for achieving high-performance communications on a Myrinet-based cluster: BIP, BIP-SMP and MPI-BIP. The software suite supports single-processor (Intel PC and Digital Alpha) and multi-processor machines, as well as any combination of the two architectures.

Keywords: communication software, Myrinet, cluster of SMPs





Copyright information

© Kluwer Academic Publishers 2002

Authors and Affiliations

  • P. Geoffray (1)
  • C. Pham (2)
  • B. Tourancheau (3)

  1. Myricom Inc, France
  2. RESAM laboratory, University Lyon 1, France
  3. SUN Labs Grenoble, France
