Cluster Computing

, Volume 5, Issue 4, pp 353–363 | Cite as

A Software Suite for High-Performance Communications on Clusters of SMPs

  • P. Geoffray
  • C. Pham
  • B. Tourancheau


A cluster, by opposition to a parallel computer, is a set of separate workstations interconnected by a high-speed network. The performances one can get on a cluster heavily depend on the performances of the lowest communication layers. In this paper, we address the special case where the cluster contains multi-processor machines. These shared-memory multi-processors desktop machines (SMPs) with 2 or 4 processors are now becoming very popular and present a high performance/price ratio. We present a software suite for achieving high-performance communications on a Myrinet-based cluster: BIP, BIP-SMP and MPI-BIP. The software suite supports single-processor (Intel PC and Digital Alpha) and multi-processor machines, as well as any combination of the two architectures.

communication software Myrinet cluster of SMPs 


  1. [1]
    N.J. Boden, D. Cohen, R.E. Felderman, A.E. Kulawik, C.L. Seitz, J.N. Seizovic and W.-K. Su, Myrinet - a gigabit-per-second localarea network, IEEE Micro 15(1) (February 1995) pp. 29–36. Myricom, Scholar
  2. [2]
    T.E. Anderson, D.E. Culler, D.A. Patterson and the NOW team, A case for NOW (Networks of Workstations), IEEE Micro 15(1) (February 1995) 54–64.Google Scholar
  3. [3]
    W. Gropp, E. Lusk, N. Doss and A. Skjellum, A high-performance, portable implementation of the MPI message passing interface standard, Parallel Computing 22(6) (September 1996) 789–828.Google Scholar
  4. [4]
    L. Prylli and B. Tourancheau, BIP: A new protocol designed for high performance networking on myrinet, in: Workshop PC-NOW, IPPS/SPDP98, Orlando, USA (1998).Google Scholar
  5. [5]
    L. Prylli, B. Tourancheau and R. Westrelin, An improved NIC program for high-performance MPI, in: International Conference on Supercomputing (ICS'99), Workshop on Cluster-based Computing, eds. N.P. Carter and S.S. Lumetta.Google Scholar
  6. [6]
    L. Prylli, B. Tourancheau and R. Westrelin, The design for a high performance mpi implementation on the myrinet network, in: Euro-PVM/MPI'99 (1999).Google Scholar
  7. [7]
    Myricom, The GM api, December 1998. doc/gm_toc.html.Google Scholar
  8. [8]
    P. Geoffray, L. Prylli and B. Tourancheau, BIP-SMP: High performance message passing over a cluster of commodity SMPs, in: Supercomputing' 99 (SC99).Google Scholar
  9. [9]
    P. Husbands and J.C. Hoe, Mpi-start: Delivering network performance to numerical applications, in: SuperComputing (SC'98), Orlando, Florida, November 1998.Google Scholar
  10. [10]
    S.S. Lumetta, A.M. Mainwaring and D.E. Culler, Multi-protocol active messages on a cluster of smp's, in: SuperComputing (SC'97), University of California at Berkeley, August 1997.Google Scholar
  11. [11]
    T. von Eicken, Active messages: An efficient communication architecture for multiprocessors, Ph.D. thesis, University of California at Berkeley, November 1993.Google Scholar
  12. [12]
    A. Singhal, D. Broniarczyk, F. Cerauskis, J. Price, L. Yuan, C. Cheng, D. Doblar, S. Fosth, N. Agarwal, K. Harvey, E. Hagersten and B. Liencres, Gigaplane: A high performance bus for large smps, in: Hot Interconnects IV, Stanford, California, August 1996, pp. 41-52.Google Scholar
  13. [13]
    R.M. Butler and E.L. Lusk, Monitors, messages, and clusters: The p4 parallel programming system, Technical report, University of North Florida and Argonne National Laboratory, 1993. http://wwwfp. lusk/p4/p4-paper/paper.html.Google Scholar
  14. [14]
    S.S. Lumetta and D.E. Culler, Managing concurrent access for shared memory active messages, in: International Parallel Processing Symposium, Orlando, Florida, April 1998.Google Scholar
  15. [15]
    X. Leroy, The linuxthreads library, June 1999. http://pauillac.inria. fr/ xleroy/linuxthreads/.Google Scholar
  16. [16]
    NASA, NAS Parallel Benchmark 2.3. Software/NPB/.Google Scholar
  17. [17]
    J.J. Dongarra, Performance of various computers using standard linear equations software, Technical report CS-89-85, University of Tennessee Computer Science, 1999.Google Scholar
  18. [18]
    R.C. Whaley and J.J. Dongarra, Automatically tuned linear algebra software, in: SuperComputing (SC'98) (1998).Google Scholar
  19. [19]
    H. Brunst, CongDuc Pham and S. Fdida, Conservative simulation of load-balanced routing in a large ATM network model, in: 12th Parallel and Distributed Simulation Workshop (PADS'98), Banff, Canada, May 1998.Google Scholar
  20. [20]
    K. Chandy and J. Misra, Distributed simulation: A case study in design and verification of distributed programs, IEEE Trans. Software Engrg. 5(5) (1979).Google Scholar
  21. [21]
    J. Dongarra, Linpack benchmark-parallel, April 1999. http://performance. Scholar
  22. [22]
    L. Prylli, Bip user reference manual, Technical report TR97-02, LIP/ENS-LYON, Septembre 1997.Google Scholar

Copyright information

© Kluwer Academic Publishers 2002

Authors and Affiliations

  • P. Geoffray
    • 1
  • C. Pham
    • 2
  • B. Tourancheau
    • 3
  1. 1.Myricom IncFrance
  2. 2.RESAM laboratoryUniversity Lyon 1France
  3. 3.SUN Labs GrenobleFrance

Personalised recommendations