N.J. Boden, D. Cohen, R.E. Felderman, A.E. Kulawik, C.L. Seitz, J.N. Seizovic and W.-K. Su, Myrinet - a gigabit-per-second localarea network, IEEE Micro 15(1) (February 1995) pp. 29–36. Myricom, http://www.myri.com.Google Scholar
T.E. Anderson, D.E. Culler, D.A. Patterson and the NOW team, A case for NOW (Networks of Workstations), IEEE Micro 15(1) (February 1995) 54–64.Google Scholar
W. Gropp, E. Lusk, N. Doss and A. Skjellum, A high-performance, portable implementation of the MPI message passing interface standard, Parallel Computing 22(6) (September 1996) 789–828.Google Scholar
L. Prylli and B. Tourancheau, BIP: A new protocol designed for high performance networking on myrinet, in: Workshop PC-NOW, IPPS/SPDP98, Orlando, USA (1998).
L. Prylli, B. Tourancheau and R. Westrelin, An improved NIC program for high-performance MPI, in: International Conference on Supercomputing (ICS'99), Workshop on Cluster-based Computing, eds. N.P. Carter and S.S. Lumetta.
L. Prylli, B. Tourancheau and R. Westrelin, The design for a high performance mpi implementation on the myrinet network, in: Euro-PVM/MPI'99 (1999).
Myricom, The GM api, December 1998. http://www.myri.com/GM/ doc/gm_toc.html.
P. Geoffray, L. Prylli and B. Tourancheau, BIP-SMP: High performance message passing over a cluster of commodity SMPs, in: Supercomputing' 99 (SC99).
P. Husbands and J.C. Hoe, Mpi-start: Delivering network performance to numerical applications, in: SuperComputing (SC'98), Orlando, Florida, November 1998.
S.S. Lumetta, A.M. Mainwaring and D.E. Culler, Multi-protocol active messages on a cluster of smp's, in: SuperComputing (SC'97), University of California at Berkeley, August 1997.
T. von Eicken, Active messages: An efficient communication architecture for multiprocessors, Ph.D. thesis, University of California at Berkeley, November 1993.
A. Singhal, D. Broniarczyk, F. Cerauskis, J. Price, L. Yuan, C. Cheng, D. Doblar, S. Fosth, N. Agarwal, K. Harvey, E. Hagersten and B. Liencres, Gigaplane: A high performance bus for large smps, in: Hot Interconnects IV, Stanford, California, August 1996, pp. 41-52.
R.M. Butler and E.L. Lusk, Monitors, messages, and clusters: The p4 parallel programming system, Technical report, University of North Florida and Argonne National Laboratory, 1993. http://wwwfp. mcs.anl.gov/ lusk/p4/p4-paper/paper.html.
S.S. Lumetta and D.E. Culler, Managing concurrent access for shared memory active messages, in: International Parallel Processing Symposium, Orlando, Florida, April 1998.
X. Leroy, The linuxthreads library, June 1999. http://pauillac.inria. fr/ xleroy/linuxthreads/.
NASA, NAS Parallel Benchmark 2.3. http://science.nas.nasa.gov/ Software/NPB/.
J.J. Dongarra, Performance of various computers using standard linear equations software, Technical report CS-89-85, University of Tennessee Computer Science, 1999.
R.C. Whaley and J.J. Dongarra, Automatically tuned linear algebra software, in: SuperComputing (SC'98) (1998).
H. Brunst, CongDuc Pham and S. Fdida, Conservative simulation of load-balanced routing in a large ATM network model, in: 12th Parallel and Distributed Simulation Workshop (PADS'98), Banff, Canada, May 1998.
K. Chandy and J. Misra, Distributed simulation: A case study in design and verification of distributed programs, IEEE Trans. Software Engrg. 5(5) (1979).
J. Dongarra, Linpack benchmark-parallel, April 1999. http://performance. netlib.org/performance/html/linpack-parallel.data.col0.html.
L. Prylli, Bip user reference manual, Technical report TR97-02, LIP/ENS-LYON, Septembre 1997.