Hyper-systolic implementation of BLAS-3 routines on the APE100/Quadrics machine

  • Marco Coletta
  • Thomas Lippert
  • Paolo Palazzari
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1541)


Basic Linear Algebra Subroutines (BLAS-3) [1] are building blocks to solve a lot of numerical problems (Cholesky factorization, Gram-Schmidt ortonormalization, LU decomposition,…). Their efficient implementation on a given parallel machine is a key issue for the maximal exploitation of the system’s computational power. In this work we refer to a massively parallel processing SIMD machine (the APE100/Quadrics


Parallel Machine Communication Overhead Systolic Array Peak Performance Cholesky Factorization 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    J. Choi, J.J. Dongarra, D.W. Walker: ‘The design of scalable software libraries for distributed memory concurrent computers’. J.J. Dongarra and B. Tourancheau editors. Environments and Tools for Parallel Scientific Computing. Elsevier 1982.Google Scholar
  2. 2.
    A. Bartoloni et al: ‘A hardware implementation of the APE100 architecture’. International Journal of Modern Physics C4 1993.Google Scholar
  3. 3.
    T. Lippert, P. Palazzari, K. Schilling: ‘Automatic template generation for solving n 2 problems on parallel systems with arbitrary topology’. Proceedings of the IEEE Workshop on Parallel and Distributed Software Engineering, Boston (MA) May 1997.Google Scholar
  4. 4.
    T. Lippert et al.: ‘Hyper-systolic matrix multiplication’. Proceedings of the Proceedings of PDPTA ’97, CSREA 1997.Google Scholar
  5. 5.
    T. Lippert, A. Seyfried, A. Bode, and K. Schilling: ‘Hyper-Systolic Parallel Computing’, IEEE Trans. On Parallel and Distributed Systems, Vol. 9, No. 2, February 1998.Google Scholar
  6. 6.
    P. Palazzari, T. Lippert, K. Schilling: ’simulated Annealing Techniques for communication-efficient Hyper-Systolic parallel computing on Quadrics’. Nato advance Research whorkshop on High Performance Computing-Technology and Applications, June 24–25 1996, Cetraro (Italy).Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • Marco Coletta
    • 1
  • Thomas Lippert
    • 2
  • Paolo Palazzari
    • 1
  1. 1.ENEA-HPCN Project-C.R.CasacciaRomeItaly
  2. 2.Department of PhysicsUniversity of WuppertalWuppertal(Germany)

Personalised recommendations