skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Level 3 blas in LU factorization on the CRAY-2, ETA-10P, and IBM 3090-200/VF

Journal Article · · International Journal of Supercomputer Application; (USA)
 [1];  [2]
  1. Cerfacs, 42 Avenue G. Coriolis, 31057 Toulouse Cedex (FR)
  2. Harwell Lab., Oxon 0X11 ORA (GB)

The authors study various implementations of block Gaussian elimination on full matrices and examine their performance on three vector supercomputers, the CRAY-2, the ETA-10P, and the IBM 3090-200/VF. They show that the use of Level 3 BLAS kernels allows portability without sacrifice of efficiency and that good speeds can be obtained if tuned versions of the kernels are available. Indeed our results show that without using any assembler language outside the kernels they can approach the performance of assembler-coded routines on all machines.

OSTI ID:
5702688
Journal Information:
International Journal of Supercomputer Application; (USA), Vol. 3:2; ISSN 0890-2720
Country of Publication:
United States
Language:
English