U.S. Department of Energy
Office of Scientific and Technical Information

Level 3 BLAS in LU factorization on the CRAY-2, ETA-10P, and IBM 3090-200/VF

Journal Article · International Journal of Supercomputer Application; (USA)
 [1];  [2]
  1. Cerfacs, 42 Avenue G. Coriolis, 31057 Toulouse Cedex (FR)
  2. Harwell Lab., Oxon OX11 0RA (GB)

The authors study various implementations of block Gaussian elimination on full matrices and examine their performance on three vector supercomputers: the CRAY-2, the ETA-10P, and the IBM 3090-200/VF. They show that the use of Level 3 BLAS kernels allows portability without sacrificing efficiency, and that good speeds can be obtained if tuned versions of the kernels are available. Indeed, their results show that, without using any assembler language outside the kernels, they can approach the performance of assembler-coded routines on all three machines.
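As an illustration of the idea summarized above, the following is a minimal sketch of one right-looking variant of block Gaussian elimination. It is not the authors' code, and the abstract does not say which block variants were studied. The function name `block_lu`, the block-size parameter `nb`, and the omission of pivoting are all assumptions made for brevity, so the sketch is only safe on matrices that need no row interchanges, such as diagonally dominant ones. The plain loops in steps 2 and 3 stand in for the Level 3 BLAS operations TRSM (triangular solve with multiple right-hand sides) and GEMM (matrix-matrix multiply), which is where a tuned kernel would be called.

```python
def block_lu(a, nb):
    """Blocked LU factorization, in place, WITHOUT pivoting (assumption).

    `a` is a square matrix stored as a list of row lists; `nb` is the
    block size. On return, the strict lower triangle holds L (unit
    diagonal implied) and the upper triangle holds U.
    """
    n = len(a)
    for k in range(0, n, nb):
        e = min(k + nb, n)  # end of the current block column

        # Step 1: unblocked factorization of the panel a[k:n][k:e].
        for j in range(k, e):
            for i in range(j + 1, n):
                a[i][j] /= a[j][j]
                for c in range(j + 1, e):
                    a[i][c] -= a[i][j] * a[j][c]

        # Step 2 (TRSM-like): U12 = inv(L11) * A12, L11 unit lower triangular.
        for j in range(k, e):
            for i in range(j + 1, e):
                for c in range(e, n):
                    a[i][c] -= a[i][j] * a[j][c]

        # Step 3 (GEMM-like): A22 = A22 - L21 * U12.
        for i in range(e, n):
            for c in range(e, n):
                s = 0.0
                for j in range(k, e):
                    s += a[i][j] * a[j][c]
                a[i][c] -= s
    return a
```

In this arrangement most of the arithmetic falls in step 3 when `nb` is small relative to the matrix order, which is why the speed of the matrix-matrix multiply kernel largely determines the speed of the whole factorization.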

OSTI ID:
5702688
Journal Information:
International Journal of Supercomputer Application; (USA), Vol. 3:2; ISSN 0890-2720; ISSN IJSAE
Country of Publication:
United States
Language:
English

Similar Records

Use of Level 3 BLAS in LU factorization in a multiprocessing environment on three vector multiprocessors: the Alliant FX/80, the CRAY-2, and the IBM 3090 VF
Journal Article · Mon Dec 31 23:00:00 EST 1990 · International Journal of Supercomputer Applications; (United States) · OSTI ID:5545213

Use of Level 3 BLAS in LU factorization in a multiprocessing environment on three vector multiprocessors: The ALLIANT FX/80, the CRAY-2, and the IBM 3090 VF
Technical Report · Sun Dec 31 23:00:00 EST 1989 · OSTI ID:5217043

Vectorization of a multiprocessor multifrontal code
Journal Article · Sat Dec 31 23:00:00 EST 1988 · International Journal of Supercomputer Application; (USA) · OSTI ID:5422748