Level 3 BLAS in LU factorization on the CRAY-2, ETA-10P, and IBM 3090-200/VF
Journal Article
· International Journal of Supercomputer Applications; (USA)
- CERFACS, 42 Avenue G. Coriolis, 31057 Toulouse Cedex (FR)
- Harwell Lab., Oxon OX11 0RA (GB)
The authors study various implementations of block Gaussian elimination on full matrices and examine their performance on three vector supercomputers: the CRAY-2, the ETA-10P, and the IBM 3090-200/VF. They show that Level 3 BLAS kernels allow portability without sacrificing efficiency, and that good speeds can be obtained when tuned versions of the kernels are available. Indeed, their results show that, without using any assembler language outside the kernels, they can approach the performance of assembler-coded routines on all three machines.
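As a rough illustration of what block Gaussian elimination built on Level 3 BLAS-style kernels can look like, the following is a minimal pure-Python sketch. It is not the authors' code: it assumes a right-looking variant with no pivoting (so it is only safe for matrices such as diagonally dominant ones), and the helper names `lu_unblocked`, `trsm_lower_unit`, `trsm_upper_right`, `gemm_update`, and `block_lu` are hypothetical stand-ins for a diagonal-block factorization and the TRSM- and GEMM-type kernels.

```python
# Sketch only: right-looking block LU without pivoting, in pure Python.
# All function names are illustrative; they mimic the roles of Level 3
# BLAS kernels (TRSM, GEMM) but are not taken from the paper.

def lu_unblocked(a, k0, k1):
    """Factor the diagonal block a[k0:k1][k0:k1] in place (no pivoting)."""
    for j in range(k0, k1):
        for i in range(j + 1, k1):
            a[i][j] /= a[j][j]
            for c in range(j + 1, k1):
                a[i][c] -= a[i][j] * a[j][c]

def trsm_lower_unit(a, k0, k1, n):
    """U12 := inv(L11) * A12, with L11 unit lower triangular (TRSM-like)."""
    for c in range(k1, n):
        for i in range(k0, k1):
            for p in range(k0, i):
                a[i][c] -= a[i][p] * a[p][c]

def trsm_upper_right(a, k0, k1, n):
    """L21 := A21 * inv(U11), with U11 upper triangular (TRSM-like)."""
    for r in range(k1, n):
        for j in range(k0, k1):
            for p in range(k0, j):
                a[r][j] -= a[r][p] * a[p][j]
            a[r][j] /= a[j][j]

def gemm_update(a, k0, k1, n):
    """A22 := A22 - L21 * U12 (GEMM-like trailing-matrix update)."""
    for r in range(k1, n):
        for c in range(k1, n):
            s = 0.0
            for p in range(k0, k1):
                s += a[r][p] * a[p][c]
            a[r][c] -= s

def block_lu(a, block_size):
    """Overwrite the square matrix a (list of lists) with its L and U factors.

    L is unit lower triangular (stored below the diagonal) and U is upper
    triangular (stored on and above the diagonal).
    """
    n = len(a)
    for k0 in range(0, n, block_size):
        k1 = min(k0 + block_size, n)
        lu_unblocked(a, k0, k1)         # factor the diagonal block
        trsm_lower_unit(a, k0, k1, n)   # block row of U
        trsm_upper_right(a, k0, k1, n)  # block column of L
        gemm_update(a, k0, k1, n)       # update the trailing submatrix
    return a
```

In a sketch like this, the portable driver `block_lu` contains no machine-specific code; only the kernel bodies would be swapped for tuned library routines on a given machine. A production code would also add partial pivoting, which this sketch omits.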
- OSTI ID: 5702688
- Journal Information: International Journal of Supercomputer Applications; (USA), Vol. 3:2; ISSN 0890-2720
- Country of Publication: United States
- Language: English
Similar Records
Use of Level 3 BLAS in LU factorization in a multiprocessing environment on three vector multiprocessors: The ALLIANT FX/80, the CRAY-2, and the IBM 3090 VF
Technical Report
·
January 1, 1990
·
OSTI ID:5702688
Use of Level 3 BLAS in LU factorization in a multiprocessing environment on three vector multiprocessors: The Alliant FX/80, the CRAY-2, and the IBM 3090 VF
Journal Article
·
January 1, 1991
· International Journal of Supercomputer Applications; (United States)
·
OSTI ID:5702688
Vectorization of a multiprocessor multifrontal code
Journal Article
·
January 1, 1989
· International Journal of Supercomputer Applications; (USA)
·
OSTI ID:5702688