PB-BLAS: A set of parallel block basic linear algebra subprograms
- Tennessee Univ., Knoxville, TN (United States). Dept. of Computer Science
- Oak Ridge National Lab., TN (United States)
We propose a new library of routines for performing dense linear algebra computations on block-partitioned matrices. The routines are referred to as the Block Basic Linear Algebra Subprograms (block BLAS). Their use is restricted to computations in which one or more of the matrices involved consists of a single row or column of blocks, and in which no more than one of the matrices consists of an unrestricted two-dimensional array of blocks. The functionality of the block BLAS routines can also be provided by Level 2 and Level 3 BLAS routines. However, on Non-Uniform Memory Access machines the block BLAS permit certain optimizations in memory access. This is particularly true for distributed-memory machines, for which the block BLAS are referred to as the Parallel Block Basic Linear Algebra Subprograms (PB-BLAS). The PB-BLAS are the main focus of this paper. For a block-cyclic data distribution, a single row or column of blocks lies in a single row or column of the processor template. The PB-BLAS consist of calls to the sequential BLAS for local computations and calls to the BLACS for communication. The PB-BLAS are the building blocks for implementing ScaLAPACK, the distributed-memory version of LAPACK, and provide the same ease of use and portability for ScaLAPACK that the BLAS provide for LAPACK. The PB-BLAS consist of all nine Level 3 BLAS routines, four of the Level 2 BLAS routines, and two auxiliary transpose routines. The PB-BLAS are currently available for all numeric data types, i.e., single and double precision real and complex.
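The claim that a single row or column of blocks lies in a single row or column of the processor template can be illustrated with a minimal sketch. It assumes the usual two-dimensional block-cyclic mapping with the first block placed on process (0, 0). The function names and the grid and matrix sizes are hypothetical and are not taken from the PB-BLAS interface.

```python
# Illustrative sketch only: these helpers are hypothetical and are not
# part of the PB-BLAS, BLACS, or ScaLAPACK interfaces.

def block_owner(block_row, block_col, grid_rows, grid_cols):
    """Return (process_row, process_col) owning a block.

    Assumes a 2D block-cyclic distribution in which block (0, 0)
    is stored on process (0, 0), i.e. zero source offsets.
    """
    return (block_row % grid_rows, block_col % grid_cols)


def owners_of_block_row(block_row, num_block_cols, grid_rows, grid_cols):
    """Collect the set of processes that hold any block of one block row."""
    return {
        block_owner(block_row, j, grid_rows, grid_cols)
        for j in range(num_block_cols)
    }


if __name__ == "__main__":
    # Assumed example: a 2 x 3 process template and a block row
    # with index 3 that spans 8 block columns.
    owners = owners_of_block_row(3, 8, grid_rows=2, grid_cols=3)
    process_rows = {p for p, _ in owners}
    print("owning processes:", sorted(owners))
    print("process rows involved:", sorted(process_rows))
```

Under these assumptions, every block in the chosen block row maps to the same process row, because the process row index depends only on the block row index. Communication for such an operand can therefore be confined to one row (or, by symmetry, one column) of the template.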
- Research Organization:
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE, Washington, DC (United States); Department of Defense, Washington, DC (United States)
- DOE Contract Number:
- AC05-84OR21400
- OSTI ID:
- 78703
- Report Number(s):
- CONF-9405100-14; ON: DE95012872; CNN: DAAL03-91-C-0047
- Resource Relation:
- Conference: Scalable high performance computing conference, Knoxville, TN (United States), 23 May 1994; Other Information: PBD: [1994]
- Country of Publication:
- United States
- Language:
- English
Similar Records
The design of linear algebra libraries for high performance computers
LINPACK; simultaneous linear algebraic equations. [IBM360,370 (designed to be machine-independent); FORTRAN]