Parallel matrix transpose algorithms on distributed memory concurrent computers
- Tennessee Univ., Knoxville, TN (United States)
- Oak Ridge National Lab., TN (United States)
This paper describes parallel matrix transpose algorithms on distributed memory concurrent processors. We assume that the matrix is distributed over a P × Q processor template with a block scattered data distribution. P, Q, and the block size can be arbitrary, so the algorithms have wide applicability. The algorithms make use of nonblocking, point-to-point communication between processors. The use of nonblocking communication allows a processor to overlap the messages that it sends to different processors, thereby avoiding unnecessary synchronization. Combined with the matrix multiplication routine, C = A⋅B, the algorithms are used to compute parallel multiplications of transposed matrices, C = Aᵀ⋅Bᵀ, in the PUMMA package. Details of the parallel implementation of the algorithms are given, and results are presented for runs on the Intel Touchstone Delta computer.
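Under a block scattered (block-cyclic) distribution, global block (I, J) is typically assigned to the processor at template coordinates (I mod P, J mod Q); a transpose moves that block to position (J, I), which in general lives on a different processor, and this mismatch is what drives the communication pattern. A minimal sketch of that ownership mapping, assuming the conventional block-cyclic assignment (function names are illustrative, not from the paper):

```python
def owner(I, J, P, Q):
    """Processor coordinates owning global block (I, J) under a
    block-scattered (block-cyclic) distribution on a P x Q template."""
    return (I % P, J % Q)

def transpose_target(I, J, P, Q):
    """After transposing, block (I, J) becomes block (J, I);
    return the processor that must receive it."""
    return owner(J, I, P, Q)

# Example: a 2 x 3 processor template.
P, Q = 2, 3
# Block (1, 2) lives on processor (1, 2); after the transpose it is
# block (2, 1), owned by processor (0, 1), so a message is required.
print(owner(1, 2, P, Q))             # (1, 2)
print(transpose_target(1, 2, P, Q))  # (0, 1)
```

Because each processor must ship many such blocks to many distinct destinations, issuing the sends as nonblocking operations lets them proceed concurrently rather than serializing on each partner, which is the overlap the abstract describes.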
- Research Organization:
- Oak Ridge National Lab., TN (United States)
- Sponsoring Organization:
- USDOE, Washington, DC (United States); Defense Advanced Research Projects Agency, Arlington, VA (United States); Department of the Air Force, Washington, DC (United States)
- DOE Contract Number:
- AC05-84OR21400
- OSTI ID:
- 233300
- Report Number(s):
- CONF-9310220--8; ON: DE96010006; CNN: Contract DAAL03-91-C-0047
- Country of Publication:
- United States
- Language:
- English
Similar Records
PUMMA: Parallel Universal Matrix Multiplication Algorithms on distributed memory concurrent computers
The spectral decomposition of nonsymmetric matrices on distributed memory parallel computers