Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Matrix multiplication on the Intel Touchstone Delta

Conference ·
OSTI ID:54431
; ;  [1];  [2]
  1. Supercomputing Research Center, Bowie, MD (United States)
  2. CONVEX Computer Corp., Richardson, TX (United States)

Matrix multiplication is a key primitive in block matrix algorithms such as those found in LAPACK. We present results from our study of matrix multiplication algorithms on the Intel Touchstone Delta, a distributed memory message-passing architecture with a two-dimensional mesh topology. We obtain an implementation that uses communication primitives highly suited to the Delta and exploits the single node assembly-coded matrix multiplication. Our algorithm is completely general, able to deal with arbitrary mesh aspect ratios and matrix dimensions, and has achieved parallel efficiency of 86% with overall peak performance in excess of 8 Gflops on 256 nodes for an 8800 {times} 8800 matrix. We describe our algorithm design and implementation, and present performance results that demonstrate scalability and robust behavior over varying mesh topologies.

OSTI ID:
54431
Report Number(s):
DOE/ER/25151--1-Vol.1; CONF-930331--Vol.1; CNN: Contract DM28E04120
Country of Publication:
United States
Language:
English