Memory Efficient Parallel Matrix Multiplication Operation for Irregular Problems
Regular distributions for storing dense matrices on parallel systems are not always used in practice. In many scientific applications, the matrix distribution follows the underlying physical problem and may involve variable block sizes on individual processors. This paper describes a generalization of the Shared and Remote-memory based Universal Matrix Multiplication Algorithm (SRUMMA) [1] that handles irregularly distributed matrices. Our approach relies on a distribution-independent algorithm that provides dynamic load balancing by exploiting data locality. It matches the performance of the traditional approach to the irregular case, which relies on temporary arrays with a regular distribution, data redistribution, and matrix multiplication for regular matrices. The proposed algorithm is memory-efficient because no temporary matrices are needed; this feature is critical for systems such as the IBM Blue Gene/L, which offer a very limited amount of memory per node. The experimental results demonstrate very good performance across a range of matrix distributions and problem sizes motivated by real applications.
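To illustrate the idea behind the abstract, the following is a minimal sketch (not the paper's SRUMMA implementation): a matrix multiplication driven directly by an irregular block partitioning, where each output block accumulates products of variably sized sub-blocks. All function names and the example block sizes are hypothetical; on a real parallel system the inner block products would operate on blocks fetched via shared- or remote-memory copies rather than on a local list-of-lists matrix, and no regularly distributed temporary arrays are needed.

```python
def matmul(a, b):
    """Reference dense multiply of two list-of-lists matrices."""
    n, k, m = len(a), len(b), len(b[0])
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(m)]
            for i in range(n)]

def block_ranges(sizes):
    """Turn variable block sizes, e.g. [2, 3, 1], into index ranges."""
    ranges, start = [], 0
    for s in sizes:
        ranges.append(range(start, start + s))
        start += s
    return ranges

def irregular_matmul(a, b, row_sizes, inner_sizes, col_sizes):
    """Multiply A and B block by block under an irregular partitioning.

    Each (row block, column block) pair of C is computed by accumulating
    the products of the matching irregularly sized blocks of A and B,
    mimicking an owner-computes loop over irregularly distributed data.
    """
    n, m = len(a), len(b[0])
    c = [[0] * m for _ in range(n)]
    for ri in block_ranges(row_sizes):          # C row blocks
        for cj in block_ranges(col_sizes):      # C column blocks
            for kp in block_ranges(inner_sizes):  # shared inner dimension
                # Local block product; in a distributed setting these A
                # and B blocks would be remote and fetched on demand.
                for i in ri:
                    for j in cj:
                        c[i][j] += sum(a[i][p] * b[p][j] for p in kp)
    return c
```

Because the loop structure is expressed purely in terms of the block index ranges, the same code handles any variable block sizes, which is the sense in which the algorithm is distribution independent.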
- Research Organization:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 887376
- Report Number(s):
- PNNL-SA-47061; KP1704020; TRN: US200618
- Resource Relation:
- Conference: Proceedings of the 3rd Conference on Computing Frontiers (CF '06), Ischia, Italy, May 3-5, 2006, pp. 229-240
- Country of Publication:
- United States
- Language:
- English