Sparse matrix-vector multiplication on a reconfigurable supercomputer

Dubois, David H; Dubois, Andrew J; Boorman, Thomas M; Connor, Carolyn M; Poole, Steve

doi:10.1109/FCCM.2008.53

Title: Sparse matrix-vector multiplication on a reconfigurable supercomputer

Journal Article · Tue Jan 01 00:00:00 EST 2008 · ACM Transactions on Reconfigurable Technology and Systems (TRETS)

DOI:https://doi.org/10.1109/FCCM.2008.53· OSTI ID:962276

Dubois, David H ^[1]; Dubois, Andrew J ^[1]; Boorman, Thomas M ^[1]; Connor, Carolyn M ^[1]; Poole, Steve ^[2]

Los Alamos National Laboratory
ORNL

Double precision floating point Sparse Matrix-Vector Multiplication (SMVM) is a critical computational kernel used in iterative solvers for systems of sparse linear equations. The poor data locality exhibited by sparse matrices along with the high memory bandwidth requirements of SMVM result in poor performance on general purpose processors. Field Programmable Gate Arrays (FPGAs) offer a possible alternative with their customizable and application-targeted memory sub-system and processing elements. In this work we investigate two separate implementations of the SMVM on an SRC-6 MAPStation workstation. The first implementation investigates the peak performance capability, while the second implementation balances the amount of instantiated logic with the available sustained bandwidth of the FPGA subsystem. Both implementations yield the same sustained performance with the second producing a much more efficient solution. The metrics of processor and application balance are introduced to help provide some insight into the efficiencies of the FPGA and CPU based solutions explicitly showing the tight coupling of the available bandwidth to peak floating point performance. Due to the FPGA's ability to balance the amount of implemented logic to the available memory bandwidth it can provide a much more efficient solution. Finally, making use of the lessons learned implementing the SMVM, we present an fully implemented nonpreconditioned Conjugate Gradient Algorithm utilizing the second SMVM design.

View Journal Article

Cite

Export

Save

Research Organization:: Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)

Sponsoring Organization:: USDOE

DOE Contract Number:: AC52-06NA25396

OSTI ID:: 962276

Report Number(s):: LA-UR-08-06989; LA-UR-08-6989; TRN: US200919%%40

Journal Information:: ACM Transactions on Reconfigurable Technology and Systems (TRETS), Journal Name: ACM Transactions on Reconfigurable Technology and Systems (TRETS)

Country of Publication:: United States

Language:: English

References (6)

The Idea Behind Krylov Methods Ipsen, IIse C. F.; Meyer, Carl D. The American Mathematical Monthly, Vol. 105, Issue 10, p. 889-899 https://doi.org/10.2307/2589281	journal	January 1998
Sparse Matrix-Vector multiplication on FPGAs Zhuo, Ling; Prasanna, Viktor K. Proceedings of the 2005 ACM/SIGDA 13th international symposium on Field-programmable gate arrays https://doi.org/10.1145/1046192.1046202	conference	February 2005
Improving the memory-system performance of sparse-matrix vector multiplication Toledo, S. IBM Journal of Research and Development, Vol. 41, Issue 6 https://doi.org/10.1147/rd.416.0711	journal	November 1997
FPGAs vs. CPUs Underwood, Keith Proceedings of the 2004 ACM/SIGDA 12th international symposium on Field programmable gate arrays https://doi.org/10.1145/968280.968305	conference	February 2004
Floating-point sparse matrix-vector multiply for FPGAs deLorimier, Michael; DeHon, André Proceedings of the 2005 ACM/SIGDA 13th international symposium on Field-programmable gate arrays https://doi.org/10.1145/1046192.1046203	conference	February 2005
Sparse Matrix-Vector Multiplication Design on FPGAs Sun, Junqing; Peterson, Gregory; Storaasli, Olaf 15th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM 2007) https://doi.org/10.1109/FCCM.2007.56	conference	April 2007

Similar Records

A complete implementation of the conjugate gradient algorithm on a reconfigurable supercomputer

Journal Article · Tue Jan 01 00:00:00 EST 2008 · Journal: ACM TRETS (Transactions on Reconfigurable Tech and Systems and Systems) · OSTI ID:962276

Dubois, David H; Dubois, Andrew J; Connor, Carolyn M; +2 more

Mapping Sparse Matrix-Vector Multiplication on FPGAs

Conference · Mon Jan 01 00:00:00 EST 2007 · OSTI ID:962276

Sun, Junqing; Peterson, Greg D; Storaasli, Olaf O

Petascale Computing Enabling Technologies Project Final Report

Technical Report · Sun Feb 14 00:00:00 EST 2010 · OSTI ID:962276

de Supinski, B R

Related Subjects

97 MATHEMATICS AND COMPUTING
ACCURACY
ALGORITHMS
DESIGN
IMPLEMENTATION
KERNELS
MATRICES
METRICS
PERFORMANCE
PROCESSING
SUPERCOMPUTERS

Title: Sparse matrix-vector multiplication on a reconfigurable supercomputer

Citation Formats

References (6)

Similar Records

Related Subjects