Non-preconditioned conjugate gradient on cell and FPCA-based hybrid supercomputer nodes
Journal Article
·
OSTI ID:962335
- Los Alamos National Laboratory
This work presents a detailed implementation of a double precision, Non-Preconditioned, Conjugate Gradient algorithm on a Roadrunner heterogeneous supercomputer node. These nodes utilize the Cell Broadband Engine Architecture{trademark} in conjunction with x86 Opteron{trademark} processors from AMD. We implement a common Conjugate Gradient algorithm, on a variety of systems, to compare and contrast performance. Implementation results are presented for the Roadrunner hybrid supercomputer, SRC Computers, Inc. MAPStation SRC-6 FPGA enhanced hybrid supercomputer, and AMD Opteron only. In all hybrid implementations wall clock time is measured, including all transfer overhead and compute timings.
- Research Organization:
- Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC52-06NA25396
- OSTI ID:
- 962335
- Report Number(s):
- LA-UR-09-01455; LA-UR-09-1455; TRN: US200919%%96
- Country of Publication:
- United States
- Language:
- English
Similar Records
Non-preconditioned conjugate gradient on cell and FPGA based hybrid supercomputer nodes
A complete implementation of the conjugate gradient algorithm on a reconfigurable supercomputer
Implementation of a cell-wise Block-Gauss-Seidel iterative method for SN transport on a hybrid parallel computer architecture
Conference
·
Thu Jan 01 00:00:00 EST 2009
·
OSTI ID:962335
+1 more
A complete implementation of the conjugate gradient algorithm on a reconfigurable supercomputer
Journal Article
·
Tue Jan 01 00:00:00 EST 2008
· Journal: ACM TRETS (Transactions on Reconfigurable Tech and Systems and Systems)
·
OSTI ID:962335
+2 more
Implementation of a cell-wise Block-Gauss-Seidel iterative method for SN transport on a hybrid parallel computer architecture
Conference
·
Tue Dec 14 00:00:00 EST 2010
·
OSTI ID:962335