Non-preconditioned conjugate gradient on cell and FPGA based hybrid supercomputer nodes
- Los Alamos National Laboratory
This work presents a detailed implementation of a double precision, non-preconditioned, Conjugate Gradient algorithm on a Roadrunner heterogeneous supercomputer node. These nodes utilize the Cell Broadband Engine Architecture{sup TM} in conjunction with x86 Opteron{sup TM} processors from AMD. We implement a common Conjugate Gradient algorithm, on a variety of systems, to compare and contrast performance. Implementation results are presented for the Roadrunner hybrid supercomputer, SRC Computers, Inc. MAPStation SRC-6 FPGA enhanced hybrid supercomputer, and AMD Opteron only. In all hybrid implementations wall clock time is measured, including all transfer overhead and compute timings.
- Research Organization:
- Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC52-06NA25396
- OSTI ID:
- 956510
- Report Number(s):
- LA-UR-09-00248; LA-UR-09-248; TRN: US201014%%1879
- Resource Relation:
- Conference: Field-Programmable Custom Computing Machines 2009 (FCCM'09) ; April 5, 2009 ; Napa, CA
- Country of Publication:
- United States
- Language:
- English
Similar Records
Non-preconditioned conjugate gradient on cell and FPCA-based hybrid supercomputer nodes
A complete implementation of the conjugate gradient algorithm on a reconfigurable supercomputer
Roadrunner Supercomputer Breaks the Petaflop Barrier
Journal Article
·
Tue Mar 10 00:00:00 EDT 2009
·
OSTI ID:956510
+1 more
A complete implementation of the conjugate gradient algorithm on a reconfigurable supercomputer
Journal Article
·
Tue Jan 01 00:00:00 EST 2008
· Journal: ACM TRETS (Transactions on Reconfigurable Tech and Systems and Systems)
·
OSTI ID:956510
+2 more
Roadrunner Supercomputer Breaks the Petaflop Barrier
Multimedia
·
Mon Jun 09 00:00:00 EDT 2008
·
OSTI ID:956510