Non-preconditioned conjugate gradient on cell and FPGA based hybrid supercomputer nodes
- Los Alamos National Laboratory
This work presents a detailed implementation of a double precision, non-preconditioned, Conjugate Gradient algorithm on a Roadrunner heterogeneous supercomputer node. These nodes utilize the Cell Broadband Engine Architecture{sup TM} in conjunction with x86 Opteron{sup TM} processors from AMD. We implement a common Conjugate Gradient algorithm, on a variety of systems, to compare and contrast performance. Implementation results are presented for the Roadrunner hybrid supercomputer, SRC Computers, Inc. MAPStation SRC-6 FPGA enhanced hybrid supercomputer, and AMD Opteron only. In all hybrid implementations wall clock time is measured, including all transfer overhead and compute timings.
- Research Organization:
- Los Alamos National Laboratory (LANL)
- Sponsoring Organization:
- DOE
- DOE Contract Number:
- AC52-06NA25396
- OSTI ID:
- 956510
- Report Number(s):
- LA-UR-09-00248; LA-UR-09-248
- Country of Publication:
- United States
- Language:
- English
Similar Records
Non-preconditioned conjugate gradient on cell and FPCA-based hybrid supercomputer nodes
A complete implementation of the conjugate gradient algorithm on a reconfigurable supercomputer
369 TFlop/s molecular dynamics simulations on the Roadrunner general-purpose heterogeneous supercomputer
Journal Article
·
Tue Mar 10 00:00:00 EDT 2009
·
OSTI ID:962335
A complete implementation of the conjugate gradient algorithm on a reconfigurable supercomputer
Journal Article
·
Mon Dec 31 23:00:00 EST 2007
· Journal: ACM TRETS (Transactions on Reconfigurable Tech and Systems and Systems)
·
OSTI ID:957777
369 TFlop/s molecular dynamics simulations on the Roadrunner general-purpose heterogeneous supercomputer
Conference
·
Mon Dec 31 23:00:00 EST 2007
·
OSTI ID:964971