U.S. Department of Energy
Office of Scientific and Technical Information

Non-preconditioned conjugate gradient on cell and FPGA-based hybrid supercomputer nodes

Journal Article
OSTI ID:962335

This work presents a detailed implementation of a double-precision, non-preconditioned conjugate gradient algorithm on a Roadrunner heterogeneous supercomputer node. These nodes combine the Cell Broadband Engine Architecture™ with x86 Opteron™ processors from AMD. We implement a common conjugate gradient algorithm on a variety of systems to compare and contrast performance. Implementation results are presented for the Roadrunner hybrid supercomputer, the SRC Computers, Inc. MAPStation SRC-6 FPGA-enhanced hybrid supercomputer, and an AMD Opteron-only system. In all hybrid implementations, wall clock time is measured, including all transfer overhead and compute timings.
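For readers unfamiliar with the method named in the abstract, the following is a minimal sketch of the textbook non-preconditioned conjugate gradient iteration for a symmetric positive definite system Ax = b, written in plain Python (whose floats are IEEE double precision). It is not the authors' Cell or FPGA implementation, which this record does not reproduce. The function names (`conjugate_gradient`, `timed_solve`), the dense list-of-lists matrix layout, and the tolerance are illustrative assumptions. The timing wrapper only mirrors the idea of measuring wall clock time around the whole solve; there is no host-to-accelerator transfer in this sketch.

```python
import time


def matvec(A, x):
    """Dense matrix-vector product; A is a list of rows (assumed layout)."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def conjugate_gradient(A, b, tol=1e-10, max_iter=1000):
    """Textbook non-preconditioned CG for symmetric positive definite A.

    Returns (x, iterations). tol and max_iter are illustrative defaults.
    """
    x = [0.0] * len(b)
    r = list(b)          # residual b - A x, with the initial guess x = 0
    p = list(r)          # first search direction is the residual
    rs_old = dot(r, r)
    for k in range(max_iter):
        if rs_old ** 0.5 < tol:
            return x, k
        Ap = matvec(A, p)
        alpha = rs_old / dot(p, Ap)                       # step length
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        beta = rs_new / rs_old                            # direction update
        p = [ri + beta * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x, max_iter


def timed_solve(A, b):
    """Wall clock time around the whole solve (hypothetical helper)."""
    start = time.perf_counter()
    x, iters = conjugate_gradient(A, b)
    elapsed = time.perf_counter() - start
    return x, iters, elapsed
```

A small usage example under the same assumptions: `timed_solve([[4.0, 1.0], [1.0, 3.0]], [1.0, 2.0])` solves a 2x2 system whose exact solution is x = (1/11, 7/11). On a hybrid node, the timed region would also need to enclose the data transfers to and from the accelerator, as the abstract states.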

Research Organization:
Los Alamos National Laboratory (LANL)
Sponsoring Organization:
DOE
DOE Contract Number:
AC52-06NA25396
OSTI ID:
962335
Report Number(s):
LA-UR-09-01455; LA-UR-09-1455
Country of Publication:
United States
Language:
English

Similar Records

Non-preconditioned conjugate gradient on cell and FPGA based hybrid supercomputer nodes
Conference · Wed Dec 31 23:00:00 EST 2008 · OSTI ID:956510

A complete implementation of the conjugate gradient algorithm on a reconfigurable supercomputer
Journal Article · Mon Dec 31 23:00:00 EST 2007 · Journal: ACM Transactions on Reconfigurable Technology and Systems (TRETS) · OSTI ID:957777

Implementation of a cell-wise Block-Gauss-Seidel iterative method for SN transport on a hybrid parallel computer architecture
Conference · Mon Dec 13 23:00:00 EST 2010 · OSTI ID:1044156