Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Scalable Out-of-Core Solvers on Xeon Phi Cluster

Book ·
OSTI ID:1324039
 [1];  [2];  [3];  [1]
  1. ORNL
  2. Chinese University of Hong Kong (CUHK)
  3. Center for Computational Materials Science

This paper documents the implementation of a distributive out-of-core (OOC) solver for performing LU and Cholesky factorizations of a large dense matrix on clusters of many-core programmable co-processors. The out-of- core algorithm combines both the left-looking and right-looking schemes aimed to minimize the movement of data between the CPU host and the co-processor, optimizing data locality as well as computing throughput. The OOC solver is built to align with the format of the ScaLAPACK software library, making it readily portable to any existing codes using ScaLAPACK. A runtime analysis conducted on Beacon (an Intel Xeon plus Intel Xeon Phi cluster which composed of 48 nodes of multi-core CPU and MIC) at the Na- tional Institute for Computational Sciences is presented. Comparison of the performance on the Intel Xeon Phi and GPU clusters are also provided.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
ORNL LDRD Director's R&D
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1324039
Country of Publication:
United States
Language:
English

Similar Records

QCD For Intel(R) Xeon Phi(tm) and Xeon(tm) processors
Software · Thu Sep 11 00:00:00 EDT 2014 · OSTI ID:1231842

QCD For Intel(R) Xeon Phi(tm) and Xeon(tm) processors
Software · Tue Sep 09 20:00:00 EDT 2014 · OSTI ID:code-2924

NWQ-sim
Software · Wed Sep 22 20:00:00 EDT 2021 · OSTI ID:code-64325

Related Subjects