U.S. Department of Energy
Office of Scientific and Technical Information

symPACK: A GPU-Capable Fan-Out Sparse Cholesky Solver

Conference · OSTI ID:2006802

Sparse symmetric positive definite systems of equations are ubiquitous in scientific workloads and applications, and parallel sparse Cholesky factorization is the method of choice for solving them. Developing parallel sparse Cholesky codes that run efficiently on today’s large-scale heterogeneous distributed-memory platforms is therefore of vital importance. Modern supercomputers offer nodes that contain a mix of CPUs and GPUs, and to fully utilize the computing power of these nodes, scientific codes must be adapted to offload expensive computations to GPUs. We present symPACK, a GPU-capable parallel sparse Cholesky solver that uses one-sided communication primitives and remote procedure calls provided by the UPC++ library. We also use the UPC++ "memory kinds" feature to enable efficient communication of GPU-resident data. On a number of large problems, symPACK outperforms comparable state-of-the-art GPU-capable Cholesky factorization codes by up to 14x on the NERSC Perlmutter supercomputer.
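
For readers unfamiliar with the underlying method, the following is a minimal sketch of how a Cholesky factorization solves a symmetric positive definite system. It is a dense, sequential toy in plain Python and is not symPACK's algorithm: it involves no sparsity handling, no fan-out scheduling, no UPC++ communication, and no GPU offload. The function names `cholesky` and `solve_spd` are hypothetical and chosen only for this illustration.

```python
import math


def cholesky(a):
    """Return lower-triangular L with L * L^T == a (a must be symmetric positive definite)."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for j in range(n):
        # Diagonal entry: remove contributions of the columns already computed.
        d = a[j][j] - sum(low[j][k] ** 2 for k in range(j))
        if d <= 0.0:
            raise ValueError("matrix is not positive definite")
        low[j][j] = math.sqrt(d)
        # Entries below the diagonal in column j.
        for i in range(j + 1, n):
            s = a[i][j] - sum(low[i][k] * low[j][k] for k in range(j))
            low[i][j] = s / low[j][j]
    return low


def solve_spd(a, b):
    """Solve a x = b via a = L L^T: forward substitution, then back substitution."""
    low = cholesky(a)
    n = len(b)
    # Forward substitution: L y = b.
    y = [0.0] * n
    for i in range(n):
        y[i] = (b[i] - sum(low[i][k] * y[k] for k in range(i))) / low[i][i]
    # Back substitution: L^T x = y, where (L^T)[i][k] == low[k][i].
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(low[k][i] * x[k] for k in range(i + 1, n))) / low[i][i]
    return x


# Small tridiagonal SPD example; the right-hand side is chosen so that x is all ones.
A = [[4.0, 1.0, 0.0],
     [1.0, 4.0, 1.0],
     [0.0, 1.0, 4.0]]
b = [5.0, 6.0, 5.0]
x = solve_spd(A, b)
```

In this tridiagonal example the factor keeps the zero in its lower-left corner. Sparse solvers exploit that kind of structure at scale, whereas this sketch stores and visits every entry.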

Research Organization: Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization: USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR) (SC-21)
DOE Contract Number: AC02-05CH11231
OSTI ID: 2006802
Country of Publication: United States
Language: English

Similar Records

An Asynchronous Task-based Fan-Both Sparse Cholesky Solver
Conference · Thu Dec 31 23:00:00 EST 2015 · OSTI ID:1598521

GPU Accelerated Sparse Cholesky Factorization
Conference · Sat Nov 16 23:00:00 EST 2024 · OSTI ID:2586562

symPACK v2.0
Software · Sun Nov 24 19:00:00 EST 2019 · OSTI ID:code-53811
