Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Toward performance-portable PETSc for GPU-based exascale systems

Journal Article · · Parallel Computing
 [1];  [2];  [1];  [3];  [1];  [4];  [5];  [6];  [1];  [7];  [8];  [9];  [1];  [1]
  1. Argonne National Lab. (ANL), Lemont, IL (United States)
  2. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  3. Univ. of Colorado, Boulder, CO (United States)
  4. Univ. at Buffalo, NY (United States)
  5. Tech-X, Boulder, CO (United States)
  6. Univ. of Chicago, IL (United States)
  7. Argonne National Lab. (ANL), Lemont, IL (United States); TU Wien (Austria)
  8. Flatiron Institute, New York, NY (United States)
  9. King Abdullah University of Science and Technology (KAUST), Thuwal (Saudi Arabia)

The Portable Extensible Toolkit for Scientific computation (PETSc) library delivers scalable solvers for nonlinear time-dependent differential and algebraic equations and for numerical optimization. The PETSc design for performance portability addresses fundamental GPU accelerator challenges and stresses flexibility and extensibility by separating the programming model used by the application from that used by the library, and it enables application developers to use their preferred programming model, such as Kokkos, RAJA, SYCL, HIP, CUDA, or OpenCL, on upcoming exascale systems. Furthermore, a blueprint for using GPUs from PETSc-based codes is provided, and case studies emphasize the flexibility and high performance achieved on current GPU-based systems.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC); USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1834595
Journal Information:
Parallel Computing, Journal Name: Parallel Computing Vol. 108; ISSN 0167-8191
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (13)

A performance spectrum for parallel computational frameworks that solve PDEs: A performance spectrum for parallel computational frameworks that solve PDEs journal December 2017
KSPHPDDM and PCHPDDM: Extending PETSc with advanced Krylov methods and robust multilevel overlapping Schwarz preconditioners journal February 2021
Accelerating sparse matrix–matrix multiplication with GPU Tensor Cores journal December 2020
An investigation of the performance portability of OpenCL journal November 2013
Kokkos: Enabling manycore performance portability through polymorphic memory access patterns journal December 2014
Preparing sparse solvers for exascale computing
  • Anzt, Hartwig; Boman, Erik; Falgout, Rob
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 378, Issue 2166 https://doi.org/10.1098/rsta.2019.0053
journal January 2020
OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems journal May 2010
ViennaCL---Linear Algebra Library for Multi- and Many-Core Architectures journal January 2016
Landau Collision Integral Solver with Adaptive Mesh Refinement on Emerging Architectures journal January 2017
Comparative Study of Finite Element Methods Using the Time-Accuracy-Size(TAS) Spectrum Analysis journal January 2018
Mixed Precision Block Fused Multiply-Add: Error Analysis and Application to GPU Tensor Cores journal January 2020
Sparse Matrix-Vector Multiplication on GPGPUs journal January 2017
Supporting Mixed-domain Mixed-precision Matrix Multiplication within the BLIS Framework
  • Van Zee, Field G.; Parikh, Devangi N.; Geijn, Robert A. Van De
  • ACM Transactions on Mathematical Software, Vol. 47, Issue 2 https://doi.org/10.1145/3402225
journal April 2021

Similar Records

PETSc/TAO developments for GPU-based early exascale systems
Journal Article · Fri Jan 17 23:00:00 EST 2025 · The International Journal of High Performance Computing Applications · OSTI ID:2997026

Case Study of Using Kokkos and SYCLs Performance-Portable Frameworks for Milc-Dslash Benchmark on NVIDIA, AMD and Intel GPUs
Conference · Thu Dec 31 23:00:00 EST 2020 · OSTI ID:1892057