Kokkos 3: Programming Model Extensions for the Exascale Era
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Argonne National Lab. (ANL), Lemont, IL (United States)
- Swiss National Supercomputing Centre, Lugano (Switzerland)
- Sandia National Lab. (SNL-CA), Livermore, CA (United States)
As the push towards exascale hardware has increased the diversity of system architectures, performance portability has become a critical aspect for scientific software. We describe the Kokkos Performance Portable Programming Model that allows developers to write single source applications for diverse high performance computing architectures. Kokkos provides key abstractions for both the compute and memory hierarchy of modern hardware. Here, we describe the novel abstractions that have been added to Kokkos recently such as hierarchical parallelism, containers, task graphs, and arbitrary-sized atomic operations. We demonstrate the performance of these new features with reproducible benchmarks on CPUs and GPUs.
- Research Organization:
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- NA0003525; AC05-00OR22725; AC02-05CH11231
- OSTI ID:
- 1825555
- Alternate ID(s):
- OSTI ID: 1822222; OSTI ID: 1825556; OSTI ID: 1829069; OSTI ID: 1867786
- Report Number(s):
- SAND-2021-7666J; 697015
- Journal Information:
- IEEE Transactions on Parallel and Distributed Systems, Vol. 33, Issue 4; ISSN 1045-9219
- Publisher:
- IEEECopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Kokkos 3: Programming Model Extensions for the Exascale Era
Data Locality Enhancement of Dynamic Simulations for Exascale Computing (Final Report)