Performance Portability of a Wilson Dslash Stencil Operator Mini-App Using Kokkos and SYCL
We describe our experiences in creating mini-apps for the Wilson-Dslash stencil operator for Lattice Quantum Chromodynamics using the Kokkos and SYCL programming models. In particular we comment on the performance achieved on a variety of hardware architectures, limitations we have reached in both programming models and how these have been resolved by us, or may be resolved by the developers of these models.
- Research Organization:
- Thomas Jefferson National Accelerator Facility (TJNAF), Newport News, VA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Nuclear Physics (NP)
- DOE Contract Number:
- AC05-06OR23177
- OSTI ID:
- 1976171
- Report Number(s):
- JLAB-CIO-19-3085; DOE/OR/23177-4806
- Resource Relation:
- Conference: 2019 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC)
- Country of Publication:
- United States
- Language:
- English
Similar Records
Case Study of Using Kokkos and SYCLs Performance-Portable Frameworks for Milc-Dslash Benchmark on NVIDIA, AMD and Intel GPUs
Application of performance portability solutions for GPUs and many-core CPUs to track reconstruction kernels
Application of performance portability solutions for GPUs and many-core CPUs to track reconstruction kernels
Conference
·
2021
·
OSTI ID:1892057
+4 more
Application of performance portability solutions for GPUs and many-core CPUs to track reconstruction kernels
Conference
·
2024
·
OSTI ID:2283705
+11 more
Application of performance portability solutions for GPUs and many-core CPUs to track reconstruction kernels
Conference
·
2024
·
OSTI ID:2438811
+11 more