Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Understanding Performance Portability of SYCL Kernels: A Case Study with the All-Pairs Distance Calculation in Bioinformatics on GPUs

Conference ·

SYCL is a portable programming model. Toward the goal of a better understanding of performance portability of SYCL kernels on GPUs, we select a bioinformatics kernel for computing the all-pairs distance as a case study. After migrating the kernel from CUDA to HIP and SYCL, we evaluate the performance of the CUDA, HIP, and SYCL kernels on NVIDIA V100 and AMD MI210 GPUs. We analyze the GPU instructions from the kernels to explain performance gaps between SYCL and CUDA/HIP. We hope that the findings are valuable for improving performance portability of SYCL.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1996690
Country of Publication:
United States
Language:
English

References (19)

Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL book November 2020
Automatic OpenCL Code Generation for Multi-device Heterogeneous Architectures conference September 2015
Evaluating the Performance of the hipSYCL Toolchain for HPC Kernels on NVIDIA V100 GPUs conference April 2020
Performance portability study of epistasis detection using SYCL on NVIDIA GPU conference August 2022
Bringing performant support for Nvidia® hardware to SYCL conference April 2020
Kokkos 3: Programming Model Extensions for the Exascale Era journal January 2021
Experiences Porting NAMD to the Data Parallel C++ Programming Model conference May 2022
Exploiting graphics processing units for computational biology and bioinformatics journal July 2010
SkelCL: a high-level extension of OpenCL for multi-GPU systems journal May 2014
Evaluation of Intel's DPC++ Compatibility Tool in heterogeneous computing journal July 2022
NVIDIA Tesla: A Unified Graphics and Computing Architecture journal March 2008
Parallel Computing Experiences with CUDA journal July 2008
Understanding Performance Portability of Bioinformatics Applications in SYCL on an NVIDIA GPU conference December 2022
Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level conference February 2018
Evaluating the performance of HPC-style SYCL applications conference April 2020
Rodinia: A benchmark suite for heterogeneous computing conference October 2009
The Xe GPU Architecture conference August 2020
Evaluating Performance and Portability of a core bioinformatics kernel on multiple vendor GPUs conference November 2021
Alpaka -- An Abstraction Library for Parallel Kernel Acceleration conference May 2016