Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Performance portability study of epistasis detection using SYCL on NVIDIA GPU

Conference ·
We describe the experience of converting a CUDA implementation of a high-order epistasis detection algorithm to SYCL. The goals are for our work to be useful to application and compiler developers with a detailed description of migration paths between CUDA and SYCL. Evaluating the CUDA and SYCL applications on an NVIDIA V100 GPU, we find that the optimization of loop unrolling needs to be applied manually to the SYCL kernel for obtaining comparable performance. The performance of the SYCL group reduce function, an alternative to the CUDA warp-based reduction, depends on the problem and work group sizes. The 64-bit popcount operation implemented with tree of adders is slightly faster than the built-in popcount operation. When the number of OpenMP threads is four, the highest performance of the SYCL and CUDA applications are comparable.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1883813
Country of Publication:
United States
Language:
English

References (7)

Data Parallel C++: Mastering DPC++ for Programming of Heterogeneous Systems using C++ and SYCL book November 2020
D10S1423 identifies a susceptibility locus for Alzheimer's disease (AD7) in a prospective, longitudinal, double‐blind study of asymptomatic individuals: Results at 14 years
  • Zubenko, George S.; Hughes, Hugh B.; Zubenko, Wendy N.
  • American Journal of Medical Genetics Part B: Neuropsychiatric Genetics, Vol. 153B, Issue 2 https://doi.org/10.1002/ajmg.b.31017
journal February 2010
Population Count on Intel® CPU, GPU and FPGA conference May 2020
Experiences Porting NAMD to the Data Parallel C++ Programming Model conference May 2022
SkelCL: a high-level extension of OpenCL for multi-GPU systems journal May 2014
Parallel Computing Experiences with CUDA journal July 2008
A Comparison of SYCL, OpenCL, CUDA, and OpenMP for Massively Parallel Support Vector Machine Classification on Multi-Vendor Hardware conference May 2022

Similar Records

Related Subjects