skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Towards Achieving Performance Portability Using Directives for Accelerators

Conference ·

In this paper we explore the performance portability of directives provided by OpenMP 4 and OpenACC to program various types of node architectures with attached accelerators, both self-hosted multicore and offload multicore/GPU. Our goal is to examine how successful OpenACC and the newer offload features of OpenMP 4.5 are for moving codes between architectures, how much tuning might be required and what lessons we can learn from this experience. To do this, we use examples of algorithms with varying computational intensities for our evaluation, as both compute and data access efficiency are important considerations for overall application performance. We implement these kernels using various methods provided by newer OpenACC and OpenMP implementations, and we evaluate their performance on various platforms including both X86_64 with attached NVIDIA GPUs, self-hosted Intel Xeon Phi KNL, as well as an X86_64 host system with Intel Xeon Phi coprocessors. In this paper, we explain what factors affected the performance portability such as how to pick the right programming model, its programming style, its availability on different platforms, and how well compilers can optimize and target to multiple platforms.

Research Organization:
Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1567436
Resource Relation:
Conference: 2016 Third Workshop on Accelerator Programming Using Directives (WACCPD)
Country of Publication:
United States
Language:
English

Similar Records

Investigation of Portable Event-Based Monte Carlo Transport Using the NVIDIA Thrust Library
Journal Article · Wed Jun 15 00:00:00 EDT 2016 · Transactions of the American Nuclear Society · OSTI ID:1567436

Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 Benchmarks
Conference · Sun Jul 01 00:00:00 EDT 2018 · OSTI ID:1567436

HOMMEXX 1.0: a performance-portable atmospheric dynamical core for the Energy Exascale Earth System Model
Journal Article · Thu Apr 11 00:00:00 EDT 2019 · Geoscientific Model Development (Online) · OSTI ID:1567436