Towards Achieving Performance Portability Using Directives for Accelerators

Lopez, M. Graham; Larrea, Veronica Vergara; Joubert, Wayne; Hernandez, Oscar; Haidar, Azzam; Tomov, Stanimire; Dongarra, Jack

doi:10.1109/WACCPD.2016.006

Conference · November 1, 2016

In this paper, we explore the performance portability of directives provided by OpenMP 4 and OpenACC for programming various types of node architectures with attached accelerators, both self-hosted multicore and offload multicore/GPU. Our goal is to examine how successful OpenACC and the newer offload features of OpenMP 4.5 are for moving codes between architectures, how much tuning might be required, and what lessons we can learn from this experience. To do this, we evaluate example algorithms with varying computational intensities, as both compute and data access efficiency are important considerations for overall application performance. We implement these kernels using various methods provided by newer OpenACC and OpenMP implementations, and we evaluate their performance on several platforms: X86_64 with attached NVIDIA GPUs, self-hosted Intel Xeon Phi KNL, and an X86_64 host system with Intel Xeon Phi coprocessors. We explain which factors affected performance portability, such as the choice of programming model, its programming style, its availability on different platforms, and how well compilers can optimize for and target multiple platforms.

OSTI does not have a digital full text copy available. For more information, please see document availability, search WorldCat, or search Google Scholar.

Research Organization: Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)

Sponsoring Organization: USDOE Office of Science (SC)

DOE Contract Number: AC05-00OR22725

OSTI ID: 1567436

Country of Publication: United States

Language: English

Similar Records

Evaluating Performance Portability of Accelerator Programming Models using SPEC ACCEL 1.2 Benchmarks

Conference · July 1, 2018 · OSTI ID:1468172

HOMMEXX 1.0: a performance-portable atmospheric dynamical core for the Energy Exascale Earth System Model

Journal Article · April 11, 2019 · Geoscientific Model Development (Online) · OSTI ID:1529244

Early Experiences Writing Performance Portable OpenMP 4 Codes

Conference · December 31, 2015 · OSTI ID:1324101

Related Subjects

97 MATHEMATICS AND COMPUTING
Computer Science
