Performance Portability Evaluation of OpenCL Benchmarks across Intel and NVIDIA Platforms

Bertoni, Colleen; Kwack, Jaehyuk; Applencourt, Thomas; Ghadar, Yasaman; Homerding, Brian; Knight, Christopher; Videau, Brice; Zheng, Huihuo; Morozov, Vitali; Parker, Scott

doi:10.1109/IPDPSW50202.2020.00067


Conference · January 1, 2020

DOI: https://doi.org/10.1109/IPDPSW50202.2020.00067 · OSTI ID: 1804079


We evaluate the capabilities of vendor-provided OpenCL implementations for performance portability across multiple computing platforms, using the Rodinia benchmark suite. We apply the performance portability metric defined by Pennycook et al., and we use roofline efficiency from the Roofline performance model as the "performance efficiency" in the metric's definition. We found that the delivered performance portability is similar for several benchmarks, even when the per-platform roofline efficiencies differ considerably from one benchmark to another. To help distinguish between these cases, we extend the metric by adding the standard deviation of the performance efficiencies for each benchmark. We argue that the standard deviation gives additional insight into performance portability assessment because it captures the performance variability across platforms. Additionally, we discuss the challenges of measuring performance portability that are associated with algorithms and system software. In terms of algorithms, the benchmarks must be carefully constructed and must appropriately use the concurrency available on a platform. In terms of system software, measurement depends on vendor performance tools supporting the desired programming model and runtime so that the metrics of interest can be collected.
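The quantities named in the abstract can be illustrated with a minimal sketch. It assumes the usual statement of the Pennycook et al. metric (the harmonic mean of an application's performance efficiencies over the platform set, and zero if any platform is unsupported) and the basic roofline bound (attainable performance is the smaller of peak compute and arithmetic intensity times peak memory bandwidth). The function names, the choice of population rather than sample standard deviation, and all numbers are illustrative assumptions, not values or code from the paper.

```python
from statistics import pstdev


def roofline_efficiency(achieved_gflops, arithmetic_intensity,
                        peak_gflops, peak_bandwidth_gbs):
    """Fraction of the roofline bound that a kernel achieves on one platform."""
    # Basic roofline: bounded by peak compute or by memory traffic.
    attainable = min(peak_gflops, arithmetic_intensity * peak_bandwidth_gbs)
    return achieved_gflops / attainable


def performance_portability(efficiencies):
    """Harmonic mean of efficiencies; 0 if any platform is unsupported.

    An unsupported platform is represented here as None or a
    non-positive efficiency (an assumption of this sketch).
    """
    if not efficiencies or any(e is None or e <= 0 for e in efficiencies):
        return 0.0
    return len(efficiencies) / sum(1.0 / e for e in efficiencies)


def portability_with_spread(efficiencies):
    """Return (portability, standard deviation of the efficiencies)."""
    pp = performance_portability(efficiencies)
    supported = [e for e in efficiencies if e is not None]
    # Population standard deviation is an assumption; the abstract
    # does not say which form the authors use.
    spread = pstdev(supported) if supported else 0.0
    return pp, spread


# Hypothetical efficiencies for two benchmarks on two platforms.
uniform = [0.5, 0.5]       # same efficiency everywhere
uneven = [0.375, 0.75]     # very different efficiency per platform

pp_uniform, sd_uniform = portability_with_spread(uniform)
pp_uneven, sd_uneven = portability_with_spread(uneven)
print(pp_uniform, sd_uniform)
print(pp_uneven, sd_uneven)
```

With these made-up inputs both benchmarks have a harmonic mean of about 0.5, while the standard deviation is zero for the first and clearly nonzero for the second. That is the kind of case the abstract says the extended metric is meant to distinguish.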



Research Organization: Argonne National Lab. (ANL), Argonne, IL (United States)

Sponsoring Organization: USDOE Office of Science - Office of Basic Energy Sciences - Scientific User Facilities Division; USDOE Exascale Computing Project

DOE Contract Number: AC02-06CH11357

OSTI ID: 1804079

Resource Relation: Conference: 34th IEEE International Parallel and Distributed Processing Symposium, 05/18/20 - 05/22/20, New Orleans, LA, US

Country of Publication: United States

Language: English

Similar Records

Evaluation of CHO Benchmarks on the Arria 10 FPGA using Intel FPGA SDK for OpenCL

Technical Report · May 23, 2017 · OSTI ID: 1804079

Jin, Zheming; Yoshii, Kazutomo; Finkel, Hal; +1 more

Case Study of Using Kokkos and SYCL as Performance-Portable Frameworks for Milc-Dslash Benchmark on NVIDIA, AMD and Intel GPUs

Conference · January 1, 2021 · OSTI ID: 1804079

Dufek, Amanda S; Gayatri, Rahulkumar; Mehta, Neil A; +4 more

Toward Evaluating High-Level Synthesis Portability and Performance between Intel and Xilinx FPGAs

Conference · April 1, 2021 · OSTI ID: 1804079

Cabrera, Anthony; Young, Aaron; Lambert, Jacob; +8 more

Related Subjects

GPU
OpenCL
high performance computing
performance efficiency
performance portability
roofline performance analysis
