Optimization with the OpenACC-to-FPGA framework on the Arria 10 and Stratix 10 FPGAs
Journal Article · Parallel Computing
- Univ. of Oregon, Eugene, OR (United States)
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
The reconfigurable computing paradigm with field programmable gate arrays (FPGAs) has received renewed interest in the high-performance computing field due to FPGAs’ unique combination of performance and energy efficiency. However, difficulties in programming and optimizing FPGAs have prevented them from being widely accepted as general-purpose computing devices. In accelerator-based heterogeneous computing, portability across diverse heterogeneous devices is also an important issue, but the unique architectural features of FPGAs make this difficult to achieve. To address these issues, a directive-based, high-level FPGA programming and optimization framework was previously developed. In this work, the developed optimizations are combined holistically using the directive-based approach to show that each benchmark requires its own set of optimizations to maximize performance. We perform this exploration on Intel Arria 10 and Stratix 10 FPGAs. We also explore the relationships among performance, resource usage, and compilation time, and investigate the implications for performance portability. Finally, we present an initial evaluation of a real-world proxy application, LULESH.
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1807204
- Alternate ID(s):
- OSTI ID: 1894388
- Journal Information:
- Journal Name: Parallel Computing; Journal Volume: 104-105; Journal Issue: 0; ISSN 0167-8191
- Publisher:
- Elsevier
- Country of Publication:
- United States
- Language:
- English
Similar Records
In-Depth Optimization with the OpenACC-to-FPGA Framework on an Arria 10 FPGA
Conference · May 1, 2020 · OSTI ID: 1671395
Evaluation of CHO Benchmarks on the Arria 10 FPGA using Intel FPGA SDK for OpenCL
Technical Report · May 23, 2017 · OSTI ID: 1372106
OpenACC to FPGA: A Framework for Directive-based High-Performance Reconfigurable Computing
Conference · May 1, 2016 · OSTI ID: 1261388