A Synthesis Methodology for Intelligent Memory Interfaces in Accelerator Systems
Conference
·
OSTI ID:2587170
Domain-specific systems improve the performance of a specific set of applications compared to general-purpose processing systems by deploying custom hardware accelerators. These hardware accelerators are generated using high-level synthesis (HLS) tools. The HLS tools enable a comprehensive design space exploration to optimize the compute performance of the generated accelerators. However, they often ignore the challenges of implementing the accelerators in a system-on-chip, particularly how the accelerators access memory. Our work introduces a buffering system design that improves accelerators' memory accesses by intelligently employing burst transactions to prefetch useful data from external memory to on-chip local buffers. Our design is dynamic, parametric, and transparent to the accelerators generated by HLS tools. We derive the buffering system parameters using appropriate compiler-based analysis passes and memory channel latency constraints. The proposed buffering system design results in, on average, 8.8x performance improvements while lowering memory channel utilization on average by 53.2% for a set of PolyBench kernels.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 2587170
- Report Number(s):
- PNNL-SA-181728
- Country of Publication:
- United States
- Language:
- English
Similar Records
A one`s complement cache memory
Logic in Memory Emulator
Low latency memory access and synchronization
Conference
·
Fri Dec 30 23:00:00 EST 1994
·
OSTI ID:98913
Logic in Memory Emulator
Software
·
Tue Apr 28 20:00:00 EDT 2020
·
OSTI ID:code-40234
Low latency memory access and synchronization
Patent
·
Mon Feb 05 23:00:00 EST 2007
·
OSTI ID:949197