Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

SPARTA: High-Level Synthesis of Parallel Multi-Threaded Accelerators

Journal Article · · ACM Transactions on Reconfigurable Technology and Systems
DOI:https://doi.org/10.1145/3677035· OSTI ID:2570267
This article presents a methodology for the Synthesis of PARallel multi-Threaded Accelerators (SPARTA) from OpenMP annotated C/C++ specifications. SPARTA extends an open-source HLS tool, enabling the generation of accelerators that provide latency tolerance for irregular memory accesses through multithreading, support fine-grained memory-level parallelism through a hot-potato deflection-based network-on-chip (NoC), support synchronization constructs, and can instantiate memory-side caches. Our approach is based on a custom runtime OpenMP library, providing flexibility and extensibility. Experimental results show high scalability when synthesizing irregular graph kernels. The accelerators generated with our approach are, on average, 2.29x faster than state-of-the-art HLS methodologies.
Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE Laboratory Directed Research and Development (LDRD) Program
Grant/Contract Number:
AC05-76RL01830
OSTI ID:
2570267
Report Number(s):
PNNL-SA--185565
Journal Information:
ACM Transactions on Reconfigurable Technology and Systems, Journal Name: ACM Transactions on Reconfigurable Technology and Systems Journal Issue: 1 Vol. 18; ISSN 1936-7414; ISSN 1936-7406
Publisher:
Association for Computing Machinery (ACM)Copyright Statement
Country of Publication:
United States
Language:
English

References (20)

A Survey on Graph Processing Accelerators: Challenges and Opportunities journal March 2019
Parallel heuristics for scalable community detection journal August 2015
LUBM: A benchmark for OWL knowledge base systems journal October 2005
Enhancing Butterfly Fat Tree NoCs for FPGAs with Lightweight Flow Control conference April 2019
Hoplite: Building austere overlay NoCs for FPGAs conference September 2015
From software threads to parallel hardware in high-level synthesis for FPGAs conference December 2013
MachSuite: Benchmarks for accelerator design and customized architectures conference October 2014
Exploring DataVortex Systems for Irregular Applications conference May 2017
The Data Vortex Optical Packet Switched Interconnection Network journal July 2008
Svelto: High-Level Synthesis of Multi-Threaded Accelerators for Graph Analytics journal March 2022
A case for bufferless routing in on-chip networks conference June 2009
A scalable processing-in-memory accelerator for parallel graph processing conference June 2015
Boosting the Performance of FPGA-based Graph Processor using Hybrid Memory Cube conference February 2017
Hoplite journal March 2017
An FPGA framework for edge-centric graph processing conference May 2018
Gpop conference February 2019
Automated accelerator optimization aided by graph neural networks conference July 2022
Accelerator design with decoupled hardware customizations conference July 2022
HLS-based High-throughput and Work-efficient Synthesizable Graph Processing Template Pipeline
  • Ahangari, Hamzeh; Özdal, Muhammet Mustafa; Öztürk, Özcan
  • ACM Transactions on Embedded Computing Systems, Vol. 22, Issue 2 https://doi.org/10.1145/3529256
journal January 2023
FPGA HLS Today: Successes, Challenges, and Opportunities journal August 2022

Similar Records

A Parallel Graph Environment for Real-World Data Analytics Workflows
Conference · Mon Mar 25 00:00:00 EDT 2019 · OSTI ID:1591771

Hierarchical resilience with lightweight threads.
Technical Report · Sat Oct 01 00:00:00 EDT 2011 · OSTI ID:1029809

Characterizing and Mitigating Work Time Inflation in Task Parallel Programs
Journal Article · Mon Dec 31 19:00:00 EST 2012 · Scientific Programming · OSTI ID:1197887