U.S. Department of Energy
Office of Scientific and Technical Information

ASAP: Automatic Synthesis of Area-Efficient and Precision-Aware CGRAs

Conference

Coarse-grained reconfigurable accelerators (CGRAs) are a promising accelerator design choice that strikes a balance between performance and adaptability to different computing patterns across various application domains. Designing a CGRA for a specific application domain involves enormous software/hardware engineering effort. Recent research works explore loop transformations, functional unit types, network topology, and memory size to identify optimal CGRA designs given a set of kernels from a specific application domain. Unfortunately, the impact of functional units with different precision support has rarely been investigated. To address this gap, we propose ASAP, a hardware/software co-design framework that automatically identifies and synthesizes an optimal precision-aware CGRA for a set of applications of interest. Our evaluation shows that ASAP generates specialized designs 3.2×, 4.21×, and 5.8× more efficient (in terms of performance per unit of energy or area) than non-specialized homogeneous CGRAs, for the scientific computing, embedded, and edge machine learning domains, respectively, with limited accuracy loss. Moreover, ASAP provides more efficient designs than other state-of-the-art synthesis frameworks for specialized CGRAs.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
DOE Contract Number:
Report Number(s):
Resource Relation:
Conference: Proceedings of the 36th ACM International Conference on Supercomputing (ICS 2022), June 28-30, 2022, Virtual, Online
Country of Publication:
United States
