Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication
- Korea Aerospace Univ., Goyang (Korea, Republic of)
- Georgia Inst. of Technology, Atlanta, GA (United States)
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
There is a growing interest in custom spatial accelerators for machine learning applications. These accelerators employ a spatial array of processing elements (PEs) interacting via custom buffer hierarchies and networks-on-chip. The efficiency of these accelerators comes from employing optimized dataflow (i.e., spatial/temporal partitioning of data across the PEs and fine-grained scheduling) strategies to optimize data reuse. The focus of this work is to evaluate these accelerator architectures using a tiled general matrix-matrix multiplication (GEMM) kernel. To do so, we develop a framework that finds optimized mappings (dataflow and tile sizes) for a tiled GEMM for a given spatial accelerator and workload combination, leveraging an analytical cost model for runtime and energy. Our evaluations over five spatial accelerators demonstrate that the tiled GEMM mappings systematically generated by our framework achieve high performance on various GEMM workloads and accelerators.
- Research Organization:
- Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- DOE Contract Number:
- AC04-94AL85000
- OSTI ID:
- 1808019
- Report Number(s):
- SAND2021-7479R; 697053
- Country of Publication:
- United States
- Language:
- English
Similar Records
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication.
autoGEMM: Pushing the Limits of Irregular Matrix Multiplication on Arm Architectures
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operationson Spatial Accelerators
Journal Article
·
Tue Aug 10 20:00:00 EDT 2021
· IEEE Transactions on Parallel and Distributed Systems
·
OSTI ID:1820407
autoGEMM: Pushing the Limits of Irregular Matrix Multiplication on Arm Architectures
Conference
·
Fri Nov 01 00:00:00 EDT 2024
·
OSTI ID:2480030
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operationson Spatial Accelerators
Conference
·
Mon Oct 18 00:00:00 EDT 2021
·
OSTI ID:1972822