Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication.
Journal Article
·
· IEEE Transactions on Parallel and Distributed Systems
- Korea Aerospace Univ., Gyeonggi (Korea, Republic of)
- Georgia Inst. of Technology, Atlanta, GA (United States)
- Sandia National Lab. (SNL-NM), Albuquerque, NM (United States)
There is a growing interest in custom spatial accelerators for machine learning applications. These accelerators employ a spatial array of processing elements (PEs) interacting via custom buffer hierarchies and networks-on-chip. The efficiency of these accelerators comes from employing optimized dataflow (i.e., spatial/temporal partitioning of data across the PEs and fine-grained scheduling) strategies to optimize data reuse. The focus of this work is to evaluate these accelerator architectures using a tiled general matrix-matrix multiplication (GEMM) kernel. To do so, we develop a framework that finds optimized mappings (dataflow and tile sizes) for a tiled GEMM for a given spatial accelerator and workload combination, leveraging an analytical cost model for runtime and energy. Finally, our evaluations over five spatial accelerators demonstrate that the tiled GEMM mappings systematically generated by our framework achieve high performance on various GEMM workloads and accelerators.
- Research Organization:
- Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- NA0003525
- OSTI ID:
- 1820407
- Report Number(s):
- SAND--2021-9925J; 698422
- Journal Information:
- IEEE Transactions on Parallel and Distributed Systems, Journal Name: IEEE Transactions on Parallel and Distributed Systems Journal Issue: 4 Vol. 33; ISSN 1045-9219
- Publisher:
- IEEECopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication
autoGEMM: Pushing the Limits of Irregular Matrix Multiplication on Arm Architectures
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operationson Spatial Accelerators
Technical Report
·
Wed Jul 14 00:00:00 EDT 2021
·
OSTI ID:1808019
autoGEMM: Pushing the Limits of Irregular Matrix Multiplication on Arm Architectures
Conference
·
Fri Nov 01 00:00:00 EDT 2024
·
OSTI ID:2480030
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operationson Spatial Accelerators
Conference
·
Mon Oct 18 00:00:00 EDT 2021
·
OSTI ID:1972822