Preliminary Results of Autotuning GEMM Kernels for the NVIDIA Kepler Architecture- GeForce GTX 680
- Univ. of Tennessee, Knoxville, TN (United States)
- Univ. of Tennessee, Knoxville, TN (United States); Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Univ. of Manchester (United Kingdom)
Kepler is the newest GPU architecture from NVIDIA, and the GTX 680 is the first commercially available graphics card based on that architecture. Matrix multiplication is a canonical computational kernel, and often the main target of initial optimization efforts for a new chip. This article presents preliminary results of automatically tuning matrix multiplication kernels for the Kepler architecture using the GTX 680 card.
- Research Organization:
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC)
- DOE Contract Number:
- AC02-05CH11231
- OSTI ID:
- 1173292
- Report Number(s):
- LBNL-5788E
- Country of Publication:
- United States
- Language:
- English
Similar Records
A performance model for GPUs with caches
Performance Analysis of Memory Transfers and GEMM Subroutines on NVIDIA Tesla GPU Cluster
Optimizing Approximate Weighted Matching on Nvidia Kepler K40
Journal Article
·
Tue Jun 24 00:00:00 EDT 2014
· IEEE Transactions on Parallel and Distributed Systems
·
OSTI ID:1173292
+2 more
Performance Analysis of Memory Transfers and GEMM Subroutines on NVIDIA Tesla GPU Cluster
Conference
·
Mon Aug 31 00:00:00 EDT 2009
·
OSTI ID:1173292
Optimizing Approximate Weighted Matching on Nvidia Kepler K40
Conference
·
Wed Sep 30 00:00:00 EDT 2015
·
OSTI ID:1173292
+2 more