Preliminary Results of Autotuning GEMM Kernels for the NVIDIA Kepler Architecture- GeForce GTX 680

Kurzak, Jakub; Luszczek, Pitor; Tomov, Stanimire; Dongarra, Jack

doi:10.2172/1173292

Title: Preliminary Results of Autotuning GEMM Kernels for the NVIDIA Kepler Architecture- GeForce GTX 680

Technical Report · Sun Apr 01 00:00:00 EDT 2012

DOI:https://doi.org/10.2172/1173292· OSTI ID:1173292

Kurzak, Jakub ^[1]; Luszczek, Pitor ^[1]; Tomov, Stanimire ^[1]; Dongarra, Jack ^[2]

Univ. of Tennessee, Knoxville, TN (United States)
Univ. of Tennessee, Knoxville, TN (United States); Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Univ. of Manchester (United Kingdom)

Kepler is the newest GPU architecture from NVIDIA, and the GTX 680 is the first commercially available graphics card based on that architecture. Matrix multiplication is a canonical computational kernel, and often the main target of initial optimization efforts for a new chip. This article presents preliminary results of automatically tuning matrix multiplication kernels for the Kepler architecture using the GTX 680 card.

View Technical Report

Cite

Export

Save

Research Organization:: Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

Sponsoring Organization:: USDOE Office of Science (SC)

DOE Contract Number:: AC02-05CH11231

OSTI ID:: 1173292

Report Number(s):: LBNL-5788E

Country of Publication:: United States

Language:: English

Similar Records

A performance model for GPUs with caches

Journal Article · Tue Jun 24 00:00:00 EDT 2014 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1173292

Dao, Thanh Tuan; Kim, Jungwon; Seo, Sangmin; +2 more

Performance Analysis of Memory Transfers and GEMM Subroutines on NVIDIA Tesla GPU Cluster

Conference · Mon Aug 31 00:00:00 EDT 2009 · OSTI ID:1173292

Bode, Brett

Optimizing Approximate Weighted Matching on Nvidia Kepler K40

Conference · Wed Sep 30 00:00:00 EDT 2015 · OSTI ID:1173292

Naim, Md; Manne, Fredrik; Halappanavar, Mahantesh; +2 more

Related Subjects

97 MATHEMATICS AND COMPUTING

Title: Preliminary Results of Autotuning GEMM Kernels for the NVIDIA Kepler Architecture- GeForce GTX 680

Citation Formats

Similar Records

Related Subjects