Title: Lattice Boltzmann simulation optimization on leading multicore platforms

Conference ·
 [1];  [2];  [2];  [2];  [1]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Univ. of California, Berkeley, CA (United States)
  2. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

We present an auto-tuning approach to optimize application performance on emerging multicore architectures. The methodology extends the idea of search-based performance optimizations, popular in linear algebra and FFT libraries, to application-specific computational kernels. Our work applies this strategy to a lattice Boltzmann application (LBMHD) that historically has made poor use of scalar microprocessors due to its complex data structures and memory access patterns. We explore one of the broadest sets of multicore architectures in the HPC literature, including the Intel Clovertown, AMD Opteron X2, Sun Niagara2, and STI Cell, as well as the single-core Intel Itanium2. Rather than hand-tuning LBMHD for each system, we develop a code generator that allows us to identify a highly optimized version for each platform while amortizing the human programming effort. Results show that our auto-tuned LBMHD application achieves up to a 14x improvement compared with the original code. Additionally, we present a detailed analysis of each optimization, which reveals surprising hardware bottlenecks and software challenges for future multicore systems and applications.
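
The general idea of search-based auto-tuning described above (generate several variants of a kernel, benchmark each on the target machine, keep the fastest) can be illustrated with a minimal sketch. This is not the authors' code generator or the LBMHD kernel: the function names `generate_kernel`, `time_kernel`, and `autotune` are hypothetical, the kernel is a toy sum of squares, and loop unrolling is assumed here as the only tuning parameter.

```python
import time


def generate_kernel(unroll):
    """Emit and compile a sum-of-squares loop unrolled `unroll` times.

    Hypothetical stand-in for a code generator; the toy kernel is an
    assumption for illustration, not the LBMHD computation.
    """
    body = "\n".join(
        f"        total += data[i + {k}] * data[i + {k}]" for k in range(unroll)
    )
    source = (
        "def kernel(data):\n"
        "    total = 0.0\n"
        "    n = len(data)\n"
        f"    main = n - n % {unroll}\n"          # part covered by the unrolled loop
        f"    for i in range(0, main, {unroll}):\n"
        f"{body}\n"
        "    for i in range(main, n):\n"          # remainder loop
        "        total += data[i] * data[i]\n"
        "    return total\n"
    )
    namespace = {}
    exec(source, namespace)
    return namespace["kernel"]


def time_kernel(kernel, data, repeats=3):
    """Return the best wall-clock time over a few runs of one variant."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        kernel(data)
        best = min(best, time.perf_counter() - start)
    return best


def autotune(unroll_factors, data, cost=time_kernel):
    """Exhaustively search the variants and return (best_factor, all_costs)."""
    results = {u: cost(generate_kernel(u), data) for u in unroll_factors}
    best = min(results, key=results.get)
    return best, results


if __name__ == "__main__":
    sample = [float(i) for i in range(100_000)]
    best, results = autotune([1, 2, 4, 8], sample)
    print("fastest unroll factor on this machine:", best)
```

Because the winning variant is chosen by measurement rather than by a hand-written rule, the same script can be rerun unchanged on each platform, which is the sense in which the programming effort is amortized.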

Research Organization:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
DOE Contract Number:
AC02-05CH11231
OSTI ID:
1407059
Resource Relation:
Conference: Parallel and Distributed Processing, 2008. IPDPS 2008, Miami, FL, USA, 14-18 April 2008
Country of Publication:
United States
Language:
English

Similar Records

Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms
Conference · February 1, 2008 · OSTI ID:1407059

Optimization of a Lattice Boltzmann Computation on State-of-the-Art Multicore Platforms
Journal Article · April 10, 2009 · Journal of Parallel and Distributed Computing · OSTI ID:1407059

PERI - Auto-tuning Memory Intensive Kernels for Multicore
Conference · June 24, 2008 · OSTI ID:1407059
