Resource-Efficient, Hierarchical Auto-Tuning of a Hybrid Lattice Boltzmann Computation on the Cray XT4

Williams, Samuel; Carter, Jonathan; Oliker, Leonid; Shalf, John; Yelick, Katherine

Conference · Mon May 04 2009

OSTI ID:962937

We apply auto-tuning to a hybrid MPI/pthreads lattice Boltzmann computation running on the Cray XT4 at the National Energy Research Scientific Computing Center (NERSC). Previous work showed that multicore-specific auto-tuning can improve the performance of lattice Boltzmann magnetohydrodynamics (LBMHD) by a factor of 4 when running on dual- and quad-core Opteron dual-socket SMPs. We extend these studies to the distributed-memory arena via a hybrid MPI/pthreads implementation. In addition to conventional auto-tuning at the local SMP node, we tune at the message-passing level to determine the optimal aspect ratio as well as the correct balance between MPI tasks and threads per MPI task. Our study presents a detailed performance analysis when moving along an isocurve of constant hardware usage: fixed total memory, total cores, and total nodes. Overall, our work points to approaches for improving intra- and inter-node efficiency on large-scale multicore systems for demanding scientific applications.
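To make the "isocurve of constant hardware usage" concrete, the following is a minimal Python sketch of how the message-passing-level search space might be enumerated. It is not the authors' tuner: the function names, the choice of a 3D process grid, and the example node and core counts are all assumptions made for illustration. It only lists candidate configurations; a real auto-tuner would run and time each one.

```python
def divisors(n):
    """Return all positive divisors of n in ascending order."""
    return [d for d in range(1, n + 1) if n % d == 0]


def hybrid_configs(nodes, cores_per_node):
    """Enumerate (mpi_tasks, threads_per_task) pairs that use every core.

    The total core count stays fixed at nodes * cores_per_node, so each
    pair is one point on a constant-hardware isocurve (hypothetical helper).
    """
    configs = []
    for threads in divisors(cores_per_node):
        tasks_per_node = cores_per_node // threads
        configs.append((nodes * tasks_per_node, threads))
    return configs


def process_grids(tasks):
    """Enumerate 3D process grids (px, py, pz) with px * py * pz == tasks.

    Each grid is one candidate aspect ratio for the domain decomposition.
    A 3D decomposition is assumed here for illustration.
    """
    grids = []
    for px in divisors(tasks):
        for py in divisors(tasks // px):
            grids.append((px, py, tasks // (px * py)))
    return grids


def search_space(nodes, cores_per_node):
    """Cross product of task/thread balances and process-grid shapes."""
    return [
        (tasks, threads, grid)
        for tasks, threads in hybrid_configs(nodes, cores_per_node)
        for grid in process_grids(tasks)
    ]


if __name__ == "__main__":
    # Example values only: 4 nodes with 4 cores each (assumed, not from the paper).
    for tasks, threads, grid in search_space(4, 4):
        print(f"{tasks:3d} MPI tasks x {threads} threads, grid {grid}")
```

Every candidate keeps the product of MPI tasks and threads per task equal to the total core count, so differences in measured runtime would reflect the decomposition rather than the amount of hardware used.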

Research Organization: Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)

Sponsoring Organization: Computational Research Division

DOE Contract Number: DE-AC02-05CH11231

OSTI ID: 962937

Report Number(s): LBNL-2088E; TRN: US0902975

Resource Relation: Conference: Cray User Group (CUG), 2009

Country of Publication: United States

Language: English

Similar Records

PERI - Auto-tuning Memory Intensive Kernels for Multicore

Conference · Tue Jun 24 2008 · OSTI ID:962937

Bailey, David H; Williams, Samuel; Datta, Kaushik; +5 more

Optimization of a Lattice Boltzmann Computation on State-of-the-Art Multicore Platforms

Journal Article · Fri Apr 10 2009 · Journal of Parallel and Distributed Computing · OSTI ID:962937

Williams, Samuel; Carter, Jonathan; Oliker, Leonid; +2 more

Lattice Boltzmann Simulation Optimization on Leading Multicore Platforms

Conference · Fri Feb 01 2008 · OSTI ID:962937

Williams, Samuel; Carter, Jonathan; Oliker, Leonid; +2 more

Related Subjects

97
ASPECT RATIO
EFFICIENCY
IMPLEMENTATION
MAGNETOHYDRODYNAMICS
PERFORMANCE
Lattice Boltzmann
Hybrid
MPI
Multicore
Auto-tuning
