skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Designing a graphics processing unit accelerated petaflop capable lattice Boltzmann solver: Read aligned data layouts and asynchronous communication

Journal Article · · International Journal of High Performance Computing Applications
 [1];  [1];  [2]
  1. Åbo Akademi University, Faculty of Science and Engineering, Åbo, Finland
  2. Department of Physics and Nanoscience Center, University of Jyväskylä, Jyväskylä, Finland; Department of Physics, Tampere University of Technology, Tampere, Finland

The lattice Boltzmann method is a well-established numerical approach for complex fluid flow simulations. Recently, general-purpose graphics processing units (GPUs) have become available as high-performance computing resources at large scale. We report on designing and implementing a lattice Boltzmann solver for multi-GPU systems that achieves 1.79 PFLOPS performance on 16,384 GPUs. To achieve this performance, we introduce a GPU compatible version of the so-called bundle data layout and eliminate the halo sites in order to improve data access alignment. Furthermore, we make use of the possibility to overlap data transfer between the host central processing unit and the device GPU with computing on the GPU. As a benchmark case, we simulate flow in porous media and measure both strong and weak scaling performance with the emphasis being on large-scale simulations using realistic input data.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Office of Science (SC)
DOE Contract Number:
287703
OSTI ID:
1565624
Journal Information:
International Journal of High Performance Computing Applications, Vol. 31, Issue 3; ISSN 1094-3420
Publisher:
SAGE
Country of Publication:
United States
Language:
English

References (16)

Lattice-Boltzmann Method for Complex Flows journal January 2010
Accelerating Lattice Boltzmann Fluid Flow Simulations Using Graphics Processors conference September 2009
The lattice Boltzmann equation: theory and applications journal December 1992
A Knudsen layer theory for lattice gases journal January 1991
A framework for hybrid parallel flow simulations with a trillion cells in complex geometries
  • Godenschwager, Christian; Schornbaum, Florian; Bauer, Martin
  • Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on - SC '13 https://doi.org/10.1145/2503210.2503273
conference January 2013
Scaling soft matter physics to thousands of graphics processing units in parallel journal March 2015
High-precision synthetic computed tomography of reconstructed porous media journal December 2011
Comparison of implementations of the lattice-Boltzmann method journal April 2008
A prospect for computing in porous materials research: Very large fluid flow simulations journal January 2016
Petascale Application of a Coupled CPU-GPU Algorithm for Simulation and Analysis of Multiphase Flow Solutions in Porous Medium Systems
  • McClure, James E.; Wang, Hao; Prins, Jan F.
  • 2014 IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2014 IEEE 28th International Parallel and Distributed Processing Symposium https://doi.org/10.1109/IPDPS.2014.67
conference May 2014
Performance Evaluation of Parallel Large-Scale Lattice Boltzmann Applications on Three Supercomputing Architectures conference January 2004
Lattice BGK Models for Navier-Stokes Equation journal February 1992
Lattice Boltzmann Simulations at Petascale on Multi-GPU Systems with Asynchronous Data Transfer and Strictly Enforced Memory Read Alignment
  • Robertsen, Fredrik; Westerholm, Jan; Mattila, Keijo
  • 2015 23rd Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), 2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing https://doi.org/10.1109/PDP.2015.71
conference March 2015
Parallelization Strategies and Efficiency of CFD Computations in Complex Geometries Using Lattice Boltzmann Methods on High-Performance Computers book January 2002
On the single processor performance of simple lattice Boltzmann kernels journal September 2006
Comparison of different propagation steps for lattice Boltzmann methods journal March 2013

Similar Records

Related Subjects