U.S. Department of Energy
Office of Scientific and Technical Information

Moment representation in the lattice Boltzmann method on massively parallel hardware

Conference ·
 [1];  [2];  [3];  [4];  [5]
  1. Duke University
  2. ORNL
  3. Universidade do Estado de Santa Catarina
  4. Lawrence Livermore National Laboratory (LLNL)
  5. Duke University, Durham, NC
The widely used lattice Boltzmann method (LBM) for computational fluid dynamics is highly scalable, but also significantly memory bandwidth-bound on current architectures. This paper presents a new regularized LBM implementation that reduces the memory footprint by storing only macroscopic, moment-based data. We show that the amount of data that must be stored in memory during a simulation is reduced by up to 47%. We also present a technique for cache-aware data reuse and show that optimizing cache utilization to limit data motion results in a similar improvement in time to solution. These new algorithms are implemented in the hemodynamics solver HARVEY and demonstrated using both idealized and realistic biological geometries. We develop a performance model for the moment representation algorithm and evaluate its performance on Summit.
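The idea of storing moments instead of distributions can be illustrated with a minimal sketch. It is not the HARVEY implementation. It assumes a standard D2Q9 lattice for brevity (the record does not name the lattice), a second-order regularized reconstruction, and hypothetical function names (`to_moments`, `from_moments`, `storage_reduction`). Under these assumptions, nine distributions per node collapse to six moments (density, two momentum components, three independent second-order components), and the regularized distributions can be rebuilt from them without loss.

```python
# Illustrative sketch only: D2Q9 is an assumed lattice, not stated in the record.
CS2 = 1.0 / 3.0  # lattice speed of sound squared

# Discrete velocities and weights of the standard D2Q9 lattice.
C = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1),
     (1, 1), (-1, 1), (-1, -1), (1, -1)]
W = [4 / 9] + [1 / 9] * 4 + [1 / 36] * 4


def to_moments(f):
    """Collapse 9 distributions into 6 moments: rho, jx, jy, Pxx, Pxy, Pyy."""
    rho = sum(f)
    jx = sum(fi * cx for fi, (cx, cy) in zip(f, C))
    jy = sum(fi * cy for fi, (cx, cy) in zip(f, C))
    pxx = sum(fi * cx * cx for fi, (cx, cy) in zip(f, C))
    pxy = sum(fi * cx * cy for fi, (cx, cy) in zip(f, C))
    pyy = sum(fi * cy * cy for fi, (cx, cy) in zip(f, C))
    return (rho, jx, jy, pxx, pxy, pyy)


def from_moments(m):
    """Rebuild second-order regularized distributions from stored moments."""
    rho, jx, jy, pxx, pxy, pyy = m
    # Second-order moment with the isotropic part rho * cs^2 * I removed.
    sxx, sxy, syy = pxx - rho * CS2, pxy, pyy - rho * CS2
    f = []
    for (cx, cy), w in zip(C, W):
        # Q_i = c_i c_i - cs^2 I, contracted with the symmetric tensor S.
        qxx, qxy, qyy = cx * cx - CS2, cx * cy, cy * cy - CS2
        contraction = qxx * sxx + 2.0 * qxy * sxy + qyy * syy
        f.append(w * (rho + (cx * jx + cy * jy) / CS2
                      + contraction / (2.0 * CS2 * CS2)))
    return f


def storage_reduction(num_distributions, num_moments):
    """Fraction of per-node storage saved by keeping moments only."""
    return 1.0 - num_moments / num_distributions
```

In three dimensions the same set of moments has ten entries (one density, three momentum components, six independent second-order components). If a D3Q19 lattice is assumed, `storage_reduction(19, 10)` gives roughly 0.47, which is in line with the "up to 47%" figure in the abstract, although the record itself does not spell out this derivation.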
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1607181
Country of Publication:
United States
Language:
English

Similar Records

Propagation Pattern for Moment Representation of the Lattice Boltzmann Method
Journal Article · Tue Jul 20 20:00:00 EDT 2021 · IEEE Transactions on Parallel and Distributed Systems · OSTI ID:1814345

Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs
Conference · Wed Nov 01 00:00:00 EDT 2023 · OSTI ID:2224172
