U.S. Department of Energy
Office of Scientific and Technical Information

An OpenMP GPU-offload implementation of a non-equilibrium solidification cellular automata model for additive manufacturing

Journal Article · Computer Physics Communications
This paper investigates performance strategies on GPU-based HPC platforms for a cellular automata (CA) simulation code for non-equilibrium solidification in the metal additive manufacturing (AM) process, including nucleation, grain growth, and solute partitioning and transport, using OpenMP 4.5. To report the speed-up for multicore CPUs and GPUs accurately, a rigorous performance analysis applied optimizations appropriate to both the CPU-only (baseline) code and the GPU-offload code on an isothermal test problem. Performance results on Summit at the Oak Ridge Leadership Computing Facility indicate that using a precomputed list of interface cells significantly decreased the wall-clock time on GPUs. The speed-up due to GPU acceleration was evaluated for a full Summit node and measured to be 1.8X when comparing a run with 6 MPI tasks and 6 GPUs against a run with 36 MPI tasks on the CPU only. The speed-up was 7.9X when comparing 6 MPI tasks with 6 GPUs against 6 MPI tasks running on the CPU only. Performance measurements showed that the system total time is almost constant for runs with more than 96 MPI tasks (or GPUs), indicating excellent weak scaling of the GPU-accelerated code. Finally, a rapid directional solidification problem was considered to demonstrate the capability of the CA code on Summit. A mesh size of at least 0.05 μm is recommended for AM-like simulations in order to obtain accurate elongated grain microstructure and elongated subgrain features, which are in good qualitative agreement with experimental data. The results presented in this study indicate that these performance strategies for the CA code are appropriate for novel exascale HPC platforms.
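The interface-cell list mentioned in the abstract can be illustrated with a minimal C sketch using an OpenMP 4.5 target region. This is not the authors' code: the state encoding, the function names `build_interface_list` and `grow_interface_cells`, and the constant growth increment `dfs` are all hypothetical, chosen only to show the access pattern. The idea sketched here is that the offloaded loop visits only the cells gathered in a compact index list instead of scanning the whole grid and branching on cell state.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical cell states; the paper's actual encoding is not given. */
enum { LIQUID = 0, INTERFACE = 1, SOLID = 2 };

/* Host pass: gather the indices of all interface cells into `list`
 * (which must hold at least n_cells entries) and return their count. */
int build_interface_list(const int *state, int n_cells, int *list)
{
    assert(state != NULL && list != NULL);
    int n_iface = 0;
    for (int i = 0; i < n_cells; ++i)
        if (state[i] == INTERFACE)
            list[n_iface++] = i;
    return n_iface;
}

/* Offloaded pass: advance the solid fraction only for listed cells.
 * `dfs` is an assumed constant increment standing in for the real
 * growth kinetics. Each index appears once in `list`, so iterations
 * write to distinct cells and need no synchronization. Without an
 * OpenMP compiler the pragma is ignored and the loop runs serially. */
void grow_interface_cells(const int *list, int n_iface,
                          int *state, double *fs, int n_cells, double dfs)
{
    #pragma omp target teams distribute parallel for \
        map(to: list[0:n_iface]) map(tofrom: state[0:n_cells], fs[0:n_cells])
    for (int k = 0; k < n_iface; ++k) {
        int i = list[k];
        double f = fs[i] + dfs;
        if (f >= 1.0) {          /* cell fully solidified */
            f = 1.0;
            state[i] = SOLID;
        }
        fs[i] = f;
    }
}
```

In a production code the arrays would more likely stay resident on the device across time steps (for example inside a `target data` region) rather than being mapped on every call; the per-call `map` clauses here only keep the sketch self-contained.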
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE National Nuclear Security Administration (NNSA); USDOE Office of Science (SC)
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1908088
Journal Information:
Journal Name: Computer Physics Communications; Vol. 284; ISSN 0010-4655
Publisher:
Elsevier
Country of Publication:
United States
Language:
English
