skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Optimizing the performance of reactive molecular dynamics simulations for many-core architectures

Abstract

Reactive molecular dynamics simulations are computationally demanding. Reaching spatial and temporal scales where interesting scientific phenomena can be observed requires efficient and scalable implementations on modern hardware. In this article, we focus on optimizing the performance of the widely used LAMMPS/ReaxC package for many-core architectures. As hybrid parallelism allows better leverage of the increasing on-node parallelism, we adopt thread parallelism in the construction of bonded and nonbonded lists and in the computation of complex ReaxFF interactions. To mitigate the I/O overheads due to large volumes of trajectory data produced and to save users the burden of post-processing, we also develop a novel in situ tool for molecular species analysis. We analyze the performance of the resulting ReaxC-OMP package on two different architectures: (i) Mira, an IBM Blue Gene/Q system and (ii) Cori-II, a Cray XC-40 sytem with Knights Landing processors. For Pentaerythritol tetranitrate (PETN) systems of sizes ranging from 32 thousand to 16.6 million particles, we observe speedups in the range of 1.5-4.5x. We observe sustained performance improvements for up to 262,144 cores (1,048,576 processes) of Mira and a weak scaling efficiency of 91.5% in large simulations containing 16.6 million particles. The in situ molecular species analysis tool incurs onlymore » insignificant overheads across various system sizes and runs configurations.« less

Authors:
; ; ; ; ;
Publication Date:
Research Org.:
Argonne National Lab. (ANL), Argonne, IL (United States)
Sponsoring Org.:
Argonne National Laboratory - Argonne Leadership Computing Facility
OSTI Identifier:
1530430
DOE Contract Number:  
AC02-06CH11357
Resource Type:
Journal Article
Journal Name:
International Journal of High Performance Computing Applications
Additional Journal Information:
Journal Volume: 33; Journal Issue: 2
Country of Publication:
United States
Language:
English
Subject:
Hybrid parallelism; LAMMPS-ReaxC-OMP; Multi-core architectures; Reactive Molecular Dynamics; ReaxFF

Citation Formats

Aktulga, Hasan Metin, Knight, Chris, Coffman, Paul, O’Hearn, Kurt A, Shan, Tzu-ray, and Jiang, Wei. Optimizing the performance of reactive molecular dynamics simulations for many-core architectures. United States: N. p., 2019. Web. doi:10.1177/1094342017746221.
Aktulga, Hasan Metin, Knight, Chris, Coffman, Paul, O’Hearn, Kurt A, Shan, Tzu-ray, & Jiang, Wei. Optimizing the performance of reactive molecular dynamics simulations for many-core architectures. United States. doi:10.1177/1094342017746221.
Aktulga, Hasan Metin, Knight, Chris, Coffman, Paul, O’Hearn, Kurt A, Shan, Tzu-ray, and Jiang, Wei. Fri . "Optimizing the performance of reactive molecular dynamics simulations for many-core architectures". United States. doi:10.1177/1094342017746221.
@article{osti_1530430,
title = {Optimizing the performance of reactive molecular dynamics simulations for many-core architectures},
author = {Aktulga, Hasan Metin and Knight, Chris and Coffman, Paul and O’Hearn, Kurt A and Shan, Tzu-ray and Jiang, Wei},
abstractNote = {Reactive molecular dynamics simulations are computationally demanding. Reaching spatial and temporal scales where interesting scientific phenomena can be observed requires efficient and scalable implementations on modern hardware. In this article, we focus on optimizing the performance of the widely used LAMMPS/ReaxC package for many-core architectures. As hybrid parallelism allows better leverage of the increasing on-node parallelism, we adopt thread parallelism in the construction of bonded and nonbonded lists and in the computation of complex ReaxFF interactions. To mitigate the I/O overheads due to large volumes of trajectory data produced and to save users the burden of post-processing, we also develop a novel in situ tool for molecular species analysis. We analyze the performance of the resulting ReaxC-OMP package on two different architectures: (i) Mira, an IBM Blue Gene/Q system and (ii) Cori-II, a Cray XC-40 sytem with Knights Landing processors. For Pentaerythritol tetranitrate (PETN) systems of sizes ranging from 32 thousand to 16.6 million particles, we observe speedups in the range of 1.5-4.5x. We observe sustained performance improvements for up to 262,144 cores (1,048,576 processes) of Mira and a weak scaling efficiency of 91.5% in large simulations containing 16.6 million particles. The in situ molecular species analysis tool incurs only insignificant overheads across various system sizes and runs configurations.},
doi = {10.1177/1094342017746221},
journal = {International Journal of High Performance Computing Applications},
number = 2,
volume = 33,
place = {United States},
year = {2019},
month = {3}
}