Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Robust training of machine learning interatomic potentials with dimensionality reduction and stratified sampling

Journal Article · · npj Computational Materials
Abstract

Machine learning interatomic potentials (MLIPs) enable accurate simulations of materials at scales beyond that accessible by ab initio methods and play an increasingly important role in the study and design of materials. However, MLIPs are only as accurate and robust as the data on which they are trained. Here, we present DImensionality-Reduced Encoded Clusters with sTratified (DIRECT) sampling as an approach to select a robust training set of structures from a large and complex configuration space. By applying DIRECT sampling on the Materials Project relaxation trajectories dataset with over one million structures and 89 elements, we develop an improved materials 3-body graph network (M3GNet) universal potential that extrapolates more reliably to unseen structures. We further show that molecular dynamics (MD) simulations with the M3GNet universal potential can be used instead of expensive ab initio MD to rapidly create a large configuration space for target systems. We combined this scheme with DIRECT sampling to develop a reliable moment tensor potential for titanium hydrides without the need for iterative augmentation of training structures. This work paves the way for robust high-throughput development of MLIPs across any compositional complexity.

Sponsoring Organization:
USDOE
OSTI ID:
2311933
Journal Information:
npj Computational Materials, Journal Name: npj Computational Materials Journal Issue: 1 Vol. 10; ISSN 2057-3960
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (49)

Frustration in Super‐Ionic Conductors Unraveled by the Density of Atomistic States journal February 2023
The H−Ti (Hydrogen-Titanium) system journal February 1987
Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set journal July 1996
Hydrogen and deuterium diffusion in titanium dihydrides/dideuterides journal August 1997
Perspectives on Titanium Science and Technology journal February 2013
Active learning of linearly parametrized interatomic potentials journal December 2017
Accelerating high-throughput searches for new alloys with active learning of interatomic potentials journal January 2019
Hydriding of titanium: Recent trends and perspectives in advanced characterization and multiscale modeling journal December 2022
DeePMD-kit: A deep learning package for many-body potential energy representation and molecular dynamics journal July 2018
LAMMPS - a flexible simulation tool for particle-based materials modeling at the atomic, meso, and continuum scales journal February 2022
Spectral neighbor analysis method for automated generation of quantum-accurate interatomic potentials journal March 2015
Hydrogen diffusion in plutonium hydrides from first principles journal December 2021
Compositionally complex perovskite oxides: Discovering a new class of solid electrolytes with interface-enabled conductivity improvements journal July 2023
Bridging the gap between simulated and experimental ionic conductivities in lithium superionic conductors journal November 2021
Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals journal April 2019
Machine Learning Force Fields journal March 2021
Performance and Cost Assessment of Machine Learning Interatomic Potentials journal October 2019
Constructing High-Dimensional Neural Network Potential Energy Surfaces for Gas–Surface Scattering and Reactions journal January 2018
Thermodynamics and Kinetics of the Cathode–Electrolyte Interface in All-Solid-State Li–S Batteries journal September 2022
E(3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials journal May 2022
Machine-learned interatomic potentials by active learning: amorphous and liquid hafnium dioxide journal July 2020
On-the-fly active learning of interpretable Bayesian force fields for atomistic rare events journal March 2020
Complex strengthening mechanisms in the NbMoTaW multi-principal element alloy journal June 2020
AtomSets as a hierarchical transfer learning framework for small and large materials datasets journal October 2021
Training data selection for accuracy and transferability of interatomic potentials journal September 2022
Hyperactive learning for data-driven interatomic potentials journal September 2023
Origins of structural and electronic transitions in disordered silicon journal January 2021
CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling journal September 2023
Learning properties of ordered and disordered materials from multi-fidelity data journal January 2021
A universal graph deep learning interatomic potential for the periodic table journal November 2022
Uncertainty-driven dynamics for active learning of interatomic potentials journal March 2023
Diffusion of hydrogen in titanium, Ti88Al12 and Ti3Al journal January 1996
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
An analysis of hydrated proton diffusion in ab initio molecular dynamics journal January 2015
Reactive atomistic simulations of Diels-Alder reactions: The importance of molecular rotations journal September 2019
An entropy-maximization approach to automated training set generation for interatomic potentials journal September 2020
Optimal data generation for machine learned interatomic potentials journal December 2022
Hydrogen diffusion in bccTiHxandTi1−yVyHx journal April 1988
Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set journal October 1996
High-dimensional neural network potentials for metal surfaces: A prototype study for copper journal January 2012
Lattice dynamics and electron-phonon coupling calculations using nondiagonal supercells journal November 2015
Gaussian Approximation Potentials: The Accuracy of Quantum Mechanics, without the Electrons journal April 2010
Generalized Gradient Approximation Made Simple journal October 1996
Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces journal April 2007
Active learning of uniformly accurate interatomic potentials for materials simulation journal February 2019
Moment Tensor Potentials: A Class of Systematically Improvable Interatomic Potentials journal January 2016
BIRCH: an efficient data clustering method for very large databases journal June 1996
Solubility and Diffusion of Hydrogen in Pure Metals and Alloys journal January 2001
Mpf.2021.2.8 dataset January 2022

Similar Records

Benchmarking structural evolution methods for training of machine learned interatomic potentials
Journal Article · Thu Jul 21 20:00:00 EDT 2022 · Journal of Physics. Condensed Matter · OSTI ID:1982442

Systematic softening in universal machine learning interatomic potentials
Journal Article · Thu Jan 09 19:00:00 EST 2025 · npj Computational Materials · OSTI ID:2528028

Related Subjects