U.S. Department of Energy
Office of Scientific and Technical Information

Title: A nearsighted force-training approach to systematically generate training data for the machine learning of large atomic structures

Journal Article · Journal of Chemical Physics
DOI: https://doi.org/10.1063/5.0079314 · OSTI ID:1979022

A challenge of atomistic machine-learning (ML) methods is ensuring that the training data are suitable for the system being simulated, which is particularly difficult for systems with large numbers of atoms. Most atomistic ML approaches rely on the nearsightedness principle (“all chemistry is local”), using information about the positions of an atom’s neighbors to predict a per-atom energy. In this work, we develop a framework that exploits the nearsighted nature of ML models to systematically produce an appropriate training set for large structures. We use a per-atom uncertainty estimate to identify the most uncertain atoms and extract chunks centered around these atoms. It is crucial that these small chunks are both large enough to satisfy the ML model’s nearsightedness principle (that is, to fill the cutoff radius) and large enough to be converged with respect to the electronic structure calculation. We present data indicating when the electronic structure calculations are converged with respect to the structure size, which fundamentally limits the accuracy of any nearsighted ML calculator. These new atomic chunks are evaluated with electronic structure calculations, and crucially, only a single force (that of the central atom) is added to the growing training set, preventing the noisy and irrelevant information from the chunk’s boundary from interfering with ML training. The resulting ML potentials are robust, despite requiring single-point calculations on only small reference structures and never seeing large training structures. We demonstrate our approach via structure optimization of a 260-atom structure and extend the approach to clusters with up to 1415 atoms.
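The selection step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the spherical chunk shape, and the `toy_reference_forces` stand-in for an electronic structure calculation are all assumptions made for illustration. It only shows the general idea of ranking atoms by a per-atom uncertainty, cutting out a chunk around each of the most uncertain atoms, and keeping just the central atom's force as a training sample.

```python
import numpy as np


def extract_chunk(positions, center, radius):
    """Indices of atoms within `radius` of atom `center`, with the center listed first."""
    dists = np.linalg.norm(positions - positions[center], axis=1)
    neighbors = np.flatnonzero(dists <= radius)
    neighbors = neighbors[neighbors != center]
    return np.concatenate(([center], neighbors))


def select_chunks(positions, uncertainty, radius, n_chunks):
    """Build one chunk around each of the `n_chunks` most uncertain atoms."""
    most_uncertain = np.argsort(uncertainty)[::-1][:n_chunks]
    return [extract_chunk(positions, int(c), radius) for c in most_uncertain]


def central_force_samples(positions, chunks, reference_forces):
    """Evaluate each chunk with `reference_forces` and keep only the central atom's force.

    `reference_forces` is a hypothetical callable mapping (n, 3) positions to
    (n, 3) forces; in practice it would wrap an electronic structure code.
    """
    samples = []
    for idx in chunks:
        chunk_positions = positions[idx]
        forces = reference_forces(chunk_positions)
        # Row 0 is the central atom; forces on boundary atoms are discarded.
        samples.append((chunk_positions, forces[0]))
    return samples


def toy_reference_forces(chunk_positions):
    """Placeholder force model (NOT physical): pull each atom toward the chunk centroid."""
    return -(chunk_positions - chunk_positions.mean(axis=0))


# Toy usage: a 10-atom chain along x with made-up uncertainties.
positions = np.zeros((10, 3))
positions[:, 0] = np.arange(10.0)
uncertainty = np.arange(10.0)

chunks = select_chunks(positions, uncertainty, radius=2.5, n_chunks=2)
samples = central_force_samples(positions, chunks, toy_reference_forces)
```

In a real workflow the chunk radius would need to be at least the ML cutoff radius and large enough for the electronic structure result on the central atom to be converged, as the abstract emphasizes; the value 2.5 above is arbitrary.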

Research Organization:
Brown Univ., Providence, RI (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
SC0019441
Journal Information:
Journal of Chemical Physics, Vol. 156, Issue 6; ISSN 0021-9606
Publisher:
American Institute of Physics (AIP)
Country of Publication:
United States
Language:
English