DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: FINETUNA: fine-tuning accelerated molecular simulations

Journal Article · · Machine Learning: Science and Technology

Abstract Progress towards the energy breakthroughs needed to combat climate change can be significantly accelerated through the efficient simulation of atomistic systems. However, simulation techniques based on first principles, such as density functional theory (DFT), are limited in their practical use due to their high computational expense. Machine learning approaches have the potential to approximate DFT in a computationally efficient manner, which could dramatically increase the impact of computational simulations on real-world problems. However, they are limited by their accuracy and the cost of generating labeled data. Here, we present an online active learning framework for accelerating the simulation of atomic systems efficiently and accurately by incorporating prior physical information learned by large-scale pre-trained graph neural network models from the Open Catalyst Project. Accelerating these simulations enables useful data to be generated more cheaply, allowing better models to be trained and more atomistic systems to be screened. We also present a method of comparing local optimization techniques on the basis of both their speed and accuracy. Experiments on 30 benchmark adsorbate-catalyst systems show that our method of transfer learning to incorporate prior information from pre-trained models accelerates simulations by reducing the number of DFT calculations by 91%, while meeting an accuracy threshold of 0.02 eV 93% of the time. Finally, we demonstrate a technique for leveraging the interactive functionality built in to Vienna ab initio Simulation Package (VASP) to efficiently compute single point calculations within our online active learning framework without the significant startup costs. This allows VASP to work in tandem with our framework while requiring 75% fewer self-consistent cycles than conventional single point calculations. The online active learning implementation, and examples using the VASP interactive code, are available in the open source FINETUNA package on Github.

Research Organization:
Brown University, Providence, RI (United States)
Sponsoring Organization:
USDOE; USDOE Office of Energy Efficiency and Renewable Energy (EERE); USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
SC0019441
OSTI ID:
1888053
Journal Information:
Machine Learning: Science and Technology, Journal Name: Machine Learning: Science and Technology Journal Issue: 3 Vol. 3; ISSN 2632-2153
Publisher:
IOP PublishingCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (34)

Machine learning for heterogeneous catalyst design and discovery journal May 2018
Recent Advances in Heterogeneous Catalysis for Ammonia Synthesis journal September 2020
Ab initio molecular dynamics for liquid metals journal December 1995
Efficiency of ab-initio total energy calculations for metals and semiconductors using a plane-wave basis set journal July 1996
The Brønsted–Evans–Polanyi relation and the volcano curve in heterogeneous catalysis journal May 2004
Heterogeneous Catalysis: A Central Science for a Sustainable Future journal March 2017
Enabling Catalyst Discovery through Machine Learning and High-Throughput Experimentation journal November 2019
TorchANI: A Free and Open Source PyTorch-Based Deep Learning Implementation of the ANI Neural Network Potentials journal June 2020
Extending the Applicability of the ANI Deep Learning Molecular Potential to Sulfur and Halogens journal June 2020
Advances in the Design of Heterogeneous Catalysts and Thermocatalytic Processes for CO 2 Utilization journal November 2020
Open Catalyst 2020 (OC20) Dataset and Community Challenges journal May 2021
Homogeneous, Heterogeneous, and Biological Catalysts for Electrochemical N 2 Reduction toward NH 3 under Ambient Conditions journal April 2019
Heterogeneous Catalytic Reactor for Hydrogen Production from Formic Acid and Its Use in Polymer Electrolyte Fuel Cells journal March 2018
Titanium-Based Hydrides as Heterogeneous Catalysts for Ammonia Synthesis journal December 2017
Density Functional Theory of Electronic Structure journal January 1996
CO2 hydrogenation to high-value products via heterogeneous catalysis journal December 2019
Active learning in materials science with emphasis on adaptive sampling using uncertainties for targeted design journal February 2019
On-the-fly active learning of interpretable Bayesian force fields for atomistic rare events journal March 2020
Accelerated discovery of CO2 electrocatalysts using active machine learning journal May 2020
The ANI-1ccx and ANI-1x data sets, coupled-cluster and density functional theory properties for molecules journal May 2020
Catalysts for nitrogen reduction to ammonia journal July 2018
Active learning across intermetallics to guide discovery of electrocatalysts for CO2 reduction and H2 evolution journal September 2018
ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost journal January 2017
Active learning and neural network potentials accelerate molecular screening of ether-based solvate ionic liquids journal January 2020
High-throughput experimentation meets artificial intelligence: a new pathway to catalyst discovery journal January 2020
Acceleration of saddle-point searches with machine learning journal August 2016
Machine-learning accelerated geometry optimization in molecular simulation journal June 2021
The atomic simulation environment—a Python library for working with atoms journal June 2017
Enabling robust offline active learning for machine learning potentials using simple physics-based priors journal January 2021
Local Bayesian optimizer for atomic structures journal September 2019
Ab initio molecular-dynamics simulation of the liquid-metal–amorphous-semiconductor transition in germanium journal May 1994
Efficient iterative schemes for ab initio total-energy calculations using a plane-wave basis set journal October 1996
Low-Scaling Algorithm for Nudged Elastic Band Calculations Using a Surrogate Machine Learning Model journal April 2019
Generalized Neural-Network Representation of High-Dimensional Potential-Energy Surfaces journal April 2007