skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Machine learning prediction of accurate atomization energies of organic molecules from low-fidelity quantum chemical calculations

Journal Article · · MRS Communications
DOI:https://doi.org/10.1557/mrc.2019.107· OSTI ID:1578167
ORCiD logo [1]; ORCiD logo [1]; ORCiD logo [1];  [2];  [3];  [2]
  1. Argonne National Lab. (ANL), Lemont, IL (United States); Univ. of Chicago, Chicago, IL (United States)
  2. Argonne National Lab. (ANL), Lemont, IL (United States)
  3. Argonne National Lab. (ANL), Lemont, IL (United States); Univ. of Louisville, Louisville, KY (United States)

Recent studies illustrate how machine learning (ML) can be used to bypass a core challenge of molecular modeling: the trade-off between accuracy and computational cost. Here, we assess multiple ML approaches for predicting the atomization energy of organic molecules. Our resulting models learn the difference between low-fidelity, B3LYP, and high-accuracy, G4MP2, atomization energies and predict the G4MP2 atomization energy to 0.005 eV (mean absolute error) for molecules with less than nine heavy atoms (training set of 117,232 entries, test set 13,026) and 0.012 eV for a small set of 66 molecules with between 10 and 14 heavy atoms. As a result, our two best models, which have different accuracy/speed trade-offs, enable the efficient prediction of G4MP2-level energies for large molecules and are available through a simple web interface.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES); USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1578167
Journal Information:
MRS Communications, Vol. 9, Issue 3; ISSN 2159-6859
Publisher:
Materials Research Society - Cambridge University PressCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 26 works
Citation information provided by
Web of Science

References (35)

Boosting Quantum Machine Learning Models with a Multilevel Combination Technique: Pople Diagrams Revisited journal December 2018
The Materials Data Facility: Data Services to Advance Materials Science Research journal July 2016
Molecular graph convolutions: moving beyond fingerprints journal August 2016
MoleculeNet: a benchmark for molecular machine learning journal January 2018
A new mixing of Hartree–Fock and local density‐functional theories journal January 1993
XSEDE: Accelerating Scientific Discovery journal September 2014
SchNet – A deep learning architecture for molecules and materials journal June 2018
The TensorMol-0.1 model chemistry: a neural network augmented with long-range physics journal January 2018
Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach journal April 2015
Gaussian-4 theory using reduced order perturbation theory journal September 2007
Thirty years of density functional theory in computational chemistry: an overview and extensive assessment of 200 density functionals journal April 2017
Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94 journal April 1996
Quantum chemistry structures and properties of 134 kilo molecules journal August 2014
G n theory : G
  • Curtiss, Larry A.; Redfern, Paul C.; Raghavachari, Krishnan
  • Wiley Interdisciplinary Reviews: Computational Molecular Science, Vol. 1, Issue 5 https://doi.org/10.1002/wcms.59
journal June 2011
Next generation interatomic potentials for condensed systems journal July 2014
Multi-fidelity machine learning models for accurate bandgap predictions of solids journal March 2017
Open Babel: An open chemical toolbox journal October 2011
Atomistic calculations and materials informatics: A review journal June 2017
Machine learning for quantum mechanics in a nutshell journal July 2015
Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single- and binary-component solids journal February 2014
Alchemical and structural distribution based representation for universal quantum machine learning journal June 2018
Accurate quantum chemical energies for 133 000 organic molecules journal January 2019
SchNetPack: A Deep Learning Toolbox For Atomistic Systems journal November 2018
Genetic Optimization of Training Sets for Improved Machine Learning Models of Molecular Properties journal March 2017
Predicting molecular properties with covariant compositional networks journal June 2018
A graph-convolutional neural network model for the prediction of chemical reactivity journal January 2019
Genetic Optimization of Training Sets for Improved Machine Learning Models of Molecular Properties text January 2017
Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach text January 2015
Quantum chemistry structures and properties of 134 kilo molecules text January 2014
Machine learning with systematic density-functional theory calculations: Application to melting temperatures of single and binary component solids text January 2013
MoleculeNet: A Benchmark for Molecular Machine Learning preprint January 2017
The TensorMol-0.1 Model Chemistry: a Neural Network Augmented with Long-Range Physics preprint January 2017
SchNet - a deep learning architecture for molecules and materials text January 2017
Generating equilibrium molecules with deep neural networks preprint January 2018
Thirty years of density functional theory in computational chemistry: an overview and extensive assessment of 200 density functionals [Supplemental Data] fileset June 2017

Cited By (1)

A data ecosystem to support machine learning in materials science journal October 2019

Similar Records

Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning
Journal Article · Wed Jul 05 00:00:00 EDT 2023 · Journal of Physical Chemistry. A, Molecules, Spectroscopy, Kinetics, Environment, and General Theory · OSTI ID:1578167

Accurate quantum chemical energies for 133 000 organic molecules
Journal Article · Wed Aug 07 00:00:00 EDT 2019 · Chemical Science · OSTI ID:1578167

Quantum-Chemically Informed Machine Learning: Prediction of Energies of Organic Molecules with 10 to 14 Non-hydrogen Atoms
Journal Article · Mon Jun 15 00:00:00 EDT 2020 · Journal of Physical Chemistry. A, Molecules, Spectroscopy, Kinetics, Environment, and General Theory · OSTI ID:1578167