skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach

Journal Article · · Journal of Chemical Theory and Computation
 [1];  [2];  [1];  [3]
  1. University of Basel (Switzerland)
  2. Max-Planck-Institut für Kohlenforschung, Mülheim an der Ruhr (Germany); Friedrich-Alexander University Erlangen-Nuremberg, Bamberg (Germany)
  3. University of Basel (Switzerland); Argonne National Laboratory (ANL), Argonne, IL (United States). Argonne Leadership Computing Facility (ALCF)

Chemically accurate and comprehensive studies of the virtual space of all possible molecules are severely limited by the computational cost of quantum chemistry. Here we introduce a composite strategy that adds machine learning corrections to computationally inexpensive approximate legacy quantum methods. After training, highly accurate predictions of enthalpies, free energies, entropies, and electron correlation energies are possible, for significantly larger molecular sets than used for training. For thermochemical properties of up to 16k isomers of C7H10O2 we present numerical evidence that chemical accuracy can be reached. We also predict electron correlation energy in post Hartree-Fock methods, at the computational cost of HartreeFock, and we establish a qualitative relationship between molecular entropy and electron correlation. The transferability of our approach is demonstrated, using semiempirical quantum chemistry and machine learning models trained on 1 and 10% of 134k organic molecules, to reproduce enthalpies of all remaining molecules at density functional theory level of accuracy.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC); Swiss National Science Foundation (SNSF)
Grant/Contract Number:
AC02-06CH11357; PP00P2_138932
OSTI ID:
1392925
Journal Information:
Journal of Chemical Theory and Computation, Vol. 11, Issue 5; ISSN 1549-9618
Publisher:
American Chemical SocietyCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 486 works
Citation information provided by
Web of Science

References (43)

Chemical space journal December 2004
Stochastic Voyages into Uncharted Chemical Space Produce a Representative Library of All Possible Drug-Like Compounds journal May 2013
First principles view on chemical compound space: Gaining rigorous atomistic control of molecular properties journal February 2013
Combinatorial thinking in chemistry and biology journal April 1997
Click Chemistry: Diverse Chemical Function from a Few Good Reactions journal June 2001
Towards the computational design of solid catalysts journal April 2009
The Many Roles of Computation in Drug Discovery journal March 2004
De novodesign: balancing novelty and confined chemical space journal June 2010
The Harvard Clean Energy Project: Large-Scale Computational Screening and Design of Organic Photovoltaics on the World Community Grid journal August 2011
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
Inverse Strategies for Molecular Design journal January 1996
The inverse band-structure problem of finding an atomic configuration with given electronic properties journal November 1999
Variational Particle Number Approach for Rational Compound Design journal October 2005
Designing Molecules by Optimizing Potentials journal March 2006
A comprehensive chemical kinetic combustion model for the four butanol isomers journal June 2012
Combustion and pyrolysis of iso-butanol: Experimental and chemical kinetic modeling study journal October 2013
Chemical Kinetic Data Base for Combustion Chemistry. Part I. Methane and Related Compounds journal July 1986
Ab initio quantum chemistry: Methodology and applications journal May 2005
Gaussian‐1 theory: A general procedure for prediction of molecular energies journal May 1989
Gaussian‐2 theory for molecular energies of first‐ and second‐row compounds journal June 1991
Gaussian-4 theory journal February 2007
Gaussian-4 theory using reduced order perturbation theory journal September 2007
Quantum chemistry structures and properties of 134 kilo molecules journal August 2014
Neural network approach to quantum-chemistry data: Accurate prediction of density functional theory energies journal August 2009
Combined first-principles calculation and neural-network correction approach for heat of formation journal December 2003
First-principles energetics of water clusters and ice: A many-body analysis journal December 2013
Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning journal January 2012
Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies journal July 2013
Machine learning of molecular electronic properties in chemical compound space journal September 2013
Enumeration of 166 Billion Organic Small Molecules in the Chemical Universe Database GDB-17 journal November 2012
Challenges for Density Functional Theory journal December 2011
Orthogonalization corrections for semiempirical methods journal April 2000
Discovering chemistry with an ab initio nanoreactor journal November 2014
Nearsightedness of electronic matter journal August 2005
Self-consistent-charge density-functional tight-binding method for simulations of complex materials properties journal September 1998
Functional designed to include surface effects in self-consistent density functional theory journal August 2005
SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules journal February 1988
Open Babel: An open chemical toolbox journal October 2011
From atoms and bonds to three-dimensional atomic coordinates: automatic model builders journal November 1993
Optimization of parameters for semiempirical methods VI: more modifications to the NDDO approximations and re-optimization of parameters journal November 2012
Ab Initio Calculation of Vibrational Absorption and Circular Dichroism Spectra Using Density Functional Force Fields journal November 1994
Data-mined similarity function between material compositions journal December 2013
Quantum chemistry structures and properties of 134 kilo molecules text January 2014

Cited By (23)

Computational Approach to Molecular Catalysis by 3d Transition Metals: Challenges and Opportunities journal October 2018
Boosting Quantum Machine Learning Models with a Multilevel Combination Technique: Pople Diagrams Revisited journal December 2018
Self-Parametrizing System-Focused Atomistic Models journal January 2020
Regression Clustering for Improved Accuracy and Training Costs with Molecular-Orbital-Based Machine Learning journal October 2019
Breaking the Coupled Cluster Barrier for Machine-Learned Potentials of Large Molecules: The Case of 15-Atom Acetylacetone journal May 2021
Learning a Local-Variable Model of Aromatic and Conjugated Systems journal December 2017
Read between the Molecules: Computational Insights into Organic Semiconductors journal November 2018
Quantum-chemical insights from deep tensor neural networks journal January 2017
Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning journal July 2019
Energy refinement and analysis of structures in the QM9 database via a highly accurate quantum chemical method journal July 2019
Energy-free machine learning force field for aluminum journal August 2017
Deep elastic strain engineering of bandgap through machine learning journal February 2019
A mixed quantum chemistry/machine learning approach for the fast and accurate prediction of biochemical redox potentials and its large-scale application to 315,000 redox reactions journal April 2019
Machine learning unifies the modeling of materials and molecules journal December 2017
Electronic structure at coarse-grained resolutions from supervised machine learning journal March 2019
Application of Computational Biology and Artificial Intelligence Technologies in Cancer Precision Drug Discovery journal November 2019
Deep Learning for Deep Chemistry: Optimizing the Prediction of Chemical Patterns journal November 2019
Modelling Chemical Reasoning to Predict Reactions text January 2016
Steering Orbital Optimization out of Local Minima and Saddle Points Toward Lower Energy text January 2017
A Density Functional Tight Binding Layer for Deep Learning of Chemical Hamiltonians preprint January 2018
Resolution limit of data-driven coarse-grained models spanning chemical space text January 2019
Inverse Design of Potential Singlet Fission Molecules using a Transfer Learning Based Approach preprint January 2020
Basis set convergence and extrapolation of connected triple excitation contributions (T) in computational thermochemistry: the W4-17 benchmark with up to k functions text January 2021

Similar Records

Machine learning of parameters for accurate semiempirical quantum chemical calculations
Journal Article · Tue Apr 14 00:00:00 EDT 2015 · Journal of Chemical Theory and Computation · OSTI ID:1392925

Machine Learning of Parameters for Accurate Semiempirical Quantum Chemical Calculations
Journal Article · Tue May 12 00:00:00 EDT 2015 · Journal of Chemical Theory and Computation · OSTI ID:1392925

Deep learning of dynamically responsive chemical Hamiltonians with semiempirical quantum mechanics
Journal Article · Tue Jul 05 00:00:00 EDT 2022 · Proceedings of the National Academy of Sciences of the United States of America · OSTI ID:1392925