OSTI.GOV, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Deep learning of dynamically responsive chemical Hamiltonians with semiempirical quantum mechanics

Journal Article · Proceedings of the National Academy of Sciences of the United States of America
[1]; [2]; [3]; [4]; [5]
  1. Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545; Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545
  2. Computer, Computational and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM 87545
  3. Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545
  4. Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545; Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545; Center for Integrated Nanotechnologies, Los Alamos National Laboratory, Los Alamos, NM 87545
  5. Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM 87545; Center for Integrated Nanotechnologies, Los Alamos National Laboratory, Los Alamos, NM 87545

Conventional machine-learning (ML) models in computational chemistry learn to directly predict molecular properties using quantum chemistry only for reference data. While these heuristic ML methods show quantum-level accuracy with speeds several orders of magnitude faster than traditional quantum chemistry methods, they suffer from poor extensibility and transferability; i.e., their accuracy degrades on large or new chemical systems. Incorporating quantum chemistry frameworks directly into the ML models solves this problem. Here we take the structure of semiempirical quantum mechanics (SEQM) methods to construct dynamically responsive Hamiltonians. SEQM methods use empirical parameters fitted to experimental properties to construct reduced-order Hamiltonians, facilitating much faster calculations than ab initio methods but with compromised accuracy. By replacing these static parameters with machine-learned dynamic values inferred from the local environment, we greatly improve the accuracy of the SEQM methods. Trained on molecular energies and atomic forces, these dynamically generated Hamiltonian parameters show a strong correlation with atomic hybridization and bonding. Trained with only about 60,000 small organic molecular conformers, the resulting model retains interpretability, extensibility, and transferability when tested on much larger chemical systems and when predicting various molecular properties. Overall, this work demonstrates the virtues of incorporating physics-based descriptions with ML to develop models that are simultaneously accurate, transferable, and interpretable.
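
The central idea of the abstract, replacing static Hamiltonian parameters with values inferred from each atom's local environment, can be illustrated with a toy sketch. This is not the authors' model or code: it assumes a Hückel-like tight-binding Hamiltonian in place of an SEQM Hamiltonian, a smooth neighbor count in place of a learned environment representation, and a fixed linear map in place of a trained neural network. All function names and numerical values (`local_descriptor`, `dynamic_onsite`, `alpha0`, `weight`, `cutoff`) are hypothetical and chosen only for illustration.

```python
import numpy as np

def pairwise_distances(positions):
    """Distance matrix for an (n_atoms, 3) array of coordinates."""
    diff = positions[:, None, :] - positions[None, :, :]
    return np.linalg.norm(diff, axis=-1)

def local_descriptor(positions, cutoff=1.5):
    """Smooth neighbor count per atom (toy stand-in for a learned environment embedding)."""
    d = pairwise_distances(positions)
    f = np.where(d < cutoff, 0.5 * (np.cos(np.pi * d / cutoff) + 1.0), 0.0)
    np.fill_diagonal(f, 0.0)  # an atom is not its own neighbor
    return f.sum(axis=1)

def dynamic_onsite(positions, alpha0=-0.5, weight=0.3, cutoff=1.5):
    """Environment-dependent on-site energies; a linear map stands in for a neural network."""
    return alpha0 + weight * local_descriptor(positions, cutoff)

def build_hamiltonian(positions, alpha, beta=-1.0, cutoff=1.5):
    """Toy one-orbital-per-atom Hamiltonian: on-site terms alpha, distance-dependent hopping."""
    d = pairwise_distances(positions)
    H = np.where(d < cutoff, beta * np.exp(-(d - 1.0)), 0.0)
    H[np.diag_indices(len(positions))] = alpha  # scalar (static) or per-atom array (dynamic)
    return H

def total_energy(H, n_electrons):
    """Sum of doubly occupied orbital energies (closed shell, no repulsion terms)."""
    eps = np.linalg.eigvalsh(H)
    return 2.0 * eps[: n_electrons // 2].sum()

# Four atoms on a line with unit spacing, one electron per atom.
pos = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [2.0, 0.0, 0.0], [3.0, 0.0, 0.0]])

H_static = build_hamiltonian(pos, alpha=-0.5)                # one fixed parameter for every atom
H_dynamic = build_hamiltonian(pos, alpha=dynamic_onsite(pos))  # parameter responds to environment

print("static on-site energies: ", np.diag(H_static))
print("dynamic on-site energies:", np.diag(H_dynamic))
print("static energy: ", total_energy(H_static, 4))
print("dynamic energy:", total_energy(H_dynamic, 4))
```

In this sketch the end atoms and the interior atoms receive different on-site energies because their neighbor counts differ, whereas the static variant assigns every atom the same value. In a trained model of the kind the abstract describes, the map from environment to parameters would be learned from reference energies and forces rather than fixed by hand.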

Research Organization:
Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES). Chemical Sciences, Geosciences & Biosciences Division; USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE National Nuclear Security Administration (NNSA)
Grant/Contract Number:
LDRD; 89233218CNA000001; FWP: LANLE3F2
OSTI ID:
1874630
Alternate ID(s):
OSTI ID: 1875159; OSTI ID: 1903543
Report Number(s):
LA-UR-22-31577; e2120333119
Journal Information:
Journal Name: Proceedings of the National Academy of Sciences of the United States of America; Journal Volume: 119; Journal Issue: 27; ISSN 0027-8424
Publisher:
Proceedings of the National Academy of Sciences
Country of Publication:
United States
Language:
English
