DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

This content will become publicly available on Fri Jul 05 00:00:00 EDT 2024

Title: Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning

Abstract

In previous work (Dandu et al., J. Phys. Chem. A, 2022, 126, 4528–4536), we were successful in predicting accurate atomization energies of organic molecules using machine learning (ML) models, obtaining an accuracy as low as 0.1 kcal/mol compared to the G4MP2 method. In this work, we extend the use of these ML models to adiabatic ionization potentials on data sets of energies generated using quantum chemical calculations. Atomic specific corrections that were found to improve atomization energies from quantum chemical calculations have also been used in this study to improve ionization potentials. Here, the quantum chemical calculations were performed on 3405 molecules containing eight or fewer non-hydrogen atoms derived from the QM9 data set, using the B3LYP functional with the 6–31G(2df,p) basis set for optimization. Low-fidelity IPs for these structures were obtained using two density functional methods: B3LYP/6–31+G(2df,p) and ωB97XD/6–311+G(3df,2p). Highly accurate G4MP2 calculations were performed on these optimized structures to obtain high-fidelity IPs to use in ML models based on the low-fidelity IPs. Our best performing ML methods gave IPs of organic molecules within a mean absolute deviation of 0.035 eV from the G4MP2 IPs for the whole data set. This work demonstrates that ML predictions assisted by quantummore » chemical calculations can be used to successfully predict IPs of organic molecules for use in high throughput screening.« less

Authors:
ORCiD logo [1]; ORCiD logo [2]; ORCiD logo [2];  [2]; ORCiD logo [2]
  1. Argonne National Laboratory (ANL), Argonne, IL (United States); Univ. of Illinois, Chicago, IL (United States)
  2. Argonne National Laboratory (ANL), Argonne, IL (United States)
Publication Date:
Research Org.:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Basic Energy Sciences (BES)
OSTI Identifier:
2007518
Grant/Contract Number:  
AC02-06CH11357
Resource Type:
Accepted Manuscript
Journal Name:
Journal of Physical Chemistry. A, Molecules, Spectroscopy, Kinetics, Environment, and General Theory
Additional Journal Information:
Journal Volume: 127; Journal Issue: 28; Journal ID: ISSN 1089-5639
Publisher:
American Chemical Society
Country of Publication:
United States
Language:
English
Subject:
37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY; chemical calculations; density functional theory; energy; ionization; molecules

Citation Formats

Dandu, Naveen K., Ward, Logan, Assary, Rajeev S., Redfern, Paul C., and Curtiss, Larry A. Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning. United States: N. p., 2023. Web. doi:10.1021/acs.jpca.3c00823.
Dandu, Naveen K., Ward, Logan, Assary, Rajeev S., Redfern, Paul C., & Curtiss, Larry A. Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning. United States. https://doi.org/10.1021/acs.jpca.3c00823
Dandu, Naveen K., Ward, Logan, Assary, Rajeev S., Redfern, Paul C., and Curtiss, Larry A. Wed . "Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning". United States. https://doi.org/10.1021/acs.jpca.3c00823.
@article{osti_2007518,
title = {Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning},
author = {Dandu, Naveen K. and Ward, Logan and Assary, Rajeev S. and Redfern, Paul C. and Curtiss, Larry A.},
abstractNote = {In previous work (Dandu et al., J. Phys. Chem. A, 2022, 126, 4528–4536), we were successful in predicting accurate atomization energies of organic molecules using machine learning (ML) models, obtaining an accuracy as low as 0.1 kcal/mol compared to the G4MP2 method. In this work, we extend the use of these ML models to adiabatic ionization potentials on data sets of energies generated using quantum chemical calculations. Atomic specific corrections that were found to improve atomization energies from quantum chemical calculations have also been used in this study to improve ionization potentials. Here, the quantum chemical calculations were performed on 3405 molecules containing eight or fewer non-hydrogen atoms derived from the QM9 data set, using the B3LYP functional with the 6–31G(2df,p) basis set for optimization. Low-fidelity IPs for these structures were obtained using two density functional methods: B3LYP/6–31+G(2df,p) and ωB97XD/6–311+G(3df,2p). Highly accurate G4MP2 calculations were performed on these optimized structures to obtain high-fidelity IPs to use in ML models based on the low-fidelity IPs. Our best performing ML methods gave IPs of organic molecules within a mean absolute deviation of 0.035 eV from the G4MP2 IPs for the whole data set. This work demonstrates that ML predictions assisted by quantum chemical calculations can be used to successfully predict IPs of organic molecules for use in high throughput screening.},
doi = {10.1021/acs.jpca.3c00823},
journal = {Journal of Physical Chemistry. A, Molecules, Spectroscopy, Kinetics, Environment, and General Theory},
number = 28,
volume = 127,
place = {United States},
year = {Wed Jul 05 00:00:00 EDT 2023},
month = {Wed Jul 05 00:00:00 EDT 2023}
}

Journal Article:
Free Publicly Available Full Text
This content will become publicly available on July 5, 2024
Publisher's Version of Record

Save / Share:

Works referenced in this record:

Gaussian-3 theory using reduced Mo/ller-Plesset order
journal, March 1999

  • Curtiss, Larry A.; Redfern, Paul C.; Raghavachari, Krishnan
  • The Journal of Chemical Physics, Vol. 110, Issue 10
  • DOI: 10.1063/1.478385

Photoelectrochemical cells
journal, November 2001


Density‐functional thermochemistry. III. The role of exact exchange
journal, April 1993

  • Becke, Axel D.
  • The Journal of Chemical Physics, Vol. 98, Issue 7, p. 5648-5652
  • DOI: 10.1063/1.464913

Thirty years of density functional theory in computational chemistry: an overview and extensive assessment of 200 density functionals
journal, April 2017


Alchemical and structural distribution based representation for universal quantum machine learning
journal, June 2018

  • Faber, Felix A.; Christensen, Anders S.; Huang, Bing
  • The Journal of Chemical Physics, Vol. 148, Issue 24
  • DOI: 10.1063/1.5020710

Accurate quantum chemical energies for 133 000 organic molecules
journal, January 2019

  • Narayanan, Badri; Redfern, Paul C.; Assary, Rajeev S.
  • Chemical Science, Vol. 10, Issue 31
  • DOI: 10.1039/C9SC02834J

Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density
journal, January 1988


Δ-machine learning for potential energy surfaces: A PIP approach to bring a DFT-based PES to CCSD(T) level of theory
journal, February 2021

  • Nandi, Apurba; Qu, Chen; Houston, Paul L.
  • The Journal of Chemical Physics, Vol. 154, Issue 5
  • DOI: 10.1063/5.0038301

A Fragmentation-Based Graph Embedding Framework for QM/ML
journal, August 2021

  • Collins, Eric M.; Raghavachari, Krishnan
  • The Journal of Physical Chemistry A, Vol. 125, Issue 31
  • DOI: 10.1021/acs.jpca.1c06152

Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach
journal, April 2015

  • Ramakrishnan, Raghunathan; Dral, Pavlo O.; Rupp, Matthias
  • Journal of Chemical Theory and Computation, Vol. 11, Issue 5
  • DOI: 10.1021/acs.jctc.5b00099

Spotting trends in organocatalysis for the next decade
journal, July 2020


The X1 family of methods that combines B3LYP with neural network corrections for an accurate yet efficient prediction of thermochemistry
journal, April 2015

  • Wu, Jianming; Zhou, Yuwei; Xu, Xin
  • International Journal of Quantum Chemistry, Vol. 115, Issue 16
  • DOI: 10.1002/qua.24919

Machine learning corrected alchemical perturbation density functional theory for catalysis applications
journal, October 2020

  • Griego, Charles D.; Zhao, Lingyan; Saravanan, Karthikeyan
  • AIChE Journal, Vol. 66, Issue 12
  • DOI: 10.1002/aic.17041

Effective Molecular Descriptors for Chemical Accuracy at DFT Cost: Fragmentation, Error-Cancellation, and Machine Learning
journal, July 2020

  • Collins, Eric M.; Raghavachari, Krishnan
  • Journal of Chemical Theory and Computation, Vol. 16, Issue 8
  • DOI: 10.1021/acs.jctc.0c00236

Improving the Accuracy of Composite Methods: A G4MP2 Method with G4-like Accuracy and Implications for Machine Learning
journal, July 2022

  • Dandu, Naveen K.; Assary, Rajeev S.; Redfern, Paul C.
  • The Journal of Physical Chemistry A, Vol. 126, Issue 27
  • DOI: 10.1021/acs.jpca.2c01327

Accelerating Electrolyte Discovery for Energy Storage with High-Throughput Screening
journal, January 2015

  • Cheng, Lei; Assary, Rajeev S.; Qu, Xiaohui
  • The Journal of Physical Chemistry Letters, Vol. 6, Issue 2
  • DOI: 10.1021/jz502319n

Gaussian-4 theory using reduced order perturbation theory
journal, September 2007

  • Curtiss, Larry A.; Redfern, Paul C.; Raghavachari, Krishnan
  • The Journal of Chemical Physics, Vol. 127, Issue 12
  • DOI: 10.1063/1.2770701

Gaussian‐1 theory of molecular energies for second‐row compounds
journal, August 1990

  • Curtiss, Larry A.; Jones, Christopher; Trucks, Gary W.
  • The Journal of Chemical Physics, Vol. 93, Issue 4
  • DOI: 10.1063/1.458892

Assessment of Gaussian-3 and density-functional theories on the G3/05 test set of experimental energies
journal, September 2005

  • Curtiss, Larry A.; Redfern, Paul C.; Raghavachari, Krishnan
  • The Journal of Chemical Physics, Vol. 123, Issue 12
  • DOI: 10.1063/1.2039080

Quantum-Chemically Informed Machine Learning: Prediction of Energies of Organic Molecules with 10 to 14 Non-hydrogen Atoms
journal, June 2020

  • Dandu, Naveen; Ward, Logan; Assary, Rajeev S.
  • The Journal of Physical Chemistry A, Vol. 124, Issue 28
  • DOI: 10.1021/acs.jpca.0c01777

Transferable MP2-Based Machine Learning for Accurate Coupled-Cluster Energies
journal, November 2020

  • Townsend, Jacob; Vogiatzis, Konstantinos D.
  • Journal of Chemical Theory and Computation, Vol. 16, Issue 12
  • DOI: 10.1021/acs.jctc.0c00927

Machine learning of molecular electronic properties in chemical compound space
journal, September 2013


Assessment of Gaussian-2 and density functional theories for the computation of enthalpies of formation
journal, January 1997

  • Curtiss, Larry A.; Raghavachari, Krishnan; Redfern, Paul C.
  • The Journal of Chemical Physics, Vol. 106, Issue 3
  • DOI: 10.1063/1.473182

Quantum chemical accuracy from density functional approximations via machine learning
journal, October 2020

  • Bogojeski, Mihail; Vogt-Maranto, Leslie; Tuckerman, Mark E.
  • Nature Communications, Vol. 11, Issue 1
  • DOI: 10.1038/s41467-020-19093-1

A new mixing of Hartree–Fock and local density‐functional theories
journal, January 1993

  • Becke, Axel D.
  • The Journal of Chemical Physics, Vol. 98, Issue 2
  • DOI: 10.1063/1.464304

SchNet – A deep learning architecture for molecules and materials
journal, June 2018

  • Schütt, K. T.; Sauceda, H. E.; Kindermans, P. -J.
  • The Journal of Chemical Physics, Vol. 148, Issue 24
  • DOI: 10.1063/1.5019779

Semiempirical GGA-type density functional constructed with a long-range dispersion correction
journal, January 2006

  • Grimme, Stefan
  • Journal of Computational Chemistry, Vol. 27, Issue 15, p. 1787-1799
  • DOI: 10.1002/jcc.20495

Size-independent neural networks based first-principles method for accurate prediction of heat of formation of fuels
journal, June 2018

  • Yang, GuanYa; Wu, Jiang; Chen, ShuGuang
  • The Journal of Chemical Physics, Vol. 148, Issue 24
  • DOI: 10.1063/1.5024442

Machine learning prediction of accurate atomization energies of organic molecules from low-fidelity quantum chemical calculations
journal, August 2019

  • Ward, Logan; Blaiszik, Ben; Foster, Ian
  • MRS Communications, Vol. 9, Issue 3
  • DOI: 10.1557/mrc.2019.107

Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity
journal, October 2016

  • Huang, Bing; von Lilienfeld, O. Anatole
  • The Journal of Chemical Physics, Vol. 145, Issue 16
  • DOI: 10.1063/1.4964627

Comparing molecules and solids across structural and alchemical space
journal, January 2016

  • De, Sandip; Bartók, Albert P.; Csányi, Gábor
  • Physical Chemistry Chemical Physics, Vol. 18, Issue 20
  • DOI: 10.1039/C6CP00415F

Coupled-cluster reference values for the GW27 and GW100 test sets for the assessment of GW methods
journal, March 2015