This content will become publicly available on Fri Jul 05 00:00:00 EDT 2024
Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning
Abstract
In previous work (Dandu et al., J. Phys. Chem. A, 2022, 126, 4528–4536), we were successful in predicting accurate atomization energies of organic molecules using machine learning (ML) models, obtaining an accuracy as low as 0.1 kcal/mol compared to the G4MP2 method. In this work, we extend the use of these ML models to adiabatic ionization potentials on data sets of energies generated using quantum chemical calculations. Atomic specific corrections that were found to improve atomization energies from quantum chemical calculations have also been used in this study to improve ionization potentials. Here, the quantum chemical calculations were performed on 3405 molecules containing eight or fewer non-hydrogen atoms derived from the QM9 data set, using the B3LYP functional with the 6–31G(2df,p) basis set for optimization. Low-fidelity IPs for these structures were obtained using two density functional methods: B3LYP/6–31+G(2df,p) and ωB97XD/6–311+G(3df,2p). Highly accurate G4MP2 calculations were performed on these optimized structures to obtain high-fidelity IPs to use in ML models based on the low-fidelity IPs. Our best performing ML methods gave IPs of organic molecules within a mean absolute deviation of 0.035 eV from the G4MP2 IPs for the whole data set. This work demonstrates that ML predictions assisted by quantummore »
- Authors:
-
- Argonne National Laboratory (ANL), Argonne, IL (United States); Univ. of Illinois, Chicago, IL (United States)
- Argonne National Laboratory (ANL), Argonne, IL (United States)
- Publication Date:
- Research Org.:
- Argonne National Laboratory (ANL), Argonne, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Basic Energy Sciences (BES)
- OSTI Identifier:
- 2007518
- Grant/Contract Number:
- AC02-06CH11357
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Journal of Physical Chemistry. A, Molecules, Spectroscopy, Kinetics, Environment, and General Theory
- Additional Journal Information:
- Journal Volume: 127; Journal Issue: 28; Journal ID: ISSN 1089-5639
- Publisher:
- American Chemical Society
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY; chemical calculations; density functional theory; energy; ionization; molecules
Citation Formats
Dandu, Naveen K., Ward, Logan, Assary, Rajeev S., Redfern, Paul C., and Curtiss, Larry A. Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning. United States: N. p., 2023.
Web. doi:10.1021/acs.jpca.3c00823.
Dandu, Naveen K., Ward, Logan, Assary, Rajeev S., Redfern, Paul C., & Curtiss, Larry A. Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning. United States. https://doi.org/10.1021/acs.jpca.3c00823
Dandu, Naveen K., Ward, Logan, Assary, Rajeev S., Redfern, Paul C., and Curtiss, Larry A. Wed .
"Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning". United States. https://doi.org/10.1021/acs.jpca.3c00823.
@article{osti_2007518,
title = {Accurate Prediction of Adiabatic Ionization Potentials of Organic Molecules using Quantum Chemistry Assisted Machine Learning},
author = {Dandu, Naveen K. and Ward, Logan and Assary, Rajeev S. and Redfern, Paul C. and Curtiss, Larry A.},
abstractNote = {In previous work (Dandu et al., J. Phys. Chem. A, 2022, 126, 4528–4536), we were successful in predicting accurate atomization energies of organic molecules using machine learning (ML) models, obtaining an accuracy as low as 0.1 kcal/mol compared to the G4MP2 method. In this work, we extend the use of these ML models to adiabatic ionization potentials on data sets of energies generated using quantum chemical calculations. Atomic specific corrections that were found to improve atomization energies from quantum chemical calculations have also been used in this study to improve ionization potentials. Here, the quantum chemical calculations were performed on 3405 molecules containing eight or fewer non-hydrogen atoms derived from the QM9 data set, using the B3LYP functional with the 6–31G(2df,p) basis set for optimization. Low-fidelity IPs for these structures were obtained using two density functional methods: B3LYP/6–31+G(2df,p) and ωB97XD/6–311+G(3df,2p). Highly accurate G4MP2 calculations were performed on these optimized structures to obtain high-fidelity IPs to use in ML models based on the low-fidelity IPs. Our best performing ML methods gave IPs of organic molecules within a mean absolute deviation of 0.035 eV from the G4MP2 IPs for the whole data set. This work demonstrates that ML predictions assisted by quantum chemical calculations can be used to successfully predict IPs of organic molecules for use in high throughput screening.},
doi = {10.1021/acs.jpca.3c00823},
journal = {Journal of Physical Chemistry. A, Molecules, Spectroscopy, Kinetics, Environment, and General Theory},
number = 28,
volume = 127,
place = {United States},
year = {Wed Jul 05 00:00:00 EDT 2023},
month = {Wed Jul 05 00:00:00 EDT 2023}
}
Works referenced in this record:
Gaussian-3 theory using reduced Mo/ller-Plesset order
journal, March 1999
- Curtiss, Larry A.; Redfern, Paul C.; Raghavachari, Krishnan
- The Journal of Chemical Physics, Vol. 110, Issue 10
Density‐functional thermochemistry. III. The role of exact exchange
journal, April 1993
- Becke, Axel D.
- The Journal of Chemical Physics, Vol. 98, Issue 7, p. 5648-5652
Thirty years of density functional theory in computational chemistry: an overview and extensive assessment of 200 density functionals
journal, April 2017
- Mardirossian, Narbe; Head-Gordon, Martin
- Molecular Physics, Vol. 115, Issue 19
Alchemical and structural distribution based representation for universal quantum machine learning
journal, June 2018
- Faber, Felix A.; Christensen, Anders S.; Huang, Bing
- The Journal of Chemical Physics, Vol. 148, Issue 24
Accurate quantum chemical energies for 133 000 organic molecules
journal, January 2019
- Narayanan, Badri; Redfern, Paul C.; Assary, Rajeev S.
- Chemical Science, Vol. 10, Issue 31
Development of the Colle-Salvetti correlation-energy formula into a functional of the electron density
journal, January 1988
- Lee, Chengteh; Yang, Weitao; Parr, Robert G.
- Physical Review B, Vol. 37, Issue 2
Δ-machine learning for potential energy surfaces: A PIP approach to bring a DFT-based PES to CCSD(T) level of theory
journal, February 2021
- Nandi, Apurba; Qu, Chen; Houston, Paul L.
- The Journal of Chemical Physics, Vol. 154, Issue 5
A Fragmentation-Based Graph Embedding Framework for QM/ML
journal, August 2021
- Collins, Eric M.; Raghavachari, Krishnan
- The Journal of Physical Chemistry A, Vol. 125, Issue 31
Big Data Meets Quantum Chemistry Approximations: The Δ-Machine Learning Approach
journal, April 2015
- Ramakrishnan, Raghunathan; Dral, Pavlo O.; Rupp, Matthias
- Journal of Chemical Theory and Computation, Vol. 11, Issue 5
Spotting trends in organocatalysis for the next decade
journal, July 2020
- Lassaletta, José M.
- Nature Communications, Vol. 11, Issue 1
The X 1 family of methods that combines B 3LYP with neural network corrections for an accurate yet efficient prediction of thermochemistry
journal, April 2015
- Wu, Jianming; Zhou, Yuwei; Xu, Xin
- International Journal of Quantum Chemistry, Vol. 115, Issue 16
Machine learning corrected alchemical perturbation density functional theory for catalysis applications
journal, October 2020
- Griego, Charles D.; Zhao, Lingyan; Saravanan, Karthikeyan
- AIChE Journal, Vol. 66, Issue 12
Effective Molecular Descriptors for Chemical Accuracy at DFT Cost: Fragmentation, Error-Cancellation, and Machine Learning
journal, July 2020
- Collins, Eric M.; Raghavachari, Krishnan
- Journal of Chemical Theory and Computation, Vol. 16, Issue 8
Improving the Accuracy of Composite Methods: A G4MP2 Method with G4-like Accuracy and Implications for Machine Learning
journal, July 2022
- Dandu, Naveen K.; Assary, Rajeev S.; Redfern, Paul C.
- The Journal of Physical Chemistry A, Vol. 126, Issue 27
Accelerating Electrolyte Discovery for Energy Storage with High-Throughput Screening
journal, January 2015
- Cheng, Lei; Assary, Rajeev S.; Qu, Xiaohui
- The Journal of Physical Chemistry Letters, Vol. 6, Issue 2
Gaussian-4 theory using reduced order perturbation theory
journal, September 2007
- Curtiss, Larry A.; Redfern, Paul C.; Raghavachari, Krishnan
- The Journal of Chemical Physics, Vol. 127, Issue 12
Gaussian‐1 theory of molecular energies for second‐row compounds
journal, August 1990
- Curtiss, Larry A.; Jones, Christopher; Trucks, Gary W.
- The Journal of Chemical Physics, Vol. 93, Issue 4
Assessment of Gaussian-3 and density-functional theories on the G3/05 test set of experimental energies
journal, September 2005
- Curtiss, Larry A.; Redfern, Paul C.; Raghavachari, Krishnan
- The Journal of Chemical Physics, Vol. 123, Issue 12
Quantum-Chemically Informed Machine Learning: Prediction of Energies of Organic Molecules with 10 to 14 Non-hydrogen Atoms
journal, June 2020
- Dandu, Naveen; Ward, Logan; Assary, Rajeev S.
- The Journal of Physical Chemistry A, Vol. 124, Issue 28
Transferable MP2-Based Machine Learning for Accurate Coupled-Cluster Energies
journal, November 2020
- Townsend, Jacob; Vogiatzis, Konstantinos D.
- Journal of Chemical Theory and Computation, Vol. 16, Issue 12
Machine learning of molecular electronic properties in chemical compound space
journal, September 2013
- Montavon, Grégoire; Rupp, Matthias; Gobre, Vivekanand
- New Journal of Physics, Vol. 15, Issue 9
Assessment of Gaussian-2 and density functional theories for the computation of enthalpies of formation
journal, January 1997
- Curtiss, Larry A.; Raghavachari, Krishnan; Redfern, Paul C.
- The Journal of Chemical Physics, Vol. 106, Issue 3
Quantum chemical accuracy from density functional approximations via machine learning
journal, October 2020
- Bogojeski, Mihail; Vogt-Maranto, Leslie; Tuckerman, Mark E.
- Nature Communications, Vol. 11, Issue 1
A new mixing of Hartree–Fock and local density‐functional theories
journal, January 1993
- Becke, Axel D.
- The Journal of Chemical Physics, Vol. 98, Issue 2
SchNet – A deep learning architecture for molecules and materials
journal, June 2018
- Schütt, K. T.; Sauceda, H. E.; Kindermans, P. -J.
- The Journal of Chemical Physics, Vol. 148, Issue 24
Semiempirical GGA-type density functional constructed with a long-range dispersion correction
journal, January 2006
- Grimme, Stefan
- Journal of Computational Chemistry, Vol. 27, Issue 15, p. 1787-1799
Alternative Approach to Chemical Accuracy: A Neural Networks-Based First-Principles Method for Heat of Formation of Molecules Made of H, C, N, O, F, S, and Cl
journal, June 2014
- Sun, Jian; Wu, Jiang; Song, Tao
- The Journal of Physical Chemistry A, Vol. 118, Issue 39
Size-independent neural networks based first-principles method for accurate prediction of heat of formation of fuels
journal, June 2018
- Yang, GuanYa; Wu, Jiang; Chen, ShuGuang
- The Journal of Chemical Physics, Vol. 148, Issue 24
Machine learning prediction of accurate atomization energies of organic molecules from low-fidelity quantum chemical calculations
journal, August 2019
- Ward, Logan; Blaiszik, Ben; Foster, Ian
- MRS Communications, Vol. 9, Issue 3
Communication: Understanding molecular representations in machine learning: The role of uniqueness and target similarity
journal, October 2016
- Huang, Bing; von Lilienfeld, O. Anatole
- The Journal of Chemical Physics, Vol. 145, Issue 16
Comparing molecules and solids across structural and alchemical space
journal, January 2016
- De, Sandip; Bartók, Albert P.; Csányi, Gábor
- Physical Chemistry Chemical Physics, Vol. 18, Issue 20
Coupled-cluster reference values for the GW27 and GW100 test sets for the assessment of GW methods
journal, March 2015
- Krause, Katharina; Harding, Michael E.; Klopper, Wim
- Molecular Physics, Vol. 113, Issue 13-14