Diagnostics of Data-Driven Models: Uncertainty Quantification of PM7 Semi-Empirical Quantum Chemical Method
Abstract
Abstract We report an evaluation of a semi-empirical quantum chemical method PM7 from the perspective of uncertainty quantification. Specifically, we apply Bound-to-Bound Data Collaboration, an uncertainty quantification framework, to characterize (a) variability of PM7 model parameter values consistent with the uncertainty in the training data and (b) uncertainty propagation from the training data to the model predictions. Experimental heats of formation of a homologous series of linear alkanes are used as the property of interest. The training data are chemically accurate , i.e., they have very low uncertainty by the standards of computational chemistry. The analysis does not find evidence of PM7 consistency with the entire data set considered as no single set of parameter values is found that captures the experimental uncertainties of all training data. A set of parameter values for PM7 was able to capture the training data within ±1 kcal/mol, but not to the smaller level of uncertainty in the reported data. Nevertheless, PM7 was found to be consistent for subsets of the training data. In such cases, uncertainty propagation from the chemically accurate training data to the predicted values preserves error within bounds of chemical accuracy if predictions are made for the molecules of comparable size.more »
- Authors:
- Publication Date:
- Research Org.:
- Univ. of Utah, Salt Lake City, UT (United States)
- Sponsoring Org.:
- USDOE
- OSTI Identifier:
- 1467754
- Alternate Identifier(s):
- OSTI ID: 1545322
- Grant/Contract Number:
- NA0002375
- Resource Type:
- Published Article
- Journal Name:
- Scientific Reports
- Additional Journal Information:
- Journal Name: Scientific Reports Journal Volume: 8 Journal Issue: 1; Journal ID: ISSN 2045-2322
- Publisher:
- Nature Publishing Group
- Country of Publication:
- United Kingdom
- Language:
- English
- Subject:
- 42 ENGINEERING
Citation Formats
Oreluk, James, Liu, Zhenyuan, Hegde, Arun, Li, Wenyu, Packard, Andrew, Frenklach, Michael, and Zubarev, Dmitry. Diagnostics of Data-Driven Models: Uncertainty Quantification of PM7 Semi-Empirical Quantum Chemical Method. United Kingdom: N. p., 2018.
Web. doi:10.1038/s41598-018-31677-y.
Oreluk, James, Liu, Zhenyuan, Hegde, Arun, Li, Wenyu, Packard, Andrew, Frenklach, Michael, & Zubarev, Dmitry. Diagnostics of Data-Driven Models: Uncertainty Quantification of PM7 Semi-Empirical Quantum Chemical Method. United Kingdom. https://doi.org/10.1038/s41598-018-31677-y
Oreluk, James, Liu, Zhenyuan, Hegde, Arun, Li, Wenyu, Packard, Andrew, Frenklach, Michael, and Zubarev, Dmitry. Wed .
"Diagnostics of Data-Driven Models: Uncertainty Quantification of PM7 Semi-Empirical Quantum Chemical Method". United Kingdom. https://doi.org/10.1038/s41598-018-31677-y.
@article{osti_1467754,
title = {Diagnostics of Data-Driven Models: Uncertainty Quantification of PM7 Semi-Empirical Quantum Chemical Method},
author = {Oreluk, James and Liu, Zhenyuan and Hegde, Arun and Li, Wenyu and Packard, Andrew and Frenklach, Michael and Zubarev, Dmitry},
abstractNote = {Abstract We report an evaluation of a semi-empirical quantum chemical method PM7 from the perspective of uncertainty quantification. Specifically, we apply Bound-to-Bound Data Collaboration, an uncertainty quantification framework, to characterize (a) variability of PM7 model parameter values consistent with the uncertainty in the training data and (b) uncertainty propagation from the training data to the model predictions. Experimental heats of formation of a homologous series of linear alkanes are used as the property of interest. The training data are chemically accurate , i.e., they have very low uncertainty by the standards of computational chemistry. The analysis does not find evidence of PM7 consistency with the entire data set considered as no single set of parameter values is found that captures the experimental uncertainties of all training data. A set of parameter values for PM7 was able to capture the training data within ±1 kcal/mol, but not to the smaller level of uncertainty in the reported data. Nevertheless, PM7 was found to be consistent for subsets of the training data. In such cases, uncertainty propagation from the chemically accurate training data to the predicted values preserves error within bounds of chemical accuracy if predictions are made for the molecules of comparable size. Otherwise, the error grows linearly with the relative size of the molecules.},
doi = {10.1038/s41598-018-31677-y},
journal = {Scientific Reports},
number = 1,
volume = 8,
place = {United Kingdom},
year = {Wed Sep 05 00:00:00 EDT 2018},
month = {Wed Sep 05 00:00:00 EDT 2018}
}
https://doi.org/10.1038/s41598-018-31677-y
Web of Science
Figures / Tables:
Works referenced in this record:
Transforming data into knowledge—Process Informatics for combustion chemistry
journal, January 2007
- Frenklach, Michael
- Proceedings of the Combustion Institute, Vol. 31, Issue 1
Semiempirical quantum–chemical methods
journal, July 2013
- Thiel, Walter
- Wiley Interdisciplinary Reviews: Computational Molecular Science, Vol. 4, Issue 2
Polarizable Force Fields: History, Test Cases, and Prospects
journal, September 2007
- Warshel, Arieh; Kato, Mitsunori; Pisliakov, Andrei V.
- Journal of Chemical Theory and Computation, Vol. 3, Issue 6
Quest for a universal density functional: the accuracy of density functionals across a broad spectrum of databases in chemistry and physics
journal, March 2014
- Peverati, Roberto; Truhlar, Donald G.
- Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 372, Issue 2011
Computational prediction of protein interfaces: A review of data driven methods
journal, October 2015
- Xue, Li C.; Dobbs, Drena; Bonvin, Alexandre M. J. J.
- FEBS Letters, Vol. 589, Issue 23
Uncertainty quantification: Making predictions of complex reaction systems reliable
journal, October 2010
- Russi, Trent; Packard, Andy; Frenklach, Michael
- Chemical Physics Letters, Vol. 499, Issue 1-3
The Elements of Statistical Learning
book, January 2009
- Hastie, Trevor; Tibshirani, Robert; Friedman, Jerome
- Springer Series in Statistics
Comparison of Statistical and Deterministic Frameworks of Uncertainty Quantification
journal, January 2016
- Frenklach, Michael; Packard, Andrew; Garcia-Donato, Gonzalo
- SIAM/ASA Journal on Uncertainty Quantification, Vol. 4, Issue 1
Towards cleaner combustion engines through groundbreaking detailed chemical kinetic models
journal, January 2011
- Battin-Leclerc, Frédérique; Blurock, Edward; Bounaceur, Roda
- Chemical Society Reviews, Vol. 40, Issue 9
Optimization of parameters for semiempirical methods VI: more modifications to the NDDO approximations and re-optimization of parameters
journal, November 2012
- Stewart, James J. P.
- Journal of Molecular Modeling, Vol. 19, Issue 1
Uncertainty quantification in thermochemistry, benchmarking electronic structure computations, and Active Thermochemical Tables
journal, January 2014
- Ruscic, Branko
- International Journal of Quantum Chemistry, Vol. 114, Issue 17
Deep learning for computational chemistry
journal, March 2017
- Goh, Garrett B.; Hodas, Nathan O.; Vishnu, Abhinav
- Journal of Computational Chemistry, Vol. 38, Issue 16
Semiempirical Quantum-Chemical Orthogonalization-Corrected Methods: Theory, Implementation, and Parameters
journal, January 2016
- Dral, Pavlo O.; Wu, Xin; Spörkel, Lasse
- Journal of Chemical Theory and Computation, Vol. 12, Issue 3
Semiempirical Quantum-Chemical Orthogonalization-Corrected Methods: Benchmarks for Ground-State Properties
journal, January 2016
- Dral, Pavlo O.; Wu, Xin; Spörkel, Lasse
- Journal of Chemical Theory and Computation, Vol. 12, Issue 3
Machine-learned approximations to Density Functional Theory Hamiltonians
journal, February 2017
- Hegde, Ganesh; Bowen, R. Chris
- Scientific Reports, Vol. 7, Issue 1
Prediction uncertainty from models and data
conference, January 2002
- Frenklach, M.; Packard, A.; Seiler, P.
- Proceedings of 2002 American Control Conference, Proceedings of the 2002 American Control Conference (IEEE Cat. No.CH37301)
The Effects of Computational Modeling Errors on the Estimation of Statistical Mechanical Variables
journal, March 2012
- Faver, John C.; Yang, Wei; Merz, Kenneth M.
- Journal of Chemical Theory and Computation, Vol. 8, Issue 10
Highly Accurate First-Principles Benchmark Data Sets for the Parametrization and Validation of Density Functional and Other Approximate Methods. Derivation of a Robust, Generally Applicable, Double-Hybrid Functional for Thermochemistry and Thermochemical Kinetics †
journal, December 2008
- Karton, Amir; Tarnopolsky, Alex; Lamère, Jean-François
- The Journal of Physical Chemistry A, Vol. 112, Issue 50
QSAR Modeling: Where Have You Been? Where Are You Going To?
journal, January 2014
- Cherkasov, Artem; Muratov, Eugene N.; Fourches, Denis
- Journal of Medicinal Chemistry, Vol. 57, Issue 12
Consistency of a Reaction Dataset
journal, November 2004
- Feeley, Ryan; Seiler, Pete; Packard, Andrew
- The Journal of Physical Chemistry A, Vol. 108, Issue 44
Density functional theory is straying from the path toward the exact functional
journal, January 2017
- Medvedev, Michael G.; Bushmarinov, Ivan S.; Sun, Jianwei
- Science, Vol. 355, Issue 6320
Comparison of Molecular Mechanics, Semi-Empirical Quantum Mechanical, and Density Functional Theory Methods for Scoring Protein–Ligand Interactions
journal, June 2013
- Yilmazer, Nusret Duygu; Korth, Martin
- The Journal of Physical Chemistry B, Vol. 117, Issue 27
Quantum-chemical insights from deep tensor neural networks
journal, January 2017
- Schütt, Kristof T.; Arbabzadah, Farhad; Chmiela, Stefan
- Nature Communications, Vol. 8, Issue 1
Hybrid Density Functional Methods Empirically Optimized for the Computation of 13 C and 1 H Chemical Shifts in Chloroform Solution
journal, May 2006
- Wiitala, Keith W.; Hoye, Thomas R.; Cramer, Christopher J.
- Journal of Chemical Theory and Computation, Vol. 2, Issue 4
Improving the accuracy of Møller-Plesset perturbation theory with neural networks
journal, October 2017
- McGibbon, Robert T.; Taube, Andrew G.; Donchev, Alexander G.
- The Journal of Chemical Physics, Vol. 147, Issue 16
Numerical approaches for collaborative data processing
journal, December 2006
- Seiler, Pete; Frenklach, Michael; Packard, Andrew
- Optimization and Engineering, Vol. 7, Issue 4
Atomic Radius and Charge Parameter Uncertainty in Biomolecular Solvation Energy Calculations
journal, January 2018
- Yang, Xiu; Lei, Huan; Gao, Peiyuan
- Journal of Chemical Theory and Computation, Vol. 14, Issue 2
Deep Architectures and Deep Learning in Chemoinformatics: The Prediction of Aqueous Solubility for Drug-Like Molecules
journal, July 2013
- Lusci, Alessandro; Pollastri, Gianluca; Baldi, Pierre
- Journal of Chemical Information and Modeling, Vol. 53, Issue 7
Error Assessment of Computational Models in Chemistry
journal, April 2017
- Simm, GregorN.; Proppe, Jonny; Reiher, Markus
- CHIMIA International Journal for Chemistry, Vol. 71, Issue 4
An Empirical Polarizable Force Field Based on the Classical Drude Oscillator Model: Development History and Recent Applications
journal, January 2016
- Lemkul, Justin A.; Huang, Jing; Roux, Benoît
- Chemical Reviews, Vol. 116, Issue 9
Design of Density Functionals by Combining the Method of Constraint Satisfaction with Parametrization for Thermochemistry, Thermochemical Kinetics, and Noncovalent Interactions
journal, January 2006
- Zhao, Yan; Schultz, Nathan E.; Truhlar, Donald G.
- Journal of Chemical Theory and Computation, Vol. 2, Issue 2
Perspective on density functional theory
journal, April 2012
- Burke, Kieron
- The Journal of Chemical Physics, Vol. 136, Issue 15
Semiempirical Quantum Mechanical Methods for Noncovalent Interactions for Chemical and Biochemical Applications
journal, April 2016
- Christensen, Anders S.; Kubař, Tomáš; Cui, Qiang
- Chemical Reviews, Vol. 116, Issue 9
Additivity rules for the estimation of thermochemical properties
journal, June 1969
- Benson, Sidney W.; Cruickshank, F. R.; Golden, D. M.
- Chemical Reviews, Vol. 69, Issue 3
Chemical Kinetics and Combustion Modeling
journal, October 1990
- Miller, J. A.; Kee, R. J.; Westbrook, C. K.
- Annual Review of Physical Chemistry, Vol. 41, Issue 1
Figures / Tables found in this record: