skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Real-time prediction of 1 H and 13 C chemical shifts with DFT accuracy using a 3D graph neural network

Journal Article · · Chemical Science
DOI:https://doi.org/10.1039/D1SC03343C· OSTI ID:1812410

Nuclear magnetic resonance (NMR) is one of the primary techniques used to elucidate the chemical structure, bonding, stereochemistry, and conformation of organic compounds. The distinct chemical shifts in an NMR spectrum depend upon each atom's local chemical environment and are influenced by both through-bond and through-space interactions with other atoms and functional groups. The in silico prediction of NMR chemical shifts using quantum mechanical (QM) calculations is now commonplace in aiding organic structural assignment since spectra can be computed for several candidate structures and then compared with experimental values to find the best possible match. However, the computational demands of calculating multiple structural- and stereo-isomers, each of which may typically exist as an ensemble of rapidly-interconverting conformations, are expensive. Additionally, the QM predictions themselves may lack sufficient accuracy to identify a correct structure. In this work, we address both of these shortcomings by developing a rapid machine learning (ML) protocol to predict 1H and 13C chemical shifts through an efficient graph neural network (GNN) using 3D structures as input. Transfer learning with experimental data is used to improve the final prediction accuracy of a model trained using QM calculations. When tested on the CHESHIRE dataset, the proposed model predicts observed 13C chemical shifts with comparable accuracy to the best-performing DFT functionals (1.5 ppm) in around 1/6000 of the CPU time. An automated prediction webserver and graphical interface are accessible online at http://nova.chem.colostate.edu/cascade/. We further demonstrate the model in three applications: first, we use the model to decide the correct organic structure from candidates through experimental spectra, including complex stereoisomers; second, we automatically detect and revise incorrect chemical shift assignments in a popular NMR database, the NMRShiftDB; and third, we use NMR chemical shifts as descriptors for determination of the sites of electrophilic aromatic substitution.

Research Organization:
National Renewable Energy Lab. (NREL), Golden, CO (United States)
Sponsoring Organization:
USDOE Advanced Research Projects Agency - Energy (ARPA-E)
Grant/Contract Number:
AC36-08GO28308
OSTI ID:
1812410
Alternate ID(s):
OSTI ID: 1822396
Report Number(s):
NREL/JA-2700-80593; CSHCBM
Journal Information:
Chemical Science, Journal Name: Chemical Science Vol. 12 Journal Issue: 36; ISSN 2041-6520
Publisher:
Royal Society of Chemistry (RSC)Copyright Statement
Country of Publication:
United Kingdom
Language:
English

References (71)

NMReDATA, a standard to report the NMR assignment and parameters of organic compounds journal May 2018
NMRShiftDBConstructing a Free Chemical Information System with Open-Source Components journal November 2003
Crystal Structure Prediction via Deep Learning journal June 2018
MoleculeNet: a benchmark for molecular machine learning journal January 2018
Predicting NMR Spectra by Computational Methods:  Structure Revision of Hexacyclinol journal June 2006
Addressing the Stereochemistry of Complex Organic Molecules by Density Functional Theory-NMR: Vannusal B in Retrospective journal April 2011
Deep learning journal May 2015
ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost journal January 2017
The value of universally available raw NMR data for transparency, reproducibility, and integrity in natural product research journal January 2019
Assigning the Stereochemistry of Pairs of Diastereoisomers Using GIAO NMR Shift Calculation journal June 2009
Better Informed Distance Geometry: Using What We Know To Improve Conformation Generation journal November 2015
Beyond DP4: an Improved Probability for the Stereochemical Assignment of Isomeric Compounds using Quantum Chemical Calculations of NMR Shifts journal December 2015
Doubling the power of DP4 for computational structure elucidation journal January 2017
A computer program for the prediction of 13-C-NMR chemical shifts of organic compounds journal January 1990
Toward More Reliable 13 C and 1 H Chemical Shift Prediction:  A Systematic Comparison of Neural-Network and Least-Squares Regression Based Approaches journal December 2007
A Multi-standard Approach for GIAO 13 C NMR Calculations journal October 2009
Assigning Stereochemistry to Single Diastereoisomers by GIAO NMR Calculation: The DP4 Probability journal September 2010
On representing chemical environments journal May 2013
Message-passing neural networks for high-throughput polymer screening journal June 2019
IMPRESSION – prediction of NMR parameters for 3-dimensional chemical structures using machine learning with near quantum chemical accuracy journal January 2020
Demonstrating the Transferability and the Descriptive Power of Sketch-Map journal February 2013
Prediction of 1 H NMR Chemical Shifts Using Neural Networks journal January 2002
SchNet – A deep learning architecture for molecules and materials journal June 2018
The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06-class functionals and 12 other functionals journal July 2007
Hose — a novel substructure code journal December 1978
Importance of Engineered and Learned Molecular Representations in Predicting Organic Reactivity, Selectivity, and Chemical Properties journal February 2021
Walking in the woods with quantum chemistry – applications of quantum chemical calculations in natural products research journal January 2013
Transferable Machine-Learning Model of the Electron Density journal December 2018
Fourier series of atomic radial distribution functions: A molecular fingerprint for machine learning models of quantum chemical properties
  • von Lilienfeld, O. Anatole; Ramakrishnan, Raghunathan; Rupp, Matthias
  • International Journal of Quantum Chemistry, Vol. 115, Issue 16 https://doi.org/10.1002/qua.24912
journal April 2015
Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94 journal April 1996
Use of 13 C NMR Chemical Shift as QSAR/QSPR Descriptor journal March 2011
Fast and accurate prediction of the regioselectivity of electrophilic aromatic substitution reactions journal January 2018
Exchange functionals with improved long-range behavior and adiabatic connection methods without adjustable parameters: The mPW and mPW1PW models journal January 1998
Determination of Relative Configuration in Organic Compounds by NMR Spectroscopy and Computational Methods journal September 2007
Using Neural Networks for 13C NMR Chemical Shift Prediction–Comparison with Traditional Methods journal August 2002
A Predictive Tool for Electrophilic Aromatic Substitutions Using Machine Learning journal October 2018
Using 1 H and 13 C NMR chemical shifts to determine cyclic peptide conformations: a combined molecular dynamics and quantum mechanics approach journal January 2018
Rapid prediction of NMR spectral properties with quantified uncertainty journal August 2019
Regio-selectivity prediction with a machine-learned reaction representation and on-the-fly quantum mechanical descriptors journal January 2021
Building blocks for automated elucidation of metabolites: Machine learning methods for NMR prediction journal January 2008
Efficient implementation of the gauge-independent atomic orbital method for NMR chemical shift calculations journal November 1990
Atom-centered symmetry functions for constructing high-dimensional neural network potentials journal February 2011
Machine Learning for Quantum Mechanical Properties of Atoms in Molecules journal July 2015
Carbon-13 NMR Chemical Shift: A Descriptor for Electronic Structure and Reactivity of Organometallic Compounds journal July 2019
Application of the Multi-standard Methodology for Calculating 1 H NMR Chemical Shifts journal July 2012
Molecular graph convolutions: moving beyond fingerprints journal August 2016
Fast and Accurate Modeling of Molecular Atomization Energies with Machine Learning journal January 2012
GFN2-xTB—An Accurate and Broadly Parametrized Self-Consistent Tight-Binding Quantum Chemical Method with Multipole Electrostatics and Density-Dependent Dispersion Contributions journal January 2019
Resolving Transition Metal Chemical Space: Feature Selection for Machine Learning and Structure–Property Relationships journal November 2017
Approaching coupled cluster accuracy with a general-purpose neural network potential through transfer learning journal July 2019
Reinvestigation of a robotically revealed reaction journal June 2019
DP4-AI automated NMR data analysis: straight from spectrometer to structure journal January 2020
Analyzing Learned Molecular Representations for Property Prediction journal July 2019
The Correct Structure of Aquatolide—Experimental Validation of a Theoretically-Predicted Structural Revision journal November 2012
Machine learning unifies the modeling of materials and molecules journal December 2017
Bypassing the Kohn-Sham equations with machine learning journal October 2017
PotentialNet for Molecular Property Prediction journal November 2018
Synergy of synthesis, computation and NMR reveals correct baulamycin structures journal July 2017
Quantum chemical calculations for over 200,000 organic radical species and 40,000 associated closed-shell molecules journal July 2020
Neural Message Passing for NMR Chemical Shift Prediction journal April 2020
Neural network potential-energy surfaces in chemistry: a tool for large-scale simulations journal January 2011
Computational Prediction of 1 H and 13 C Chemical Shifts: A Useful Tool for Natural Product, Mechanistic, and Synthetic Organic Chemistry journal November 2011
Chemical shifts in molecular solids by machine learning journal October 2018
A Survey on Transfer Learning journal October 2010
Performance Validation of Neural Network Based 13 C NMR Prediction Using a Publicly Available Data Source journal February 2008
Can Two Molecules Have the Same NMR Spectrum? Hexacyclinol Revisited journal February 2009
Quantum-chemical insights from deep tensor neural networks journal January 2017
Total Synthesis of (−)-Himalensine A journal November 2017
A Synthesis of Echinopine B journal June 2012
Stereostructure Assignment of Flexible Five-Membered Rings by GIAO 13 C NMR Calculations: Prediction of the Stereochemistry of Elatenyne journal May 2008
Development of a 13 C NMR Chemical Shift Prediction Procedure Using B3LYP/cc-pVDZ and Empirically Derived Systematic Error Correction Terms: A Computational Small Molecule Structure Elucidation Method journal April 2017

Similar Records

General Protocol for the Accurate Prediction of Molecular 13C/1H NMR Chemical Shifts via Machine Learning Augmented DFT
Journal Article · Tue Jun 30 00:00:00 EDT 2020 · Journal of Chemical Information and Modeling · OSTI ID:1812410

Machine Learning Full NMR Chemical Shift Tensors of Silicon Oxides with Equivariant Graph Neural Networks
Journal Article · Thu Mar 02 00:00:00 EST 2023 · Journal of Physical Chemistry. A, Molecules, Spectroscopy, Kinetics, Environment, and General Theory · OSTI ID:1812410

Applying Crystallography and 19 F NMR to investigate dynamics and partner protein interactions in a long chain Flavodoxin
Journal Article · Fri May 13 00:00:00 EDT 2022 · FASEB Journal · OSTI ID:1812410