Neural Network Analysis of Nuclear Magnetic Resonance and Infrared Spectra
Thesis/Dissertation
·
OSTI ID:2572129
Nuclear magnetic resonance (NMR) spectroscopy and infrared (IR) spectroscopy are powerful chemical characterization techniques with broad general usage. However, the manual evaluation of the resulting spectra is time-consuming and requires significant expertise, preventing insights from being used in real-time applications. With recent advances in computation and artificial intelligence (AI), new tools are available for automating spectral interpretation. In this work, machine learning (ML) algorithms using 1-dimensional convolutional neural networks (CNNs) were applied to identify common functional groups from spectral information. Raw spectra were collected virtually from the Human Metabolome Database (HMDB) and National Institute of Standards and Technology (NIST) Chemistry WebBook and processed into a suitable standard. Algorithm design was tailored to best fit the nature of the problem, with built-in flexibility to accommodate relevant parameters beyond the raw spectral input, specifically solvent identity and magnetic frequency for NMR. The predictive capability of the algorithm in identifying functional groups is displayed in several examples. This methodology has been compiled into a code repository and could easily be modified to adapt alternative data sources, including other spectrum types. To mitigate overfitting, a common problem in mathematical modeling where overfamiliarity with training data produces trends that are not representative of the general data, a novel metric was developed, referred to as Accufit. Accufit includes a parameter that penalizes substantial differences in the training accuracy and the accuracy of an independent validation set. Examples are presented showing the effectiveness of Accufit in maintaining the model’s predictive capability while controlling the overfitting when used as a custom metric for hyperparameter tuning.
- Research Organization:
- Kansas City Plant (KCP)
- Sponsoring Organization:
- USDOE National Nuclear Security Administration (NNSA)
- DOE Contract Number:
- NA0002839
- OSTI ID:
- 2572129
- Report Number(s):
- NSC-614-7354
- Country of Publication:
- United States
- Language:
- English
Similar Records
Array-Based Machine Learning for Functional Group Detection in Electron Ionization Mass Spectrometry
An initial investigation of accuracy required for the identification of small molecules in complex samples using quantum chemical calculated NMR chemical shifts
Electron energy loss spectroscopy database synthesis and automation of core-loss edge recognition by deep-learning neural networks
Journal Article
·
Wed Jun 28 20:00:00 EDT 2023
· ACS Omega
·
OSTI ID:2420500
An initial investigation of accuracy required for the identification of small molecules in complex samples using quantum chemical calculated NMR chemical shifts
Journal Article
·
Wed Sep 21 20:00:00 EDT 2022
· Journal of Cheminformatics
·
OSTI ID:1925225
Electron energy loss spectroscopy database synthesis and automation of core-loss edge recognition by deep-learning neural networks
Journal Article
·
Thu Dec 22 19:00:00 EST 2022
· Scientific Reports
·
OSTI ID:1906537