DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Comparing Designed Training Sets to Optimize Multivariate Regression Models for Pr, Nd, and Nitric Acid Using Spectrophotometry

Journal Article · · Applied Spectroscopy Practica

Chemometric regression models were developed for the quantification of praseodymium (Pr, 0–1000 µg/mL), neodymium (Nd, 0–1000 µg/mL), and nitric acid (HNO3, 0.1–5 M) using spectrophotometry. Designed calibration sets were composed of 20 samples each: 10 model points and 10 lack-of-fit (LOF) points. The D-optimal designs effectively minimized the number of samples required to build models, and each design resulted in similar prediction performance, suggesting that statistical design of experiments can provide a reliable framework for selecting training set samples in three-variable systems. Partial least squares regression (PLSR) models were validated against a one-factor-at-a-time validation set composed of 125 samples (three variables, five levels). The top PLS-1 models resulted in average percent root mean square error of prediction error values of 3.5%, 1.7%, and 1.2% for Pr(III), Nd(III), and HNO3, respectively. Power set augmentations of the model and LOF samples were investigated to optimize the number of training set samples. PLSR models built using just required model points (10) had similar predictive capabilities as models including the LOF points (20) but with fewer samples. The number of validation samples was also varied systematically to learn how many samples are needed to validate regression models. This work addresses long-standing questions in the field of chemometrics to help make this approach amenable to the near-real-time quantification of hazardous species in remote settings.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Nuclear Physics (NP)
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
2333817
Journal Information:
Applied Spectroscopy Practica, Journal Name: Applied Spectroscopy Practica Journal Issue: 1 Vol. 2; ISSN 2755-1857
Publisher:
SageCopyright Statement
Country of Publication:
United States
Language:
English

References (32)

A closer look at the bias-variance trade-off in multivariate calibration journal March 1999
The utility of multivariate design in PLS modeling journal March 2004
A comparison of methods for testing differences in predictive ability journal September 2005
Assessing and improving the stability of chemometric models in small sample size situations journal January 2008
UV–Vis spectroscopy with chemometric data treatment: an option for on-line control in nuclear industry journal April 2017
Concentration determination of inorganic acids that do not absorb near-infrared (NIR) radiation through recognizing perturbed NIR water bands by them and investigation of accuracy dependency on their acidities journal June 2018
A high-bias, low-variance introduction to Machine Learning for physicists journal May 2019
Nd(III) hypersensitive peak as an optical absorption probe for determining nitric acid in aqueous solution: An application to aqueous raffinate solutions in nuclear reprocessing journal August 2021
Leveraging visible and near-infrared spectroelectrochemistry to calibrate a robust model for Vanadium(IV/V) in varying nitric acid and temperature levels journal July 2023
Multivariate chemometric methods and Vis-NIR spectrophotometry for monitoring plutonium-238 anion exchange column effluent in a radiochemical hot cell journal August 2022
Breaking with trends in pre-processing? journal October 2013
Water O–H Stretching Raman Signature for Strong Acid Monitoring via Multivariate Analysis journal March 2013
Practical Guide to Chemometric Analysis of Optical Spectroscopic Data journal June 2023
Characterization of the ALSEP Process at Equilibrium: Speciation and Stoichiometry of the Extracted Complex journal March 2020
Design of Experiments, Chemometrics, and Raman Spectroscopy for the Quantification of Hydroxylammonium, Nitrate, and Nitric Acid journal February 2022
Pursuit of the Ultimate Regression Model for Samarium(III), Europium(III), and LiCl Using Laser-Induced Fluorescence, Design of Experiments, and a Genetic Algorithm for Feature Selection journal January 2023
Complexation of Lanthanides with Nitrate at Variable Temperatures: Thermodynamics and Coordination Modes journal February 2009
Optimal experimental design journal July 2018
Electronic Energy Levels in the Trivalent Lanthanide Aquo Ions. I. Pr 3+ , Nd 3+ , Pm 3+ , Sm 3+ , Dy 3+ , Ho 3+ , Er 3+ , and Tm 3+ journal November 1968
Fraction of Design Space to Assess Prediction Capability of Response Surface Designs journal October 2003
An Expository Paper on Optimal Design journal July 2011
Memorizing without overfitting: Bias, variance, and interpolation in overparameterized models journal March 2022
A Piecewise Local Partial Least Squares (PLS) Method for the Quantitative Analysis of Plutonium Nitrate Solutions journal October 2017
Monitoring the Caustic Dissolution of Aluminum Alloy in a Radiochemical Hot Cell Using Raman Spectroscopy journal July 2020
Chemometrics and Experimental Design for the Quantification of Nitrate Salts in Nitric Acid: Near-Infrared Spectroscopy Absorption Analysis journal January 2021
Measuring Nd(III) Solution Concentration in the Presence of Interfering Er(III) and Cu(II) Ions: A Partial Least Squares Analysis of Ultraviolet–Visible Spectra journal October 2021
Effect of Experimental Design on the Prediction Performance of Calibration Models Based on Near-Infrared Spectroscopy for Pharmaceutical Applications journal December 2012
Feasibility Study of Spectrophotometry to Support a Promethium Production Program at ORNL report June 2022
In-Cell Recording Optical Spectrometer. report January 1967
Mitigating the Multicollinearity Problem and Its Machine Learning Approach: A Review journal April 2022
Comparison of Multivariate Regression Models Based on Water- and Carbohydrate-Related Spectral Regions in the Near-Infrared for Aqueous Solutions of Glucose journal October 2019
Partial Least Squares, Experimental Design, and Near-Infrared Spectrophotometry for the Remote Quantification of Nitric Acid Concentration and Temperature journal April 2023