DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Multi-component background learning automates signal detection for spectroscopic data

Journal Article · · npj Computational Materials

Abstract Automated experimentation has yielded data acquisition rates that supersede human processing capabilities. Artificial Intelligence offers new possibilities for automating data interpretation to generate large, high-quality datasets. Background subtraction is a long-standing challenge, particularly in settings where multiple sources of the background signal coexist, and automatic extraction of signals of interest from measured signals accelerates data interpretation. Herein, we present an unsupervised probabilistic learning approach that analyzes large data collections to identify multiple background sources and establish the probability that any given data point contains a signal of interest. The approach is demonstrated on X-ray diffraction and Raman spectroscopy data and is suitable to any type of data where the signal of interest is a positive addition to the background signals. While the model can incorporate prior knowledge, it does not require knowledge of the signals since the shapes of the background signals, the noise levels, and the signal of interest are simultaneously learned via a probabilistic matrix factorization framework. Automated identification of interpretable signals by unsupervised probabilistic learning avoids the injection of human bias and expedites signal extraction in large datasets, a transformative capability with many applications in the physical sciences and beyond.

Sponsoring Organization:
USDOE
OSTI ID:
1619644
Journal Information:
npj Computational Materials, Journal Name: npj Computational Materials Vol. 5 Journal Issue: 1; ISSN 2057-3960
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English
Citation Metrics:
Cited by: 17 works
Citation information provided by
Web of Science

References (22)

Automatic collection of powder data from photographs journal February 1975
Inelastic background intensities in XPS spectra journal August 1984
Discovering Ce-rich oxygen evolution catalysts, from high throughput screening to water electrolysis journal January 2014
What are the shapes of response time distributions in visual search?
  • Palmer, Evan M.; Horowitz, Todd S.; Torralba, Antonio
  • Journal of Experimental Psychology: Human Perception and Performance, Vol. 37, Issue 1 https://doi.org/10.1037/a0020747
journal January 2011
XCIII. On the theory of X-ray absorption and of the continuous X-ray spectrum journal November 1923
Solar fuel photoanodes prepared by inkjet printing of copper vanadates journal January 2016
Saddlepoint methods for option pricing journal September 2009
The quantitative analysis of surfaces by XPS: A review journal December 1980
The 2019 materials by design roadmap journal October 2018
Accelerating the discovery of materials for clean energy in the era of smart automation journal April 2018
Automated Autofluorescence Background Subtraction Algorithm for Biomedical Raman Spectroscopy journal November 2007
Calculation of the electron–electron bremsstrahlung cross-section in the field of atomic electrons journal February 2008
Perspective: Composition–structure–property mapping in high-throughput experiments: Turning data into knowledge journal May 2016
Combinatorial metallurgical synthesis and processing of high-entropy alloys journal July 2018
Algorithm for automatic x-ray photoelectron spectroscopy data processing and x-ray photoelectron spectroscopy imaging journal July 2005
Expediting Combinatorial Data Set Analysis by Combining Human and Algorithmic Analysis journal December 2016
Theory of Bremsstrahlung and Pair Production. II. Integral Cross Section for Pair Production journal February 1954
Multivariate Characterization of a Continuous Soot Monitoring System Based on Raman Spectroscopy journal September 2015
Combinatorial approaches as effective tools in the study of phase diagrams and composition–structure–property relationships journal July 2006
Theory of Bremsstrahlung and Pair Production. I. Differential Cross Section journal February 1954
Exponentially modified Gaussian (EMG) relevance to distributions related to cell proliferation and differentiation journal January 2010
Über die Interferenzerscheinungen an planparallelen Platten journal January 1904