DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Automated generation and ensemble-learned matching of X-ray absorption spectra

Journal Article · · npj Computational Materials
 [1];  [2];  [1];  [1];  [1];  [3];  [4];  [4];  [4];  [5];  [2]; ORCiD logo [1]
  1. Univ. of California, San Diego, CA (United States). Dept. of NanoEngineering
  2. Univ. of California, Berkeley, CA (United States). Dept. of Materials Science
  3. National Inst. for Occupational Safety and Health, Centers for Disease Control, Cincinnati, OH (United States). Div. of Applied Research and Technology
  4. Univ. of Washington, Seattle, WA (United States). Dept. of Physics
  5. Binghamton Univ., NY (United States). Dept. of Physics, Applied Physics and Astronomy and Materials Science and Engineering

© 2018 The Author(s). X-ray absorption spectroscopy (XAS) is a widely used materials characterization technique to determine oxidation states, coordination environment, and other local atomic structure information. Analysis of XAS relies on comparison of measured spectra to reliable reference spectra. However, existing databases of XAS spectra are highly limited both in terms of the number of reference spectra available as well as the breadth of chemistry coverage. In this work, we report the development of XASdb, a large database of computed reference XAS, and an Ensemble-Learned Spectra IdEntification (ELSIE) algorithm for the matching of spectra. XASdb currently hosts more than 800,000 K-edge X-ray absorption near-edge spectra (XANES) for over 40,000 materials from the open-science Materials Project database. We discuss a high-throughput automation framework for FEFF calculations, built on robust, rigorously benchmarked parameters. FEFF is a computer program uses a real-space Green's function approach to calculate X-ray absorption spectra. We will demonstrate that the ELSIE algorithm, which combines 33 weak "learners" comprising a set of preprocessing steps and a similarity metric, can achieve up to 84.2% accuracy in identifying the correct oxidation state and coordination environment of a test set of 19 K-edge XANES spectra encompassing a diverse range of chemistries and crystal structures. The XASdb with the ELSIE algorithm has been integrated into a web application in the Materials Project, providing an important new public resource for the analysis of XAS to all materials researchers. Finally, the ELSIE algorithm itself has been made available as part of veidt, an open source machine-learning library for materials science.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States). National Energy Research Scientific Computing Center (NERSC)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES)
Grant/Contract Number:
FG02-97ER45623; AC02-05CH11231
OSTI ID:
1490242
Alternate ID(s):
OSTI ID: 1559168
Journal Information:
npj Computational Materials, Vol. 4, Issue 1; ISSN 2057-3960
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 85 works
Citation information provided by
Web of Science

References (44)

Ab initio Bethe-Salpeter calculations of the x-ray absorption spectra of transition metals at the L -shell edges journal November 2012
ATHENA , ARTEMIS , HEPHAESTUS : data analysis for X-ray absorption spectroscopy using IFEFFIT journal June 2005
Generalized Gradient Approximation Made Simple journal October 1996
Decision combination in multiple classifier systems journal January 1994
Cu 2 ZnSnS 4 as a potential photovoltaic material: A hybrid Hartree-Fock density functional theory study journal March 2009
A Complete Overhaul of the Electron Energy-Loss Spectroscopy and X-Ray Absorption Spectroscopy Database: eelsdb.eu journal February 2016
Hybrid functionals based on a screened Coulomb potential journal May 2003
Thermodynamics, Kinetics and Structural Evolution of ε-LiVOPO 4 over Multiple Lithium Intercalation journal February 2016
Band-edge problem in the theoretical determination of defect energy levels: The O vacancy in ZnO as a benchmark case journal September 2011
A new matching algorithm for high resolution mass spectra journal August 2004
Using Similarity Metrics to Quantify Differences in High-Throughput Data Sets: Application to X-ray Diffraction Patterns journal December 2016
Introduction to XAFS book January 2010
Atomate: A high-level interface to generate, execute, and analyze computational materials science workflows journal November 2017
FireWorks: a dynamic workflow system designed for high-throughput applications: FireWorks: A Dynamic Workflow System Designed for High-Throughput Applications journal May 2015
High rate delithiation behaviour of LiFePO4 studied by quick X-ray absorption spectroscopy journal January 2012
X-ray absorption spectra of graphene and graphene oxide by full-potential multiple scattering calculations with self-consistent charge density journal September 2015
A practical introduction to multiple scattering theory journal September 2005
Commentary: The Materials Project: A materials genome approach to accelerating materials innovation journal July 2013
The Materials Application Programming Interface (API): A simple, flexible and efficient API for materials data based on REpresentational State Transfer (REST) principles journal February 2015
Fundamentals of XAFS journal January 2014
Structural and Electronic Properties of Micellar Au Nanoparticles: Size and Ligand Effects journal January 2014
Simultaneous Reduction of Co 3+ and Mn 4+ in P2-Na 2/3 Co 2/3 Mn 1/3 O 2 As Evidenced by X-ray Absorption Spectroscopy during Electrochemical Sodium Intercalation journal December 2013
Effects of cathode electrolyte interfacial (CEI) layer on long term cycling of all-solid-state thin-film batteries journal August 2016
Restoring the Density-Gradient Expansion for Exchange in Solids and Surfaces journal April 2008
Calculations of ZnO properties using the Heyd-Scuseria-Ernzerhof screened hybrid density functional journal October 2009
Parameter-free calculations of X-ray spectra with FEFF9 journal January 2010
Optimization and testing of mass spectral library search algorithms for compound identification journal September 1994
Local structural changes in LiMn1.5Ni0.5O4 spinel cathode material for lithium-ion batteries journal June 2014
Electron Energy-Loss Spectroscopy in the Electron Microscope book January 2011
Reevaluation of X-Ray Atomic Energy Levels journal January 1967
Machine learning tools formineral recognition and classification from Raman spectroscopy: Tools for mineral recognition and classification from RS journal August 2015
On the structural integrity and electrochemical activity of a 0.5Li2MnO3·0.5LiCoO2 cathode material for lithium-ion batteries journal January 2014
The NumPy Array: A Structure for Efficient Numerical Computation journal March 2011
Theoretical approaches to x-ray absorption fine structure journal July 2000
Calculations of electron energy loss and x-ray absorption spectra in periodic systems without a supercell journal June 2010
The thermodynamic scale of inorganic crystalline metastability journal November 2016
Python Materials Genomics (pymatgen): A robust, open-source python library for materials analysis journal February 2013
Experimental Observation of Redox-Induced Fe–N Switching Behavior as a Determinant Role for Oxygen Reduction Activity journal October 2015
Raman Imaging journal July 2012
CASSPER is a semantic segmentation-based particle picking algorithm for single-particle cryo-electron microscopy journal February 2021
Electron Energy-Loss Spectroscopy in the Electron Microscope book January 1995
Van der Waals density functionals applied to solids text January 2011
Band-edge problem in the theoretical determination of defect energy levels: the O vacancy in ZnO as a benchmark case text January 2012
Parameter-free calculations of X-ray spectra with FEFF9 journal January 2010

Cited By (11)

X‐Ray Absorption Spectroscopy Characterizations on PGM‐Free Electrocatalysts: Justification, Advantages, and Limitations journal December 2018
Deep Learning Spectroscopy: Neural Networks for Molecular Excitation Spectra journal January 2019
Data‐Driven Materials Science: Status, Challenges, and Perspectives journal September 2019
A Critical Review of Machine Learning of Energy Materials journal January 2020
Automated estimation of materials parameter from X-ray absorption and electron energy-loss spectra with similarity measures journal March 2019
Automatic oxidation threshold recognition of XAFS data using supervised machine learning journal January 2019
Machine learning and big scientific data
  • Hey, Tony; Butler, Keith; Jackson, Sam
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 378, Issue 2166 https://doi.org/10.1098/rsta.2019.0054
journal January 2020
Data‐Driven Materials Science: Status, Challenges, and Perspectives journal November 2019
Deep Learning Spectroscopy: Neural Networks for Molecular Excitation Spectra text January 2019
Data-driven materials science: status, challenges and perspectives text January 2019
Machine learning and big scientific data
  • Hey, Tony; Butler, Keith; Jackson, Sam
  • Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, Vol. 378, Issue 2166 https://doi.org/10.1098/rsta.2019.0054
journal January 2020

Figures / Tables (6)