Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Classifying and analyzing small-angle scattering data using weighted k nearest neighbors machine learning techniques

Journal Article · · Journal of Applied Crystallography (Online)
A consistent challenge for both new and expert practitioners of small-angle scattering (SAS) lies in determining how to analyze the data, given the limited information content of said data and the large number of models that can be employed. Machine learning (ML) methods are powerful tools for classifying data that have found diverse applications in many fields of science. Here, ML methods are applied to the problem of classifying SAS data for the most appropriate model to use for data analysis. The approach employed is built around the method of weighted k nearest neighbors (wKNN), and utilizes a subset of the models implemented in the SasView package (https://www.sasview.org/) for generating a well defined set of training and testing data. The prediction rate of the wKNN method implemented here using a subset of SasView models is reasonably good for many of the models, but has difficulty with others, notably those based on spherical structures. A novel expansion of the wKNN method was also developed, which uses Gaussian processes to produce local surrogate models for the classification, and this significantly improves the classification accuracy. Further, by integrating a stochastic gradient descent method during post-processing, it is possible to leverage the local surrogate model both to classify the SAS data with high accuracy and to predict the structural parameters that best describe the data. The linking of data classification and model fitting has the potential to facilitate the translation of measured data into results for both novice and expert practitioners of SAS.
Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1649508
Journal Information:
Journal of Applied Crystallography (Online), Journal Name: Journal of Applied Crystallography (Online) Journal Issue: 2 Vol. 53; ISSN 1600-5767
Publisher:
International Union of CrystallographyCopyright Statement
Country of Publication:
United States
Language:
English

References (13)

The Design and Analysis of Computer Experiments book January 2003
Minimax and maximin distance designs journal October 1990
Machine learning for molecular and materials science journal July 2018
Sequential Exploration of Complex Surfaces Using Minimum Energy Designs journal January 2015
Theory of reproducing kernels journal March 1950
Reduction and analysis of SANS and USANS data using IGOR Pro journal November 2006
SASfit : a tool for small-angle scattering data analysis using a library of analytical expressions journal September 2015
The suite of small-angle neutron scattering instruments at Oak Ridge National Laboratory journal February 2018
Bayesian calibration of computer models journal August 2001
Combining Field Data and Computer Simulations for Calibration and Prediction journal January 2004
Bayesian Treed Gaussian Process Models With an Application to Computer Modeling journal September 2008
Generalized Latin Hypercube Design for Computer Experiments journal November 2010
Accurate emulators for large-scale computer experiments journal December 2011

Similar Records

Adaptation of the fuzzy k-nearest neighbor classifier for manufacturing automation
Conference · Wed Dec 31 23:00:00 EST 1997 · OSTI ID:634137

ASK: Adversarial Soft k-Nearest Neighbor Attack and Defense
Journal Article · Sun Sep 25 20:00:00 EDT 2022 · IEEE Access · OSTI ID:1922471