DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Selecting XFEL single-particle snapshots by geometric machine learning

Abstract

A promising new route for structural biology is single-particle imaging with an X-ray Free-Electron Laser (XFEL). This method has the advantage that the samples do not require crystallization and can be examined at room temperature. However, high-resolution structures can only be obtained from a sufficiently large number of diffraction patterns of individual molecules, so-called single particles. Here, we present a method that allows for efficient identification of single particles in very large XFEL datasets, operates at low signal levels, and is tolerant to background. This method uses supervised Geometric Machine Learning (GML) to extract low-dimensional feature vectors from a training dataset, fuse test datasets into the feature space of training datasets, and separate the data into binary distributions of “single particles” and “non-single particles.” As a proof of principle, we tested simulated and experimental datasets of the Coliphage PR772 virus. We created a training dataset and classified three types of test datasets: First, a noise-free simulated test dataset, which gave near perfect separation. Second, simulated test datasets that were modified to reflect different levels of photon counts and background noise. These modified datasets were used to quantify the predictive limits of our approach. Third, an experimental dataset collected at themore » Stanford Linear Accelerator Center. The single-particle identification for this experimental dataset was compared with previously published results and it was found that GML covers a wide photon-count range, outperforming other single-particle identification methods. Moreover, a major advantage of GML is its ability to retrieve single particles in the presence of structural variability.« less

Authors:
ORCiD logo [1];  [1];  [1];  [1];  [1]; ORCiD logo [1]
  1. Univ. of Wisconsin, Milwaukee, WI (United States). Dept. of Physics
Publication Date:
Research Org.:
Univ. of Wisconsin, Milwaukee, WI (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Basic Energy Sciences (BES); National Science Foundation (NSF)
OSTI Identifier:
1817031
Alternate Identifier(s):
OSTI ID: 1766112
Grant/Contract Number:  
SC0002164; STC 1231306; DBI-2029533
Resource Type:
Accepted Manuscript
Journal Name:
Structural Dynamics
Additional Journal Information:
Journal Volume: 8; Journal Issue: 1; Journal ID: ISSN 2329-7778
Publisher:
American Crystallographic Association/AIP
Country of Publication:
United States
Language:
English
Subject:
71 CLASSICAL AND QUANTUM MECHANICS, GENERAL PHYSICS

Citation Formats

Cruz-Chú, Eduardo R., Hosseinizadeh, Ahmad, Mashayekhi, Ghoncheh, Fung, Russell, Ourmazd, Abbas, and Schwander, Peter. Selecting XFEL single-particle snapshots by geometric machine learning. United States: N. p., 2021. Web. doi:10.1063/4.0000060.
Cruz-Chú, Eduardo R., Hosseinizadeh, Ahmad, Mashayekhi, Ghoncheh, Fung, Russell, Ourmazd, Abbas, & Schwander, Peter. Selecting XFEL single-particle snapshots by geometric machine learning. United States. https://doi.org/10.1063/4.0000060
Cruz-Chú, Eduardo R., Hosseinizadeh, Ahmad, Mashayekhi, Ghoncheh, Fung, Russell, Ourmazd, Abbas, and Schwander, Peter. Thu . "Selecting XFEL single-particle snapshots by geometric machine learning". United States. https://doi.org/10.1063/4.0000060. https://www.osti.gov/servlets/purl/1817031.
@article{osti_1817031,
title = {Selecting XFEL single-particle snapshots by geometric machine learning},
author = {Cruz-Chú, Eduardo R. and Hosseinizadeh, Ahmad and Mashayekhi, Ghoncheh and Fung, Russell and Ourmazd, Abbas and Schwander, Peter},
abstractNote = {A promising new route for structural biology is single-particle imaging with an X-ray Free-Electron Laser (XFEL). This method has the advantage that the samples do not require crystallization and can be examined at room temperature. However, high-resolution structures can only be obtained from a sufficiently large number of diffraction patterns of individual molecules, so-called single particles. Here, we present a method that allows for efficient identification of single particles in very large XFEL datasets, operates at low signal levels, and is tolerant to background. This method uses supervised Geometric Machine Learning (GML) to extract low-dimensional feature vectors from a training dataset, fuse test datasets into the feature space of training datasets, and separate the data into binary distributions of “single particles” and “non-single particles.” As a proof of principle, we tested simulated and experimental datasets of the Coliphage PR772 virus. We created a training dataset and classified three types of test datasets: First, a noise-free simulated test dataset, which gave near perfect separation. Second, simulated test datasets that were modified to reflect different levels of photon counts and background noise. These modified datasets were used to quantify the predictive limits of our approach. Third, an experimental dataset collected at the Stanford Linear Accelerator Center. The single-particle identification for this experimental dataset was compared with previously published results and it was found that GML covers a wide photon-count range, outperforming other single-particle identification methods. Moreover, a major advantage of GML is its ability to retrieve single particles in the presence of structural variability.},
doi = {10.1063/4.0000060},
journal = {Structural Dynamics},
number = 1,
volume = 8,
place = {United States},
year = {Thu Feb 18 00:00:00 EST 2021},
month = {Thu Feb 18 00:00:00 EST 2021}
}

Works referenced in this record:

Mapping the conformations of biological assemblies
journal, March 2010


Maximum-likelihood Multi-reference Refinement for Electron Microscopy Images
journal, April 2005

  • Scheres, Sjors H. W.; Valle, Mikel; Nuñez, Rafael
  • Journal of Molecular Biology, Vol. 348, Issue 1
  • DOI: 10.1016/j.jmb.2005.02.031

Hummingbird : monitoring and analyzing flash X-ray imaging experiments in real time
journal, April 2016

  • Daurer, Benedikt J.; Hantke, Max F.; Nettelblad, Carl
  • Journal of Applied Crystallography, Vol. 49, Issue 3
  • DOI: 10.1107/S1600576716005926

Single Particle X-ray Diffractive Imaging
journal, January 2008

  • Bogan, Michael J.; Benner, W. Henry; Boutet, Sébastien
  • Nano Letters, Vol. 8, Issue 1, p. 310-316
  • DOI: 10.1021/nl072728k

Deep neural networks for classifying complex features in diffraction images
journal, June 2019


Diffusion maps
journal, July 2006

  • Coifman, Ronald R.; Lafon, Stéphane
  • Applied and Computational Harmonic Analysis, Vol. 21, Issue 1
  • DOI: 10.1016/j.acha.2006.04.006

Unsupervised classification of single-particle X-ray diffraction snapshots by spectral clustering
journal, January 2011

  • Yoon, Chun Hong; Schwander, Peter; Abergel, Chantal
  • Optics Express, Vol. 19, Issue 17
  • DOI: 10.1364/OE.19.016542

Systematic determination of order parameters for chain dynamics using diffusion maps
journal, July 2010

  • Ferguson, A. L.; Panagiotopoulos, A. Z.; Debenedetti, P. G.
  • Proceedings of the National Academy of Sciences, Vol. 107, Issue 31
  • DOI: 10.1073/pnas.1003293107

Dragonfly : an implementation of the expand–maximize–compress algorithm for single-particle imaging
journal, June 2016

  • Ayyer, Kartik; Lan, Ti-Yen; Elser, Veit
  • Journal of Applied Crystallography, Vol. 49, Issue 4
  • DOI: 10.1107/S1600576716008165

Conformations of macromolecules and their complexes from heterogeneous datasets
journal, July 2014

  • Schwander, P.; Fung, R.; Ourmazd, A.
  • Philosophical Transactions of the Royal Society B: Biological Sciences, Vol. 369, Issue 1647
  • DOI: 10.1098/rstb.2013.0567

The Coherent X-ray Imaging Data Bank
journal, August 2012


High-resolution structure of viruses from random diffraction snapshots
journal, July 2014

  • Hosseinizadeh, A.; Schwander, P.; Dashti, A.
  • Philosophical Transactions of the Royal Society B: Biological Sciences, Vol. 369, Issue 1647
  • DOI: 10.1098/rstb.2013.0326

Time-resolved serial femtosecond crystallography at the European XFEL
journal, November 2019


The symmetries of image formation by scattering I Theoretical framework
journal, January 2012

  • Giannakis, Dimitrios; Schwander, Peter; Ourmazd, Abbas
  • Optics Express, Vol. 20, Issue 12
  • DOI: 10.1364/OE.20.012799

Megahertz single-particle imaging at the European XFEL
journal, May 2020

  • Sobolev, Egor; Zolotarev, Sergei; Giewekemeyer, Klaus
  • Communications Physics, Vol. 3, Issue 1
  • DOI: 10.1038/s42005-020-0362-y

Linac Coherent Light Source data analysis using psana
journal, March 2016

  • Damiani, D.; Dubrovin, M.; Gaponenko, I.
  • Journal of Applied Crystallography, Vol. 49, Issue 2
  • DOI: 10.1107/S1600576716004349

Phase retrieval algorithms: a comparison
journal, January 1982


Perspectives on single particle imaging with x rays at the advent of high repetition rate x-ray free electron laser sources
journal, July 2020

  • Bielecki, Johan; Maia, Filipe R. N. C.; Mancuso, Adrian P.
  • Structural Dynamics, Vol. 7, Issue 4
  • DOI: 10.1063/4.0000024

An introduction to ROC analysis
journal, June 2006


Considerations for three-dimensional image reconstruction from experimental data in coherent diffractive imaging
journal, September 2018


Maximum Likelihood from Incomplete Data Via the EM Algorithm
journal, September 1977

  • Dempster, A. P.; Laird, N. M.; Rubin, D. B.
  • Journal of the Royal Statistical Society: Series B (Methodological), Vol. 39, Issue 1
  • DOI: 10.1111/j.2517-6161.1977.tb01600.x

Very high brightness and power LCLS-II hard X-ray pulses
journal, April 2019

  • Halavanau, Aliaksei; Decker, Franz-Josef; Emma, Claudio
  • Journal of Synchrotron Radiation, Vol. 26, Issue 3
  • DOI: 10.1107/S1600577519002492

Single-particle imaging without symmetry constraints at an X-ray free-electron laser
journal, September 2018


Membrane protein megahertz crystallography at the European XFEL
journal, November 2019


3D printed nozzles on a silicon fluidic chip
journal, March 2019

  • Bohne, Sven; Heymann, Michael; Chapman, Henry N.
  • Review of Scientific Instruments, Vol. 90, Issue 3
  • DOI: 10.1063/1.5080428

Coherent diffraction of single Rice Dwarf virus particles using hard X-rays at the Linac Coherent Light Source
journal, August 2016

  • Munke, Anna; Andreasson, Jakob; Aquila, Andrew
  • Scientific Data, Vol. 3, Issue 1
  • DOI: 10.1038/sdata.2016.64

Conformational landscape of a virus by single-particle X-ray scattering
journal, August 2017

  • Hosseinizadeh, Ahmad; Mashayekhi, Ghoncheh; Copperman, Jeremy
  • Nature Methods, Vol. 14, Issue 9
  • DOI: 10.1038/nmeth.4395

Perspectives for imaging single protein molecules with the present design of the European XFEL
journal, April 2015

  • Ayyer, Kartik; Geloni, Gianluca; Kocharyan, Vitali
  • Structural Dynamics, Vol. 2, Issue 4
  • DOI: 10.1063/1.4919301

Reconstruction algorithm for single-particle diffraction imaging experiments
journal, August 2009


Single-particle imaging by x-ray free-electron lasers—How many snapshots are needed?
journal, March 2020

  • Poudyal, I.; Schmidt, M.; Schwander, P.
  • Structural Dynamics, Vol. 7, Issue 2
  • DOI: 10.1063/1.5144516

Single-particle structure determination by X-ray free-electron lasers: Possibilities and challenges
journal, July 2015

  • Hosseinizadeh, A.; Dashti, A.; Schwander, P.
  • Structural Dynamics, Vol. 2, Issue 4
  • DOI: 10.1063/1.4919740

Cryptotomography: Reconstructing 3D Fourier Intensities from Randomly Oriented Single-Shot Diffraction Patterns
journal, June 2010


The symmetries of image formation by scattering II Applications
journal, January 2012

  • Schwander, Peter; Giannakis, Dimitrios; Yoon, Chun Hong
  • Optics Express, Vol. 20, Issue 12
  • DOI: 10.1364/OE.20.012827

The linac coherent light source single particle imaging road map
journal, July 2015

  • Aquila, A.; Barty, A.; Bostedt, C.
  • Structural Dynamics, Vol. 2, Issue 4
  • DOI: 10.1063/1.4918726

The Adaptive Gain Integrating Pixel Detector at the European XFEL
journal, January 2019

  • Allahgholi, Aschkan; Becker, Julian; Delfs, Annette
  • Journal of Synchrotron Radiation, Vol. 26, Issue 1
  • DOI: 10.1107/S1600577518016077

Coherent soft X-ray diffraction imaging of coliphage PR772 at the Linac coherent light source
journal, June 2017

  • Reddy, Hemanth K. N.; Yoon, Chun Hong; Aquila, Andrew
  • Scientific Data, Vol. 4, Issue 1
  • DOI: 10.1038/sdata.2017.79

X-ray lasers for structural and dynamic biology
journal, September 2012


Data Fusion and Multicue Data Matching by Diffusion Maps
journal, November 2006

  • Lafon, S.; Keller, Y.; Coifman, R. R.
  • IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 28, Issue 11
  • DOI: 10.1109/TPAMI.2006.223

Sorting algorithms for single-particle imaging experiments at X-ray free-electron lasers
journal, October 2015

  • Bobkov, S. A.; Teslyuk, A. B.; Kurta, R. P.
  • Journal of Synchrotron Radiation, Vol. 22, Issue 6
  • DOI: 10.1107/S1600577515017348

Gas dynamic virtual nozzle for generation of microscopic droplet streams
journal, September 2008

  • DePonte, D. P.; Weierstall, U.; Schmidt, K.
  • Journal of Physics D: Applied Physics, Vol. 41, Issue 19, Article No. 195505
  • DOI: 10.1088/0022-3727/41/19/195505

The next big hit in molecule Hollywood
journal, April 2017


X-ray image reconstruction from a diffraction pattern alone
journal, October 2003


Large-format, high-speed, X-ray pnCCDs combined with electron and ion imaging spectrometers in a multipurpose chamber for experiments at 4th generation light sources
journal, March 2010

  • Strüder, Lothar; Epp, Sascha; Rolles, Daniel
  • Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 614, Issue 3
  • DOI: 10.1016/j.nima.2009.12.053