Title: Real-Time XFEL Data Analysis at SLAC and NERSC: a Trial Run of Nascent Exascale Experimental Data Analysis

Journal Article
OSTI ID:1827927
 [1];  [1];  [1];  [1];  [1];  [2];  [2];  [1];  [1]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  2. SLAC National Accelerator Lab., Menlo Park, CA (United States)

X-ray scattering experiments using Free Electron Lasers (XFELs) are a powerful tool to determine the molecular structure and function of unknown samples (such as COVID-19 viral proteins). XFEL experiments challenge computing in two ways: i) due to the high cost of running XFELs, a fast turnaround time from data acquisition to data analysis is essential to make informed decisions on experimental protocols; ii) data collection rates are growing exponentially, requiring new scalable algorithms. Here we report our experiences analyzing data from two experiments at the Linac Coherent Light Source (LCLS) during September 2020. Raw data were analyzed on NERSC's Cori XC40 system, using the Superfacility paradigm: our workflow automatically moves raw data between LCLS and NERSC, where they are analyzed using the software package CCTBX. We achieved real-time data analysis with a turnaround time from data acquisition to full molecular reconstruction of as little as 10 min, which is sufficient time for the experiment's operators to make informed decisions. By hosting the data analysis on Cori, and by automating LCLS-NERSC interoperability, we achieved a data analysis rate that matches the data acquisition rate. Furthermore, completing data analysis within 10 min is a first for XFEL experiments and an important milestone if we are to keep up with data collection trends.

Research Organization:
Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER); National Institutes of Health (NIH)
DOE Contract Number:
AC02-05CH11231; AC02-76SF00515; GM117126
Country of Publication:
United States
Language:
English

Similar Records

Real-time XFEL data analysis at SLAC and NERSC: A trial run of nascent exascale experimental data analysis
Journal Article · Tue Feb 13 00:00:00 EST 2024 · Concurrency and Computation. Practice and Experience · OSTI ID:1827927

Preparing NERSC users for Cori, a Cray XC40 system with Intel many integrated cores
Journal Article · Fri Aug 25 00:00:00 EDT 2017 · Concurrency and Computation. Practice and Experience · OSTI ID:1827927

STAR Data Production Workflow on HPC: Lessons Learned & Best Practices
Conference · Sun Mar 10 00:00:00 EST 2019 · OSTI ID:1827927
