Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Gold Standard for macromolecular crystallography diffraction data

Journal Article · · IUCrJ

Macromolecular crystallography (MX) is the dominant means of determining the three-dimensional structures of biological macromolecules. Over the last few decades, most MX data have been collected at synchrotron beamlines using a large number of different detectors produced by various manufacturers and taking advantage of various protocols and goniometries. These data came in their own formats: sometimes proprietary, sometimes open. The associated metadata rarely reached the degree of completeness required for data management according to Findability, Accessibility, Interoperability and Reusability (FAIR) principles. Efforts to reuse old data by other investigators or even by the original investigators some time later were often frustrated. In the culmination of an effort dating back more than two decades, a large portion of the research community concerned with high data-rate macromolecular crystallography (HDRMX) has now agreed to an updated specification of data and metadata for diffraction images produced at synchrotron light sources and X-ray free-electron lasers (XFELs). This `Gold Standard' will facilitate the processing of data sets independent of the facility at which they were collected and enable data archiving according to FAIR principles, with a particular focus on interoperability and reusability. This agreed standard builds on the NeXus/HDF5 NXmx application definition and the International Union of Crystallography (IUCr) imgCIF/CBF dictionary, and it is compatible with major data-processing programs and pipelines. Just as with the IUCr CBF/imgCIF standard from which it arose and to which it is tied, the NeXus/HDF5 NXmx Gold Standard application definition is intended to be applicable to all detectors used for crystallography, and all hardware and software developers in the field are encouraged to adopt and contribute to the standard.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
Hungarian Government; National Institutes of Health (NIH); USDOE Office of Science (SC), Basic Energy Sciences (BES) (SC-22); USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC02-05CH11231; AC02-98CH10886; SC0012704
OSTI ID:
1638110
Alternate ID(s):
OSTI ID: 1676386
OSTI ID: 1690168
Journal Information:
IUCrJ, Journal Name: IUCrJ Journal Issue: 5 Vol. 7; ISSN IUCRAJ; ISSN 2052-2525
Publisher:
International Union of CrystallographyCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (30)

The protein data bank: A computer-based archival file for macromolecular structures journal May 1977
[20] Processing of X-ray diffraction data collected in oscillation mode book January 1997
The FAIR Guiding Principles for scientific data management and stewardship journal March 2016
FACT and FAIR with Big Data allows objectivity in science: The view of crystallography journal September 2019
Best practices for high data-rate macromolecular crystallography (HDRMX) journal January 2020
JUNGFRAU detector for brighter x-ray sources: Solutions for IT and data science challenges in macromolecular crystallography journal January 2020
Predicting the X-ray lifetime of protein crystals journal December 2013
From then till now: changing data collection methods in single crystal X-ray crystallography since 1912 journal June 2019
Meeting Report: Workshop on Beamline Integration and Data Formatting journal September 2013
McStas, a general software package for neutron ray-tracing simulations journal January 1999
Operation and performance of the JUNGFRAU photon detector during first FEL and synchrotron experiments journal November 2018
xia2 : an expert system for macromolecular crystallography data reduction journal December 2009
What every experimentalist needs to know about recording essential metadata of primary ( i.e. raw) diffraction data journal December 2017
The crystallographic information file (CIF): a new standard archive file for crystallography journal November 1991
XDS journal January 2010
Integration, scaling, space-group assignment and post-refinement journal January 2010
iMOSFLM : a new graphical interface for diffraction-image processing with MOSFLM journal March 2011
Data processing and analysis with the autoPROC toolbox journal March 2011
Deposition of structure factors at the Protein Data Bank journal January 1999
Experiences with making diffraction image data available: what metadata do we need to archive? journal September 2014
The NeXus data format journal January 2015
OnDA : online data analysis and feedback for serial X-ray imaging journal May 2016
Experimental station Bernina at SwissFEL: condensed matter physics on femtosecond time scales investigated by X-ray diffraction and spectroscopic methods journal April 2019
The Karabo distributed control system journal August 2019
Raw diffraction data preservation and reuse: overview, update on practicalities and metadata requirements journal January 2017
DIALS : implementation and evaluation of a new integration package journal February 2018
Announcing mandatory submission of PDBx/mmCIF format files for crystallographic depositions to the Protein Data Bank (PDB) journal April 2019
Characterization and Calibration of PILATUS Detectors journal June 2009
A new golden age for computer architecture journal January 2019
A Robust, Format-Agnostic Scientific Data Transfer Framework journal January 2016