DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Extension of the sasCIF format and its applications for data processing and deposition

Abstract

Recent advances in small-angle scattering (SAS) experimental facilities and data analysis methods have prompted a dramatic increase in the number of users and of projects conducted, causing an upsurge in the number of objects studied, experimental data available and structural models generated. To organize the data and models and make them accessible to the community, the Task Forces on SAS and hybrid methods for the International Union of Crystallography and the Worldwide Protein Data Bank envisage developing a federated approach to SAS data and model archiving. Within the framework of this approach, the existing databases may exchange information and provide independent but synchronized entries to users. At present, ways of exchanging information between the various SAS databases are not established, leading to possible duplication and incompatibility of entries, and limiting the opportunities for data-driven research for SAS users. In this work, a solution is developed to resolve these issues and provide a universal exchange format for the community, based on the use of the widely adopted crystallographic information framework (CIF). The previous version of the sasCIF format, implemented as an extension of the core CIF dictionary, has been available since 2000 to facilitate SAS data exchange between laboratories. The sasCIFmore » format has now been extended to describe comprehensively the necessary experimental information, results and models, including relevant metadata for SAS data analysis and for deposition into a database. Processing tools for these files (sasCIFtools) have been developed, and these are available both as standalone open-source programs and integrated into the SAS Biological Data Bank, allowing the export and import of data entries as sasCIF files. Software modules to save the relevant information directly from beamline data-processing pipelines in sasCIF format are also developed. Lastly, this update of sasCIF and the relevant tools are an important step in the standardization of the way SAS data are presented and exchanged, to make the results easily accessible to users and to promote further the application of SAS in the structural biology community.« less

Authors:
 [1];  [2];  [1]
  1. Hamburg Outstation (Germany). European Molecular Biology Lab.
  2. Rutgers Univ., Piscataway, NJ (United States). Dept. of Chemistry and Chemical Biology and Center for Integrative Proteomics Research
Publication Date:
Research Org.:
Rutgers Univ., Piscataway, NJ (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1345572
Grant/Contract Number:  
SC0008434
Resource Type:
Accepted Manuscript
Journal Name:
Journal of Applied Crystallography (Online)
Additional Journal Information:
Journal Name: Journal of Applied Crystallography (Online); Journal Volume: 49; Journal Issue: 1; Journal ID: ISSN 1600-5767
Publisher:
International Union of Crystallography
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; 96 KNOWLEDGE MANAGEMENT AND PRESERVATION; sasCIF; CIF; small-angle scattering; computer programs

Citation Formats

Kachala, Michael, Westbrook, John, and Svergun, Dmitri. Extension of the sasCIF format and its applications for data processing and deposition. United States: N. p., 2016. Web. doi:10.1107/s1600576715024942.
Kachala, Michael, Westbrook, John, & Svergun, Dmitri. Extension of the sasCIF format and its applications for data processing and deposition. United States. https://doi.org/10.1107/s1600576715024942
Kachala, Michael, Westbrook, John, and Svergun, Dmitri. Mon . "Extension of the sasCIF format and its applications for data processing and deposition". United States. https://doi.org/10.1107/s1600576715024942. https://www.osti.gov/servlets/purl/1345572.
@article{osti_1345572,
title = {Extension of the sasCIF format and its applications for data processing and deposition},
author = {Kachala, Michael and Westbrook, John and Svergun, Dmitri},
abstractNote = {Recent advances in small-angle scattering (SAS) experimental facilities and data analysis methods have prompted a dramatic increase in the number of users and of projects conducted, causing an upsurge in the number of objects studied, experimental data available and structural models generated. To organize the data and models and make them accessible to the community, the Task Forces on SAS and hybrid methods for the International Union of Crystallography and the Worldwide Protein Data Bank envisage developing a federated approach to SAS data and model archiving. Within the framework of this approach, the existing databases may exchange information and provide independent but synchronized entries to users. At present, ways of exchanging information between the various SAS databases are not established, leading to possible duplication and incompatibility of entries, and limiting the opportunities for data-driven research for SAS users. In this work, a solution is developed to resolve these issues and provide a universal exchange format for the community, based on the use of the widely adopted crystallographic information framework (CIF). The previous version of the sasCIF format, implemented as an extension of the core CIF dictionary, has been available since 2000 to facilitate SAS data exchange between laboratories. The sasCIF format has now been extended to describe comprehensively the necessary experimental information, results and models, including relevant metadata for SAS data analysis and for deposition into a database. Processing tools for these files (sasCIFtools) have been developed, and these are available both as standalone open-source programs and integrated into the SAS Biological Data Bank, allowing the export and import of data entries as sasCIF files. Software modules to save the relevant information directly from beamline data-processing pipelines in sasCIF format are also developed. Lastly, this update of sasCIF and the relevant tools are an important step in the standardization of the way SAS data are presented and exchanged, to make the results easily accessible to users and to promote further the application of SAS in the structural biology community.},
doi = {10.1107/s1600576715024942},
journal = {Journal of Applied Crystallography (Online)},
number = 1,
volume = 49,
place = {United States},
year = {Mon Feb 01 00:00:00 EST 2016},
month = {Mon Feb 01 00:00:00 EST 2016}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 15 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Correlation Map, a goodness-of-fit test for one-dimensional X-ray scattering spectra
journal, April 2015

  • Franke, Daniel; Jeffries, Cy M.; Svergun, Dmitri I.
  • Nature Methods, Vol. 12, Issue 5
  • DOI: 10.1038/nmeth.3358

Automated acquisition and analysis of small angle X-ray scattering data
journal, October 2012

  • Franke, Daniel; Kikhney, Alexey G.; Svergun, Dmitri I.
  • Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, Vol. 689
  • DOI: 10.1016/j.nima.2012.06.008

A new method for the evaluation of small-angle scattering data
journal, October 1977


Impact and progress in small and wide angle X-ray scattering (SAXS and WAXS)
journal, October 2013

  • Graewert, Melissa A.; Svergun, Dmitri I.
  • Current Opinion in Structural Biology, Vol. 23, Issue 5
  • DOI: 10.1016/j.sbi.2013.06.007

The STAR file: a new format for electronic data transfer and archiving
journal, May 1991

  • Hall, Sydney R.
  • Journal of Chemical Information and Modeling, Vol. 31, Issue 2
  • DOI: 10.1021/ci00002a020

The STAR File: detailed specifications
journal, May 1994

  • Hall, Sydney R.; Spadaccini, Nick
  • Journal of Chemical Information and Modeling, Vol. 34, Issue 3
  • DOI: 10.1021/ci00019a005

Jmol – a paradigm shift in crystallographic visualization
journal, September 2010


Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS)
journal, July 2009

  • Hura, Greg L.; Menon, Angeli L.; Hammel, Michal
  • Nature Methods, Vol. 6, Issue 8
  • DOI: 10.1038/nmeth.1353

sasCIF: an extension of core Crystallographic Information File for SAS
journal, June 2000


Accuracy of molecular mass determination of proteins in solution by small-angle X-ray scattering
journal, February 2007

  • Mylonas, Efstratios; Svergun, Dmitri I.
  • Journal of Applied Crystallography, Vol. 40, Issue s1
  • DOI: 10.1107/S002188980700252X

New developments in the ATSAS program package for small-angle scattering data analysis
journal, March 2012

  • Petoukhov, Maxim V.; Franke, Daniel; Shkumatov, Alexander V.
  • Journal of Applied Crystallography, Vol. 45, Issue 2
  • DOI: 10.1107/S0021889812007662

Global Rigid Body Modeling of Macromolecular Complexes against Small-Angle Scattering Data
journal, August 2005


RASMOL: biomolecular graphics for all
journal, September 1995


Determination of the regularization parameter in indirect-transform methods using perceptual criteria
journal, August 1992


CRYSOL – a Program to Evaluate X-ray Solution Scattering of Biological Macromolecules from Atomic Coordinates
journal, December 1995

  • Svergun, D.; Barberato, C.; Koch, M. H. J.
  • Journal of Applied Crystallography, Vol. 28, Issue 6
  • DOI: 10.1107/S0021889895007047

Report of the wwPDB Small-Angle Scattering Task Force: Data Requirements for Biomolecular Modeling and the PDB
journal, June 2013


SASBDB, a repository for biological small-angle scattering data
journal, October 2014

  • Valentini, Erica; Kikhney, Alexey G.; Previtali, Gianpietro
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1047

Works referencing / citing this record:

2017 publication guidelines for structural modelling of small-angle scattering data from biomolecules in solution: an update
journal, August 2017

  • Trewhella, Jill; Duff, Anthony P.; Durand, Dominique
  • Acta Crystallographica Section D Structural Biology, Vol. 73, Issue 9
  • DOI: 10.1107/s2059798317011597

Announcing mandatory submission of PDBx/mmCIF format files for crystallographic depositions to the Protein Data Bank (PDB)
journal, April 2019

  • Adams, Paul D.; Afonine, Pavel V.; Baskaran, Kumaran
  • Acta Crystallographica Section D Structural Biology, Vol. 75, Issue 4
  • DOI: 10.1107/s2059798319004522

Archiving and disseminating integrative structure models
journal, July 2019