Title: RCSB Protein Data Bank: Architectural Advances Towards Integrated Searching and Efficient Access to Macromolecular Structure Data from the PDB Archive

Journal Article · Journal of Molecular Biology

The US Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) serves many millions of unique users worldwide by delivering experimentally-determined 3D structures of biomolecules integrated with >40 external data resources via RCSB.org, application programming interfaces (APIs), and FTP downloads. Herein, we present the architectural redesign of RCSB PDB data delivery services that build on existing PDBx/mmCIF data schemas. New data access APIs (data.rcsb.org) enable efficient delivery of all PDB archive data. A novel GraphQL-based API provides flexible, declarative data retrieval along with a simple-to-use REST API. A powerful new search system (search.rcsb.org) seamlessly integrates heterogeneous types of searches across the PDB archive. Searches may combine text attributes, protein or nucleic acid sequences, small-molecule chemical descriptors, 3D macromolecular shapes, and sequence motifs. The new RCSB.org architecture adheres to the FAIR Principles, empowering users to address a wide array of research problems in fundamental biology, biomedicine, biotechnology, bioengineering, and bioenergy.
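As a rough illustration of the three access routes the abstract names (a REST call and a GraphQL query against data.rcsb.org, and a combined query against search.rcsb.org), the sketch below only builds the requests and does not send them. The URL paths, the GraphQL field names (`entry`, `struct`, `title`), the search node layout, and the attribute `rcsb_entry_info.resolution_combined` are assumptions based on the publicly documented RCSB services, not details given in this record; the helper names are hypothetical. Check them against the current API documentation before relying on them.

```python
import json
import urllib.request

# Assumed endpoint paths; this record only names the hosts.
DATA_REST_URL = "https://data.rcsb.org/rest/v1/core/entry/{entry_id}"
DATA_GRAPHQL_URL = "https://data.rcsb.org/graphql"
SEARCH_URL = "https://search.rcsb.org/rcsbsearch/v2/query"

JSON_HEADERS = {"Content-Type": "application/json"}


def rest_entry_url(entry_id: str) -> str:
    """URL for fetching one entry through the REST API (hypothetical helper)."""
    return DATA_REST_URL.format(entry_id=entry_id.upper())


def graphql_entry_request(entry_id: str) -> urllib.request.Request:
    """POST request asking GraphQL for only the entry title (declarative retrieval)."""
    # json.dumps quotes the identifier so it is a valid GraphQL string literal.
    query = "{ entry(entry_id: %s) { struct { title } } }" % json.dumps(entry_id.upper())
    body = json.dumps({"query": query}).encode("utf-8")
    return urllib.request.Request(DATA_GRAPHQL_URL, data=body, headers=JSON_HEADERS, method="POST")


def search_request(text: str, max_resolution: float) -> urllib.request.Request:
    """POST request combining a full-text search with a text-attribute filter."""
    payload = {
        "query": {
            "type": "group",
            "logical_operator": "and",
            "nodes": [
                # Free-text match across the archive.
                {"type": "terminal", "service": "full_text", "parameters": {"value": text}},
                # Attribute comparison; the attribute name is an assumption.
                {
                    "type": "terminal",
                    "service": "text",
                    "parameters": {
                        "attribute": "rcsb_entry_info.resolution_combined",
                        "operator": "less",
                        "value": max_resolution,
                    },
                },
            ],
        },
        "return_type": "entry",
    }
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(SEARCH_URL, data=body, headers=JSON_HEADERS, method="POST")


def send(request: urllib.request.Request) -> dict:
    """Send a prepared request and decode the JSON response (requires network access)."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

To try it against the live services, one could call `send(graphql_entry_request("4HHB"))` or `send(search_request("hemoglobin", 2.0))`. Keeping request construction separate from sending makes it easy to inspect or adjust the payloads if the service schemas differ from what is assumed here.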

Research Organization:
Rutgers Univ., Piscataway, NJ (United States)
Sponsoring Organization:
USDOE Office of Science (SC); National Science Foundation (NSF); National Cancer Institute (NCI); National Institute of Allergy and Infectious Diseases (NIAID); National Institute of General Medical Sciences (NIGMS)
Grant/Contract Number:
SC0019749; DBI-1832184; R01GM133198
OSTI ID:
1769487
Alternate ID(s):
OSTI ID: 1853378
Journal Information:
Journal of Molecular Biology, Vol. 433, Issue 11; ISSN 0022-2836
Publisher:
Elsevier
Country of Publication:
United Kingdom
Language:
English
