DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Analysis of impact metrics for the Protein Data Bank

Journal Article · Scientific Data
 [1];  [2];  [2];  [2];  [3];  [2]
  1. Rutgers Univ., Piscataway, NJ (United States); DOE/OSTI
  2. Rutgers Univ., Piscataway, NJ (United States)
  3. Rutgers Univ., Piscataway, NJ (United States); Univ. of California San Diego, La Jolla, CA (United States)

Since 1971, the Protein Data Bank (PDB) archive has served as the single, global repository for open access to atomic-level data for biological macromolecules. The archive currently holds >140,000 structures (>1 billion atoms). These structures are the molecules of life found in all organisms. Knowing the 3D structure of a biological macromolecule is essential for understanding the molecule’s function, providing insights into health and disease, food and energy production, and other topics of concern to prosperity and sustainability. PDB data are freely and publicly available, without restrictions on usage. Through bibliometric and usage studies, we sought to determine the impact of the PDB across disciplines and demographics. Our analysis shows that even though research areas such as molecular biology and biochemistry account for the most usage, other fields are increasingly using PDB resources. PDB usage is seen across 150 disciplines in applied sciences, humanities, and social sciences. Data are also re-used and integrated with >400 resources. Our study identifies trends in PDB usage and documents its utility across research disciplines.

Research Organization:
Rutgers Univ., Piscataway, NJ (United States)
Sponsoring Organization:
National Institutes of Health (NIH); National Science Foundation (NSF); USDOE
OSTI ID:
1624555
Journal Information:
Scientific Data, Vol. 5, Issue 1; ISSN 2052-4463
Publisher:
Nature Publishing Group
Country of Publication:
United States
Language:
English
