DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: Analysis of impact metrics for the Protein Data Bank

Abstract

Since 1971, the Protein Data Bank (PDB) archive has served as the single, global repository for open access to atomic-level data for biological macromolecules. The archive currently holds >140,000 structures (>1 billion atoms). These structures are the molecules of life found in all organisms. Knowing the 3D structure of a biological macromolecule is essential for understanding the molecule’s function, providing insights in health and disease, food and energy production, and other topics of concern to prosperity and sustainability. PDB data are freely and publicly available, without restrictions on usage. Through bibliometric and usage studies, we sought to determine the impact of the PDB across disciplines and demographics. Our analysis shows that even though research areas such as molecular biology and biochemistry account for the most usage, other fields are increasingly using PDB resources. PDB usage is seen across 150 disciplines in applied sciences, humanities, and social sciences. Data are also re-used and integrated with >400 resources. Our study identifies trends in PDB usage and documents its utility across research disciplines.

Authors:
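Markosian, Christopher [1]; Di Costanzo, Luigi [1]; Sekharan, Monica [1]; Shao, Chenghua [1]; Burley, Stephen K. [2]; Zardecki, Christine [1]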
  1. Rutgers Univ., Piscataway, NJ (United States)
  2. Rutgers Univ., Piscataway, NJ (United States); Univ. of California San Diego, La Jolla, CA (United States)
Publication Date:
October 16, 2018
Research Org.:
Rutgers Univ., Piscataway, NJ (United States)
Sponsoring Org.:
USDOE; National Science Foundation (NSF); National Institutes of Health (NIH)
OSTI Identifier:
1624555
Grant/Contract Number:  
NSF-DBI 1338415
Resource Type:
Accepted Manuscript
Journal Name:
Scientific Data
Additional Journal Information:
Journal Volume: 5; Journal Issue: 1; Journal ID: ISSN 2052-4463
Publisher:
Nature Publishing Group
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; science and technology; databases; literature mining; publishing; structural biology

Citation Formats

Markosian, Christopher, Di Costanzo, Luigi, Sekharan, Monica, Shao, Chenghua, Burley, Stephen K., and Zardecki, Christine. Analysis of impact metrics for the Protein Data Bank. United States: N. p., 2018. Web. doi:10.1038/sdata.2018.212.
Markosian, Christopher, Di Costanzo, Luigi, Sekharan, Monica, Shao, Chenghua, Burley, Stephen K., & Zardecki, Christine (2018). Analysis of impact metrics for the Protein Data Bank. United States. https://doi.org/10.1038/sdata.2018.212
Markosian, Christopher, Di Costanzo, Luigi, Sekharan, Monica, Shao, Chenghua, Burley, Stephen K., and Zardecki, Christine. 2018. "Analysis of impact metrics for the Protein Data Bank". United States. https://doi.org/10.1038/sdata.2018.212. https://www.osti.gov/servlets/purl/1624555.
@article{osti_1624555,
title = {Analysis of impact metrics for the Protein Data Bank},
author = {Markosian, Christopher and Di Costanzo, Luigi and Sekharan, Monica and Shao, Chenghua and Burley, Stephen K. and Zardecki, Christine},
abstractNote = {Since 1971, the Protein Data Bank (PDB) archive has served as the single, global repository for open access to atomic-level data for biological macromolecules. The archive currently holds >140,000 structures (>1 billion atoms). These structures are the molecules of life found in all organisms. Knowing the 3D structure of a biological macromolecule is essential for understanding the molecule’s function, providing insights in health and disease, food and energy production, and other topics of concern to prosperity and sustainability. PDB data are freely and publicly available, without restrictions on usage. Through bibliometric and usage studies, we sought to determine the impact of the PDB across disciplines and demographics. Our analysis shows that even though research areas such as molecular biology and biochemistry account for the most usage, other fields are increasingly using PDB resources. PDB usage is seen across 150 disciplines in applied sciences, humanities, and social sciences. Data are also re-used and integrated with >400 resources. Our study identifies trends in PDB usage and documents its utility across research disciplines.},
doi = {10.1038/sdata.2018.212},
journal = {Scientific Data},
number = 1,
volume = 5,
place = {United States},
year = {2018},
month = {10}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Figures / Tables:

Figure 1: Number of publications for the top-assigned Web of Science Journal Subject Category for all documents (2000–2016) citing the inaugural Berman et al. (2000) reference. Biochemistry Molecular Biology is the largest category (6,735 publications), followed by Biophysics (2,872), Biochemical Research Methods (2,161), Computer Science Interdisciplinary Applications (1,852), Chemistry Medicinal (1,666), Chemistry Multidisciplinary (1,660), Mathematical Computational Biology (1,656), Biotechnology Applied Microbiology (1,297), Chemistry Physical (871), and Multidisciplinary Sciences (789).

Works referenced in this record:

Crystallography: Protein Data Bank
journal, October 1971


The Protein Data Bank: a historical perspective
journal, December 2007

  • Berman, Helen M.
  • Acta Crystallographica Section A Foundations of Crystallography, Vol. 64, Issue 1
  • DOI: 10.1107/S0108767307035623

Announcing the worldwide Protein Data Bank
journal, December 2003

  • Berman, Helen; Henrick, Kim; Nakamura, Haruki
  • Nature Structural & Molecular Biology, Vol. 10, Issue 12
  • DOI: 10.1038/nsb1203-980

The Protein Data Bank
journal, January 2000


PDBe: improved accessibility of macromolecular structure data from PDB and EMDB
journal, October 2015

  • Velankar, Sameer; van Ginkel, Glen; Alhroub, Younes
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1047

Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures
journal, October 2016

  • Kinjo, Akira R.; Bekker, Gert-Jan; Suzuki, Hirofumi
  • Nucleic Acids Research, Vol. 45, Issue D1
  • DOI: 10.1093/nar/gkw962

OneDep: Unified wwPDB System for Deposition, Biocuration, and Validation of Macromolecular Structures in the PDB Archive
journal, March 2017


The top 100 papers
journal, October 2014

  • Van Noorden, Richard; Maher, Brendan; Nuzzo, Regina
  • Nature, Vol. 514, Issue 7524
  • DOI: 10.1038/514550a

Citing a Data Repository: A Case Study of the Protein Data Bank
journal, August 2015


Strategies for design of improved biocatalysts for industrial applications
journal, December 2017


Imaging and the Human Brain Project: A Review
journal, January 2002

  • Rosse, C.; Brinkley, J. F.
  • Methods of Information in Medicine, Vol. 41, Issue 04
  • DOI: 10.1055/s-0038-1634485

Bioinformatics and Management Science: Some Common Tools and Techniques
journal, April 2004


Biomorphic Presentation of Proteins: Artistic Science or Scientific Art?
journal, June 2013


Digital Design of Molecular Sculptures and Abstractions
journal, February 2011


Electronic Music for Bio-Molecules Using Short Music Phrases
journal, April 2007


Proteins, Immersive Games and Music
journal, April 2006


Protein Sculptures: Life's Building Blocks Inspire Art
journal, February 2005


The FAIR Guiding Principles for scientific data management and stewardship
journal, March 2016

  • Wilkinson, Mark D.; Dumontier, Michel; Aalbersberg, IJsbrand Jan
  • Scientific Data, Vol. 3, Issue 1
  • DOI: 10.1038/sdata.2016.18

The Molecular Biology Database Collection: an online compilation of relevant database resources
journal, January 2000


The 2018 Nucleic Acids Research database issue and the online molecular biology database collection
journal, December 2017

  • Rigden, Daniel J.; Fernández, Xosé M.
  • Nucleic Acids Research, Vol. 46, Issue D1
  • DOI: 10.1093/nar/gkx1235

AffinDB: a freely accessible database of affinities for protein-ligand complexes from the PDB
journal, January 2006


MultitaskProtDB: a database of multitasking proteins
journal, November 2013

  • Hernández, Sergio; Ferragut, Gabriela; Amela, Isaac
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1153

Metabolite identification via the Madison Metabolomics Consortium Database
journal, February 2008

  • Cui, Qiu; Lewis, Ian A.; Hegeman, Adrian D.
  • Nature Biotechnology, Vol. 26, Issue 2
  • DOI: 10.1038/nbt0208-162

REPAIRtoire--a database of DNA repair pathways
journal, November 2010

  • Milanowska, K.; Krwawicz, J.; Papaj, G.
  • Nucleic Acids Research, Vol. 39, Issue Database
  • DOI: 10.1093/nar/gkq1087

Cancer3D: understanding cancer mutations through protein structures
journal, November 2014

  • Porta-Pardo, Eduard; Hrabe, Thomas; Godzik, Adam
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1140

SAbDab: the structural antibody database
journal, November 2013

  • Dunbar, James; Krawczyk, Konrad; Leem, Jinwoo
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1043

VIPERdb: a relational database for structural virology
journal, January 2006


Saccharomyces Genome Database: the genomics resource of budding yeast
journal, November 2011

  • Cherry, J. M.; Hong, E. L.; Amundsen, C.
  • Nucleic Acids Research, Vol. 40, Issue D1
  • DOI: 10.1093/nar/gkr1029

sc-PDB: an Annotated Database of Druggable Binding Sites from the Protein Data Bank
journal, January 2006

  • Kellenberger, Esther; Muller, Pascal; Schalon, Claire
  • Journal of Chemical Information and Modeling, Vol. 46, Issue 2
  • DOI: 10.1021/ci050372x

The 24th annual Nucleic Acids Research database issue: a look back and upcoming changes
journal, January 2017

  • Galperin, Michael Y.; Fernández-Suárez, Xosé M.; Rigden, Daniel J.
  • Nucleic Acids Research, Vol. 45, Issue D1
  • DOI: 10.1093/nar/gkw1188

BioMagResBank
journal, December 2007

  • Ulrich, E. L.; Akutsu, H.; Doreleijers, J. F.
  • Nucleic Acids Research, Vol. 36, Issue Database
  • DOI: 10.1093/nar/gkm957

PDBe: towards reusable data delivery infrastructure at protein data bank in Europe
journal, November 2017

  • Mir, Saqib; Alhroub, Younes; Anyango, Stephen
  • Nucleic Acids Research, Vol. 46, Issue D1
  • DOI: 10.1093/nar/gkx1070

Patterns of database citation in articles and patents indicate long-term scientific and industry value of biological data resources
journal, February 2016


A comparison of two techniques for bibliometric mapping: Multidimensional scaling and VOS
journal, December 2010

  • van Eck, Nees Jan; Waltman, Ludo; Dekker, Rommert
  • Journal of the American Society for Information Science and Technology, Vol. 61, Issue 12
  • DOI: 10.1002/asi.21421

Software survey: VOSviewer, a computer program for bibliometric mapping
journal, December 2009


MMDB and VAST+: tracking structural similarities between macromolecular complexes
journal, December 2013

  • Madej, Thomas; Lanczycki, Christopher J.; Zhang, Dachuan
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1208

PDBsum additions
journal, October 2013

  • de Beer, Tjaart A. P.; Berka, Karel; Thornton, Janet M.
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt940

SCOP2 prototype: a new approach to protein structure mining
journal, November 2013

  • Andreeva, Antonina; Howorth, Dave; Chothia, Cyrus
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1242

The SUPERFAMILY 1.75 database in 2014: a doubling of data
journal, November 2014

  • Oates, Matt E.; Stahlhacke, Jonathan; Vavoulis, Dimitrios V.
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1041

The SWISS-MODEL Repository—new features and functionality
journal, November 2016

  • Bienert, Stefan; Waterhouse, Andrew; de Beer, Tjaart A. P.
  • Nucleic Acids Research, Vol. 45, Issue D1
  • DOI: 10.1093/nar/gkw1132

ChEBI in 2016: Improved services and an expanding collection of metabolites
journal, October 2015

  • Hastings, Janna; Owen, Gareth; Dekker, Adriano
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1031

PubChem Substance and Compound databases
journal, September 2015

  • Kim, Sunghwan; Thiessen, Paul A.; Bolton, Evan E.
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv951

GPCRdb: an information system for G protein-coupled receptors
journal, November 2015

  • Isberg, Vignir; Mordalski, Stefan; Munk, Christian
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1178

Twenty years of the MEROPS database of proteolytic enzymes, their substrates and inhibitors
journal, November 2015

  • Rawlings, Neil D.; Barrett, Alan J.; Finn, Robert
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1118

CDD: NCBI's conserved domain database
journal, November 2014

  • Marchler-Bauer, Aron; Derbyshire, Myra K.; Gonzales, Noreen R.
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1221

CATH: comprehensive structural and functional annotations for genome sequences
journal, October 2014

  • Sillitoe, Ian; Lewis, Tony E.; Cuff, Alison
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku947

The Pfam protein families database: towards a more sustainable future
journal, December 2015

  • Finn, Robert D.; Coggill, Penelope; Eberhardt, Ruth Y.
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1344

SMART: recent updates, new developments and status in 2015
journal, October 2014

  • Letunic, Ivica; Doerks, Tobias; Bork, Peer
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku949

ELM 2016—data update and new functionality of the eukaryotic linear motif resource
journal, November 2015

  • Dinkel, Holger; Van Roey, Kim; Michael, Sushama
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1291

New and continuing developments at PROSITE
journal, November 2012

  • Sigrist, Christian J. A.; de Castro, Edouard; Cerutti, Lorenzo
  • Nucleic Acids Research, Vol. 41, Issue D1
  • DOI: 10.1093/nar/gks1067

dbPTM 2016: 10-year anniversary of a resource for post-translational modification of proteins
journal, November 2015

  • Huang, Kai-Yao; Su, Min-Gang; Kao, Hui-Ju
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1240

PIRSF: family classification system at the Protein Information Resource
journal, January 2004


The Transporter Classification Database (TCDB): recent advances
journal, November 2015

  • Saier, Milton H.; Reddy, Vamsee S.; Tsu, Brian V.
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1103

UniProt: the universal protein knowledgebase
journal, November 2016


STITCH 5: augmenting protein–chemical interaction networks with tissue and affinity data
journal, November 2015

  • Szklarczyk, Damian; Santos, Alberto; von Mering, Christian
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1277

STRING v10: protein–protein interaction networks, integrated over the tree of life
journal, October 2014

  • Szklarczyk, Damian; Franceschini, Andrea; Wyder, Stefan
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1003

The carbohydrate-active enzymes database (CAZy) in 2013
journal, November 2013

  • Lombard, Vincent; Golaconda Ramulu, Hemalatha; Drula, Elodie
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1178

The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases
journal, November 2015

  • Caspi, Ron; Billington, Richard; Ferrer, Luciana
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1164

HMDB 3.0—The Human Metabolome Database in 2013
journal, November 2012

  • Wishart, David S.; Jewison, Timothy; Guo, An Chi
  • Nucleic Acids Research, Vol. 41, Issue D1
  • DOI: 10.1093/nar/gks1065

MODOMICS: a database of RNA modification pathways—2013 update
journal, October 2012

  • Machnicka, Magdalena A.; Milanowska, Kaja; Osman Oglou, Okan
  • Nucleic Acids Research, Vol. 41, Issue D1
  • DOI: 10.1093/nar/gks1007

The Reactome pathway Knowledgebase
journal, December 2015

  • Fabregat, Antonio; Sidiropoulos, Konstantinos; Garapati, Phani
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1351

EcoCyc: fusing model organism databases with systems biology
journal, November 2012

  • Keseler, Ingrid M.; Mackie, Amanda; Peralta-Gil, Martin
  • Nucleic Acids Research, Vol. 41, Issue D1
  • DOI: 10.1093/nar/gks1027

VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases
journal, December 2014

  • Giraldo-Calderón, Gloria I.; Emrich, Scott J.; MacCallum, Robert M.
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1117

The Candida Genome Database: The new homology information page highlights protein similarity and phylogeny
journal, October 2013

  • Binkley, Jonathan; Arnaud, Martha B.; Inglis, Diane O.
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1046

The Saccharomyces Genome Database Variant Viewer
journal, November 2015

  • Sheppard, Travis K.; Hitz, Benjamin C.; Engel, Stacia R.
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1250

Genenames.org: the HGNC resources in 2015
journal, October 2014

  • Gray, Kristian A.; Yates, Bethan; Seal, Ruth L.
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1071

KEGG: new perspectives on genomes, pathways, diseases and drugs
journal, November 2016

  • Kanehisa, Minoru; Furumichi, Miho; Tanabe, Mao
  • Nucleic Acids Research, Vol. 45, Issue D1
  • DOI: 10.1093/nar/gkw1092

canSAR: an updated cancer research and drug discovery knowledgebase
journal, December 2015

  • Tym, Joseph E.; Mitsopoulos, Costas; Coker, Elizabeth A.
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1030

COSMIC: exploring the world's knowledge of somatic mutations in human cancer
journal, October 2014

  • Forbes, Simon A.; Beare, David; Gunasekaran, Prasad
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1075

The ChEMBL bioactivity database: an update
journal, November 2013

  • Bento, A. Patrícia; Gaulton, Anna; Hersey, Anne
  • Nucleic Acids Research, Vol. 42, Issue D1
  • DOI: 10.1093/nar/gkt1031

JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles
journal, November 2015

  • Mathelier, Anthony; Fornes, Oriol; Arenillas, David J.
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1176

DNA data bank of Japan (DDBJ) progress report
journal, November 2015

  • Mashima, Jun; Kodama, Yuichi; Kosuge, Takehide
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1105

Biocuration of functional annotation at the European nucleotide archive
journal, November 2015

  • Gibson, Richard; Alako, Blaise; Amid, Clara
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1311

The immune epitope database (IEDB) 3.0
journal, October 2014

  • Vita, Randi; Overton, James A.; Greenbaum, Jason A.
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku938

IMGT®, the international ImMunoGeneTics information system® 25 years on
journal, November 2014

  • Lefranc, Marie-Paule; Giudicelli, Véronique; Duroux, Patrice
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1056

NONCODE 2016: an informative and valuable data source of long non-coding RNAs
journal, November 2015

  • Zhao, Yi; Li, Hui; Fang, Shuangsang
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1252

Rfam 12.0: updates to the RNA families database
journal, November 2014

  • Nawrocki, Eric P.; Burge, Sarah W.; Bateman, Alex
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1063

Update of the FANTOM web resource: from mammalian transcriptional landscape to its dynamic regulation
journal, November 2010

  • Kawaji, H.; Severin, J.; Lizio, M.
  • Nucleic Acids Research, Vol. 39, Issue Database
  • DOI: 10.1093/nar/gkq1112

The neXtProt knowledgebase on human proteins: current status
journal, January 2015

  • Gaudet, Pascale; Michel, Pierre-André; Zahn-Zabal, Monique
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1178

Ensembl Genomes 2016: more genomes, more complexity
journal, November 2015

  • Kersey, Paul Julian; Allen, James E.; Armean, Irina
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1209

Mouse genome database 2016
journal, November 2015

  • Bult, Carol J.; Eppig, Janan T.; Blake, Judith A.
  • Nucleic Acids Research, Vol. 44, Issue D1
  • DOI: 10.1093/nar/gkv1211

The Rat Genome Database 2015: genomic, phenotypic and environmental variations and disease
journal, October 2014

  • Shimoyama, Mary; De Pons, Jeff; Hayman, G. Thomas
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1026

ArrayExpress update—simplifying data submissions
journal, October 2014

  • Kolesnikov, Nikolay; Hastings, Emma; Keays, Maria
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1057

Analysis of Impact Metrics for the Protein Data Bank
collection, January 2018


Adherence to literature search reporting guidelines in leading rheumatology journals’ systematic reviews: umbrella review protocol
journal, August 2022

  • Pérez-Neri, Iván; Pineda, Carlos; Flores-Guerrero, Jose L.
  • Rheumatology International, Vol. 42, Issue 12
  • DOI: 10.1007/s00296-022-05194-1

Evidence for an association of interferon gene variants with sudden infant death syndrome
journal, January 2019

  • Hafke, Angelina; Schürmann, Peter; Rothämel, Thomas
  • International Journal of Legal Medicine, Vol. 133, Issue 3
  • DOI: 10.1007/s00414-018-1974-6

Utilizing MALDI-TOF MS and LC-MS/MS to access serum peptidome-based biomarkers in canine oral tumors
journal, December 2022

  • Ploypetch, Sekkarin; Jaresitthikunchai, Janthima; Phaonakrop, Narumon
  • Scientific Reports, Vol. 12, Issue 1
  • DOI: 10.1038/s41598-022-26132-y

Mutation analysis of the entire mitochondrial genome using denaturing high performance liquid chromatography
journal, October 2000


The 24th annual Nucleic Acids Research database issue: a look back and upcoming changes
journal, January 2017

  • Galperin, Michael Y.; Fernández-Suárez, Xosé M.; Rigden, Daniel J.
  • Nucleic Acids Research
  • DOI: 10.1093/nar/gkx021

Response to On prompt update of literature references in the Protein Data Bank
journal, September 2014

  • Berman, Helen M.; Burley, Stephen K.; Kleywegt, Gerard J.
  • Acta Crystallographica Section D Biological Crystallography, Vol. 70, Issue 10
  • DOI: 10.1107/s1399004714020513

A comparison of two techniques for bibliometric mapping: Multidimensional scaling and VOS
preprint, January 2010


Works referencing / citing this record:

DRAMP 2.0, an updated data repository of antimicrobial peptides
journal, August 2019


RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy
journal, October 2018

  • Burley, Stephen K.; Berman, Helen M.; Bhikadiya, Charmi
  • Nucleic Acids Research, Vol. 47, Issue D1
  • DOI: 10.1093/nar/gky1004

Figures/Tables have been extracted from DOE-funded journal article accepted manuscripts.