Protein Data Bank: the single global archive for 3D macromolecular structure data
Abstract
The Protein Data Bank (PDB) is the single global archive of experimentally determined three-dimensional (3D) structure data of biological macromolecules. Since 2003, the PDB has been managed by the Worldwide Protein Data Bank (wwPDB; wwpdb.org), an international consortium that collaboratively oversees deposition, validation, biocuration, and open access dissemination of 3D macromolecular structure data. The PDB Core Archive houses 3D atomic coordinates of more than 144,000 structural models of proteins, DNA/RNA, and their complexes with metals and small molecules and related experimental data and metadata. Structure and experimental data/metadata are also stored in the PDB Core Archive using the readily extensible wwPDB PDBx/mmCIF master data format, which will continue to evolve as data/metadata from new experimental techniques and structure determination methods are incorporated by the wwPDB. Impacts of the recently developed universal wwPDB OneDep deposition/validation/biocuration system and various methods-specific wwPDB Validation Task Forces on improving the quality of structures and data housed in the PDB Core Archive are described together with current challenges and future plans.
- Authors:
- more »
- Publication Date:
- Research Org.:
- Rutgers Univ., New Brunswick, NJ (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER); National Institutes of Health (NIH); National Science Foundation (NSF)
- Contributing Org.:
- wwPDB Consortium
- OSTI Identifier:
- 1479119
- Alternate Identifier(s):
- OSTI ID: 1600527
- Grant/Contract Number:
- SC0019749; DBI-1338415
- Resource Type:
- Published Article
- Journal Name:
- Nucleic Acids Research
- Additional Journal Information:
- Journal Name: Nucleic Acids Research Journal Volume: 47 Journal Issue: D1; Journal ID: ISSN 0305-1048
- Publisher:
- Oxford University Press
- Country of Publication:
- United Kingdom
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES
Citation Formats
Burley, Stephen K., Berman, Helen M., Bhikadiya, Charmi, Bi, Chunxiao, Chen, Li, Costanzo, Luigi Di, Christie, Cole, Duarte, Jose M., Dutta, Shuchismita, Feng, Zukang, Ghosh, Sutapa, Goodsell, David S., Green, Rachel Kramer, Guranovic, Vladimir, Guzenko, Dmytro, Hudson, Brian P., Liang, Yuhe, Lowe, Robert, Peisach, Ezra, Periskova, Irina, Randle, Chris, Rose, Alexander, Sekharan, Monica, Shao, Chenghua, Tao, Yi-Ping, Valasatava, Yana, Voigt, Maria, Westbrook, John, Young, Jasmine, Zardecki, Christine, Zhuravleva, Marina, Kurisu, Genji, Nakamura, Haruki, Kengaku, Yumiko, Cho, Hasumi, Sato, Junko, Kim, Ju Yaen, Ikegawa, Yasuyo, Nakagawa, Atsushi, Yamashita, Reiko, Kudou, Takahiro, Bekker, Gert-Jan, Suzuki, Hirofumi, Iwata, Takeshi, Yokochi, Masashi, Kobayashi, Naohiro, Fujiwara, Toshimichi, Velankar, Sameer, Kleywegt, Gerard J., Anyango, Stephen, Armstrong, David R., Berrisford, John M., Conroy, Matthew J., Dana, Jose M., Deshpande, Mandar, Gane, Paul, Gáborová, Romana, Gupta, Deepti, Gutmanas, Aleksandras, Koča, Jaroslav, Mak, Lora, Mir, Saqib, Mukhopadhyay, Abhik, Nadzirin, Nurul, Nair, Sreenath, Patwardhan, Ardan, Paysan-Lafosse, Typhaine, Pravda, Lukas, Salih, Osman, Sehnal, David, Varadi, Mihaly, Vařeková, Radka, Markley, John L., Hoch, Jeffrey C., Romero, Pedro R., Baskaran, Kumaran, Maziuk, Dimitri, Ulrich, Eldon L., Wedell, Jonathan R., Yao, Hongyang, Livny, Miron, and Ioannidis, Yannis E. Protein Data Bank: the single global archive for 3D macromolecular structure data. United Kingdom: N. p., 2018.
Web. doi:10.1093/nar/gky949.
Burley, Stephen K., Berman, Helen M., Bhikadiya, Charmi, Bi, Chunxiao, Chen, Li, Costanzo, Luigi Di, Christie, Cole, Duarte, Jose M., Dutta, Shuchismita, Feng, Zukang, Ghosh, Sutapa, Goodsell, David S., Green, Rachel Kramer, Guranovic, Vladimir, Guzenko, Dmytro, Hudson, Brian P., Liang, Yuhe, Lowe, Robert, Peisach, Ezra, Periskova, Irina, Randle, Chris, Rose, Alexander, Sekharan, Monica, Shao, Chenghua, Tao, Yi-Ping, Valasatava, Yana, Voigt, Maria, Westbrook, John, Young, Jasmine, Zardecki, Christine, Zhuravleva, Marina, Kurisu, Genji, Nakamura, Haruki, Kengaku, Yumiko, Cho, Hasumi, Sato, Junko, Kim, Ju Yaen, Ikegawa, Yasuyo, Nakagawa, Atsushi, Yamashita, Reiko, Kudou, Takahiro, Bekker, Gert-Jan, Suzuki, Hirofumi, Iwata, Takeshi, Yokochi, Masashi, Kobayashi, Naohiro, Fujiwara, Toshimichi, Velankar, Sameer, Kleywegt, Gerard J., Anyango, Stephen, Armstrong, David R., Berrisford, John M., Conroy, Matthew J., Dana, Jose M., Deshpande, Mandar, Gane, Paul, Gáborová, Romana, Gupta, Deepti, Gutmanas, Aleksandras, Koča, Jaroslav, Mak, Lora, Mir, Saqib, Mukhopadhyay, Abhik, Nadzirin, Nurul, Nair, Sreenath, Patwardhan, Ardan, Paysan-Lafosse, Typhaine, Pravda, Lukas, Salih, Osman, Sehnal, David, Varadi, Mihaly, Vařeková, Radka, Markley, John L., Hoch, Jeffrey C., Romero, Pedro R., Baskaran, Kumaran, Maziuk, Dimitri, Ulrich, Eldon L., Wedell, Jonathan R., Yao, Hongyang, Livny, Miron, & Ioannidis, Yannis E. Protein Data Bank: the single global archive for 3D macromolecular structure data. United Kingdom. https://doi.org/10.1093/nar/gky949
Burley, Stephen K., Berman, Helen M., Bhikadiya, Charmi, Bi, Chunxiao, Chen, Li, Costanzo, Luigi Di, Christie, Cole, Duarte, Jose M., Dutta, Shuchismita, Feng, Zukang, Ghosh, Sutapa, Goodsell, David S., Green, Rachel Kramer, Guranovic, Vladimir, Guzenko, Dmytro, Hudson, Brian P., Liang, Yuhe, Lowe, Robert, Peisach, Ezra, Periskova, Irina, Randle, Chris, Rose, Alexander, Sekharan, Monica, Shao, Chenghua, Tao, Yi-Ping, Valasatava, Yana, Voigt, Maria, Westbrook, John, Young, Jasmine, Zardecki, Christine, Zhuravleva, Marina, Kurisu, Genji, Nakamura, Haruki, Kengaku, Yumiko, Cho, Hasumi, Sato, Junko, Kim, Ju Yaen, Ikegawa, Yasuyo, Nakagawa, Atsushi, Yamashita, Reiko, Kudou, Takahiro, Bekker, Gert-Jan, Suzuki, Hirofumi, Iwata, Takeshi, Yokochi, Masashi, Kobayashi, Naohiro, Fujiwara, Toshimichi, Velankar, Sameer, Kleywegt, Gerard J., Anyango, Stephen, Armstrong, David R., Berrisford, John M., Conroy, Matthew J., Dana, Jose M., Deshpande, Mandar, Gane, Paul, Gáborová, Romana, Gupta, Deepti, Gutmanas, Aleksandras, Koča, Jaroslav, Mak, Lora, Mir, Saqib, Mukhopadhyay, Abhik, Nadzirin, Nurul, Nair, Sreenath, Patwardhan, Ardan, Paysan-Lafosse, Typhaine, Pravda, Lukas, Salih, Osman, Sehnal, David, Varadi, Mihaly, Vařeková, Radka, Markley, John L., Hoch, Jeffrey C., Romero, Pedro R., Baskaran, Kumaran, Maziuk, Dimitri, Ulrich, Eldon L., Wedell, Jonathan R., Yao, Hongyang, Livny, Miron, and Ioannidis, Yannis E. Wed .
"Protein Data Bank: the single global archive for 3D macromolecular structure data". United Kingdom. https://doi.org/10.1093/nar/gky949.
@article{osti_1479119,
title = {Protein Data Bank: the single global archive for 3D macromolecular structure data},
author = {Burley, Stephen K. and Berman, Helen M. and Bhikadiya, Charmi and Bi, Chunxiao and Chen, Li and Costanzo, Luigi Di and Christie, Cole and Duarte, Jose M. and Dutta, Shuchismita and Feng, Zukang and Ghosh, Sutapa and Goodsell, David S. and Green, Rachel Kramer and Guranovic, Vladimir and Guzenko, Dmytro and Hudson, Brian P. and Liang, Yuhe and Lowe, Robert and Peisach, Ezra and Periskova, Irina and Randle, Chris and Rose, Alexander and Sekharan, Monica and Shao, Chenghua and Tao, Yi-Ping and Valasatava, Yana and Voigt, Maria and Westbrook, John and Young, Jasmine and Zardecki, Christine and Zhuravleva, Marina and Kurisu, Genji and Nakamura, Haruki and Kengaku, Yumiko and Cho, Hasumi and Sato, Junko and Kim, Ju Yaen and Ikegawa, Yasuyo and Nakagawa, Atsushi and Yamashita, Reiko and Kudou, Takahiro and Bekker, Gert-Jan and Suzuki, Hirofumi and Iwata, Takeshi and Yokochi, Masashi and Kobayashi, Naohiro and Fujiwara, Toshimichi and Velankar, Sameer and Kleywegt, Gerard J. and Anyango, Stephen and Armstrong, David R. and Berrisford, John M. and Conroy, Matthew J. and Dana, Jose M. and Deshpande, Mandar and Gane, Paul and Gáborová, Romana and Gupta, Deepti and Gutmanas, Aleksandras and Koča, Jaroslav and Mak, Lora and Mir, Saqib and Mukhopadhyay, Abhik and Nadzirin, Nurul and Nair, Sreenath and Patwardhan, Ardan and Paysan-Lafosse, Typhaine and Pravda, Lukas and Salih, Osman and Sehnal, David and Varadi, Mihaly and Vařeková, Radka and Markley, John L. and Hoch, Jeffrey C. and Romero, Pedro R. and Baskaran, Kumaran and Maziuk, Dimitri and Ulrich, Eldon L. and Wedell, Jonathan R. and Yao, Hongyang and Livny, Miron and Ioannidis, Yannis E.},
abstractNote = {The Protein Data Bank (PDB) is the single global archive of experimentally determined three-dimensional (3D) structure data of biological macromolecules. Since 2003, the PDB has been managed by the Worldwide Protein Data Bank (wwPDB; wwpdb.org), an international consortium that collaboratively oversees deposition, validation, biocuration, and open access dissemination of 3D macromolecular structure data. The PDB Core Archive houses 3D atomic coordinates of more than 144,000 structural models of proteins, DNA/RNA, and their complexes with metals and small molecules and related experimental data and metadata. Structure and experimental data/metadata are also stored in the PDB Core Archive using the readily extensible wwPDB PDBx/mmCIF master data format, which will continue to evolve as data/metadata from new experimental techniques and structure determination methods are incorporated by the wwPDB. Impacts of the recently developed universal wwPDB OneDep deposition/validation/biocuration system and various methods-specific wwPDB Validation Task Forces on improving the quality of structures and data housed in the PDB Core Archive are described together with current challenges and future plans.},
doi = {10.1093/nar/gky949},
journal = {Nucleic Acids Research},
number = D1,
volume = 47,
place = {United Kingdom},
year = {Wed Oct 24 00:00:00 EDT 2018},
month = {Wed Oct 24 00:00:00 EDT 2018}
}
https://doi.org/10.1093/nar/gky949
Web of Science
Works referenced in this record:
SASBDB, a repository for biological small-angle scattering data
journal, October 2014
- Valentini, Erica; Kikhney, Alexey G.; Previtali, Gianpietro
- Nucleic Acids Research, Vol. 43, Issue D1
STAR/mmCIF: An ontology for macromolecular structure
journal, February 2000
- Westbrook, J. D.; Bourne, P. E.
- Bioinformatics, Vol. 16, Issue 2
Report of the wwPDB Small-Angle Scattering Task Force: Data Requirements for Biomolecular Modeling and the PDB
journal, June 2013
- Trewhella, Jill; Hendrickson, Wayne A.; Kleywegt, Gerard J.
- Structure, Vol. 21, Issue 6
EMDB Web Resources
journal, March 2018
- Abbott, Sanja; Iudin, Andrii; Korir, Paul K.
- Current Protocols in Bioinformatics, Vol. 61, Issue 1
Protein Data Bank Japan (PDBj): maintaining a structural data archive and resource description framework format
journal, October 2011
- Kinjo, A. R.; Suzuki, H.; Yamashita, R.
- Nucleic Acids Research, Vol. 40, Issue D1
PDB-Dev: a Prototype System for Depositing Integrative/Hybrid Structural Models
journal, September 2017
- Burley, Stephen K.; Kurisu, Genji; Markley, John L.
- Structure, Vol. 25, Issue 9
Continuous Automated Model EvaluatiOn (CAMEO) complementing the critical assessment of structure prediction in CASP12
journal, December 2017
- Haas, Jürgen; Barbato, Alessandro; Behringer, Dario
- Proteins: Structure, Function, and Bioinformatics, Vol. 86
Announcing the worldwide Protein Data Bank
journal, December 2003
- Berman, Helen; Henrick, Kim; Nakamura, Haruki
- Nature Structural & Molecular Biology, Vol. 10, Issue 12
D3R Grand Challenge 2: blind prediction of protein–ligand poses, affinity rankings, and relative binding free energies
journal, December 2017
- Gaieb, Zied; Liu, Shuai; Gathiaka, Symon
- Journal of Computer-Aided Molecular Design, Vol. 32, Issue 1
Collaboration gets the most out of software
journal, September 2013
- Morin, Andrew; Eisenbraun, Ben; Key, Jason
- eLife, Vol. 2
New electron microscopy database and deposition system
journal, November 2002
- Tagari, Mohamed; Newman, Richard; Chagoyen, Monica
- Trends in Biochemical Sciences, Vol. 27, Issue 11
A New Generation of Crystallographic Validation Tools for the Protein Data Bank
journal, October 2011
- Read, Randy J.; Adams, Paul D.; Arendall, W. Bryan
- Structure, Vol. 19, Issue 10
STAR/CIF macromolecular NMR data dictionaries and data file formats
journal, August 1996
- Ulrich, E. L.; Argentar, D.; Klimowicz, A.
- Acta Crystallographica Section A Foundations of Crystallography, Vol. 52, Issue a1
Outcome of the First wwPDB/CCDC/D3R Ligand Validation Workshop
journal, April 2016
- Adams, Paul D.; Aertgeerts, Kathleen; Bauer, Cary
- Structure, Vol. 24, Issue 4
EMPIAR: a public archive for raw electron microscopy image data
journal, March 2016
- Iudin, Andrii; Korir, Paul K.; Salavert-Torres, José
- Nature Methods, Vol. 13, Issue 5
The Cambridge Structural Database in Retrospect and Prospect
journal, January 2014
- Groom, Colin R.; Allen, Frank H.
- Angewandte Chemie International Edition, Vol. 53, Issue 3
Crystallography: Protein Data Bank
journal, October 1971
- Skipper, Magdalena
- Nature New Biology, Vol. 233, Issue 42, p. 223-223
The FAIR Guiding Principles for scientific data management and stewardship
journal, March 2016
- Wilkinson, Mark D.; Dumontier, Michel; Aalbersberg, IJsbrand Jan
- Scientific Data, Vol. 3, Issue 1
PDBe: towards reusable data delivery infrastructure at protein data bank in Europe
journal, November 2017
- Mir, Saqib; Alhroub, Younes; Anyango, Stephen
- Nucleic Acids Research, Vol. 46, Issue D1
BioMagResBank
journal, December 2007
- Ulrich, E. L.; Akutsu, H.; Doreleijers, J. F.
- Nucleic Acids Research, Vol. 36, Issue Database
Validation of Structures in the Protein Data Bank
journal, December 2017
- Gore, Swanand; Sanz García, Eduardo; Hendrickx, Pieter M. S.
- Structure, Vol. 25, Issue 12
NMR Exchange Format: a unified and open standard for representation of NMR restraint data
journal, June 2015
- Gutmanas, Aleksandras; Adams, Paul D.; Bardiaux, Benjamin
- Nature Structural & Molecular Biology, Vol. 22, Issue 6
Remediation of the protein data bank archive
journal, December 2007
- Henrick, K.; Feng, Z.; Bluhm, W. F.
- Nucleic Acids Research, Vol. 36, Issue Database
OneDep: Unified wwPDB System for Deposition, Biocuration, and Validation of Macromolecular Structures in the PDB Archive
journal, March 2017
- Young, Jasmine Y.; Westbrook, John D.; Feng, Zukang
- Structure, Vol. 25, Issue 3
A public database of macromolecular diffraction experiments
journal, October 2016
- Grabowski, Marek; Langner, Karol M.; Cymborowski, Marcin
- Acta Crystallographica Section D Structural Biology, Vol. 72, Issue 11
Development of a Prototype System for Archiving Integrative/Hybrid Structure Models of Biological Macromolecules
journal, June 2018
- Vallat, Brinda; Webb, Benjamin; Westbrook, John D.
- Structure, Vol. 26, Issue 6
Evaluation of the template-based modeling in CASP12
journal, December 2017
- Kryshtafovych, Andriy; Monastyrskyy, Bohdan; Fidelis, Krzysztof
- Proteins: Structure, Function, and Bioinformatics, Vol. 86
The chemical component dictionary: complete descriptions of constituent molecules in experimentally determined 3D macromolecules in the Protein Data Bank
journal, December 2014
- Westbrook, John D.; Shao, Chenghua; Feng, Zukang
- Bioinformatics, Vol. 31, Issue 8
Outcome of the First Electron Microscopy Validation Task Force Meeting
journal, February 2012
- Henderson, Richard; Sali, Andrej; Baker, Matthew L.
- Structure, Vol. 20, Issue 2
Protein Data Bank Japan (PDBj): updated user interfaces, resource description framework, analysis tools for large structures
journal, October 2016
- Kinjo, Akira R.; Bekker, Gert-Jan; Suzuki, Hirofumi
- Nucleic Acids Research, Vol. 45, Issue D1
Outcome of the First wwPDB Hybrid/Integrative Methods Task Force Workshop
journal, July 2015
- Sali, Andrej; Berman, Helen M.; Schwede, Torsten
- Structure, Vol. 23, Issue 7
The challenge of modeling protein assemblies: the CASP12-CAPRI experiment
journal, November 2017
- Lensink, Marc F.; Velankar, Sameer; Baek, Minkyung
- Proteins: Structure, Function, and Bioinformatics, Vol. 86
PDBML: the representation of archival macromolecular structure data in XML
journal, October 2004
- Westbrook, J.; Ito, N.; Nakamura, H.
- Bioinformatics, Vol. 21, Issue 7
Implementing an X-ray validation pipeline for the Protein Data Bank
journal, March 2012
- Gore, Swanand; Velankar, Sameer; Kleywegt, Gerard J.
- Acta Crystallographica Section D Biological Crystallography, Vol. 68, Issue 4
Recommendations of the wwPDB NMR Validation Task Force
journal, September 2013
- Montelione, Gaetano T.; Nilges, Michael; Bax, Ad
- Structure, Vol. 21, Issue 9
Worldwide Protein Data Bank biocuration supporting open access to high-quality 3D structural biology data
journal, January 2018
- Young, Jasmine Y.; Westbrook, John D.; Feng, Zukang
- Database, Vol. 2018
Chemical annotation of small and peptide-like molecules at the Protein Data Bank
journal, January 2013
- Young, Jasmine Y.; Feng, Zukang; Dimitropoulos, Dimitris
- Database, Vol. 2013