The value of protein structure classification information-Surveying the scientific literature
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Univ. of California, Berkeley, CA (United States)
The Structural Classification of Proteins (SCOP) and Class, Architecture, Topology, Homology (CATH) databases have been valuable resources for protein structure classification for over 20 years. Development of SCOP (version 1) concluded in June 2009 with SCOP 1.75. The SCOPe (SCOP-extended) database offers continued development of the classic SCOP hierarchy, adding over 33,000 structures. We have attempted to assess the impact of these two decade old resources and guide future development. To this end, we surveyed recent articles to learn how structure classification data are used. Of 571 articles published in 2012-2013 that cite SCOP, 439 actually use data from the resource. We found that the type of use was fairly evenly distributed among four top categories: A) study protein structure or evolution (27% of articles), B) train and/or benchmark algorithms (28% of articles), C) augment non-SCOP datasets with SCOP classification (21% of articles), and D) examine the classification of one protein/a small set of proteins (22% of articles). Most articles described computational research, although 11% described purely experimental research, and a further 9% included both. We examined how CATH and SCOP were used in 158 articles that cited both databases: while some studies used only one dataset, the majority used data from both resources. Protein structure classification remains highly relevant for a diverse range of problems and settings.
- Research Organization:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- National Institutes of Health; USDOE
- Grant/Contract Number:
- AC02-05CH11231
- OSTI ID:
- 1378622
- Journal Information:
- Proteins, Journal Name: Proteins Journal Issue: 11 Vol. 83; ISSN 0887-3585
- Publisher:
- WileyCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Protein Classification Based on Analysis of Local Sequence-Structure Correspondence
PROCOGNATE: a cognate ligand domain mapping for enzymes
SCOPe: Structural Classification of Proteins—extended, integrating SCOP and ASTRAL data and classification of new structures
Technical Report
·
Sun Feb 12 23:00:00 EST 2006
·
OSTI ID:893991
PROCOGNATE: a cognate ligand domain mapping for enzymes
Journal Article
·
Thu Aug 23 20:00:00 EDT 2007
· Nucleic Acids Research
·
OSTI ID:1625427
SCOPe: Structural Classification of Proteins—extended, integrating SCOP and ASTRAL data and classification of new structures
Journal Article
·
Mon Dec 02 19:00:00 EST 2013
· Nucleic Acids Research
·
OSTI ID:1625521