DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: SCOPe: classification of large macromolecular structures in the structural classification of proteins—extended database

Journal Article · Nucleic Acids Research
[1]; [2]; [3]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Genomics and Systems Biology Division; Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Molecular Biophysics and Integrated Bioimaging Division; DOE/OSTI
  2. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Genomics and Systems Biology Division; Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Molecular Biophysics and Integrated Bioimaging Division
  3. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Environmental Genomics and Systems Biology Division; Univ. of California, Berkeley, CA (United States). Department of Plant and Microbial Biology

The SCOPe (Structural Classification of Proteins—extended, https://scop.berkeley.edu) database hierarchically classifies domains from the majority of proteins of known structure according to their structural and evolutionary relationships. SCOPe also incorporates and updates the ASTRAL compendium, which provides multiple databases and tools to aid in the analysis of the sequences and structures of proteins classified in SCOPe. Protein structures are classified using a combination of manual curation and highly precise automated methods. In the current release of SCOPe, 2.07, we have focused our manual curation efforts on larger protein structures, including the spliceosome, proteasome and RNA polymerase I, as well as many other Pfam families that had not previously been classified. Domains from these large protein complexes are distinctive in several ways: novel non-globular folds are more common, and domains from previously observed protein families often have N- or C-terminal extensions that were disordered or not present in previous structures. The current monthly release update, SCOPe 2.07-2018-10-18, classifies 90,992 PDB entries (about two-thirds of PDB entries).

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1625572
Journal Information:
Nucleic Acids Research, Vol. 47, Issue D1; ISSN 0305-1048
Publisher:
Oxford University Press
Country of Publication:
United States
Language:
English

Cited By (11)

Targeting adenylate-forming enzymes with designed sulfonyladenosine inhibitors journal April 2019
The human DEPhOsphorylation Database DEPOD: 2019 update journal January 2019
The Nature and Arrangement of Pentatricopeptide Domains and the Linker Sequences Between Them journal January 2020
Discovery of several thousand highly diverse circular DNA viruses journal article January 2020
Sequence and Structure Properties Uncover the Natural Classification of Protein Complexes Formed by Intrinsically Disordered Proteins via Mutual Synergistic Folding journal November 2019
Novel Network Science Approaches for a Better Understanding of Protein Folding and Human Aging text January 2021
DALI and the persistence of protein shape journal November 2019
Discovery of several thousand highly diverse circular DNA viruses text January 2020
Discovery of several thousand highly diverse circular DNA viruses journal February 2020