Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Classifying metal‐binding sites with neural networks

Journal Article · · Protein Science
DOI:https://doi.org/10.1002/pro.4591· OSTI ID:1958634
 [1];  [1];  [2];  [2];  [2];  [2]
  1. National Security Directorate Pacific Northwest National Laboratory Richland Washington USA
  2. Physical and Computational Sciences Directorate Pacific Northwest National Laboratory Richland Washington USA

Abstract

To advance our ability to predict impacts of the protein scaffold on catalysis, robust classification schemes to define features of proteins that will influence reactivity are needed. One of these features is a protein's metal‐binding ability, as metals are critical to catalytic conversion by metalloenzymes. As a step toward realizing this goal, we used convolutional neural networks (CNNs) to enable the classification of a metal cofactor binding pocket within a protein scaffold. CNNs enable images to be classified based on multiple levels of detail in the image, from edges and corners to entire objects, and can provide rapid classification. First, six CNN models were fine‐tuned to classify the 20 standard amino acids to choose a performant model for amino acid classification. This model was then trained in two parallel efforts: to classify a 2D image of the environment within a given radius of the central metal binding site, either an Fe ion or a [2Fe‐2S] cofactor, with the metal visible (effort 1) or the metal hidden (effort 2). We further used two sub‐classifications of the [2Fe‐2S] cofactor: (1) a standard [2Fe‐2S] cofactor and (2) a Rieske [2Fe‐2S] cofactor. The accuracy for the model correctly identifying all three defined features was >95%, despite our perception of the increased challenge of the metalloenzyme identification. This demonstrates that machine learning methodology to classify and distinguish similar metal‐binding sites, even in the absence of a visible cofactor, is indeed possible and offers an additional tool for metal‐binding site identification in proteins.

Sponsoring Organization:
USDOE
OSTI ID:
1958634
Alternate ID(s):
OSTI ID: 1958635
OSTI ID: 1961579
Journal Information:
Protein Science, Journal Name: Protein Science Journal Issue: 3 Vol. 32; ISSN 0961-8368
Publisher:
Wiley Blackwell (John Wiley & Sons)Copyright Statement
Country of Publication:
United Kingdom
Language:
English

References (37)

Machine learning techniques for protein function prediction journal October 2019
Iron-Sulfur Cluster Biosynthesis book January 2003
Classification of Proteins: Available Structural Space for Molecular Modeling book January 2011
Metal ions in biological catalysis: from enzyme databases to general principles journal July 2008
Catalysis by metallo-enzymes: The entatic state journal December 1971
Analysis of Catalytic Residues in Enzyme Active Sites journal November 2002
Molql: Towards a Common General Purpose Molecular Query Language journal February 2018
iProStruct2D: Identifying protein structural classes by deep learning via 2D representations journal March 2020
Identification of Iron-Sulfur (Fe-S) Cluster and Zinc (Zn) Binding Sites Within Proteomes Predicted by DeepMind’s AlphaFold2 Program Dramatically Expands the Metalloproteome journal January 2022
LmrR: A Privileged Scaffold for Artificial Metalloenzymes journal February 2019
Artificial Metalloenzymes: Reaction Scope and Optimization Strategies journal May 2017
Patterns of Ligands Coordinated to Metallocofactors Extracted from the Protein Data Bank journal November 2017
The Depth of Chemical Time and the Power of Enzymes as Catalysts journal December 2001
Electrostatic Basis for Enzyme Catalysis journal August 2006
Protein Design: Toward Functional Metalloenzymes journal January 2014
Structural and Functional Aspects of Metal Sites in Biology journal January 1996
Going beyond Structure: Nickel-Substituted Rubredoxin as a Mechanistic Model for the [NiFe] Hydrogenases journal July 2018
Kemp elimination catalysts by computational enzyme design journal March 2008
Announcing the worldwide Protein Data Bank journal December 2003
Highly accurate protein structure prediction with AlphaFold journal July 2021
Chicken fat for catalysis: a scaffold is as important for molecular complexes for energy transformations as it is for enzymes in catalytic function journal January 2019
Artificial metalloenzymes: proteins as hosts for enantioselective catalysis journal January 2005
Design of a single protein that spans the entire 2-V range of physiological redox potentials journal December 2015
The extended environment of mononuclear metal centers in protein structures journal December 1997
MetalPredator: a web server to predict iron–sulfur cluster binding proteomes journal June 2016
High precision protein functional site detection using 3D convolutional neural networks journal September 2018
MetalPDB: a database of metal sites in biological macromolecular structures journal November 2012
UniProt: a worldwide hub of protein knowledge November 2018
Learning to discriminate between ligand-bound and disulfide-bound cysteines journal May 2004
Similarity Analysis of 3D Structures of Proteins Based Tile-CNN journal January 2020
Characterizing and Predicting Catalytic Residues in Enzyme Active Sites Based on Local Properties: A Machine Learning Approach conference October 2007
ImageNet: A large-scale hierarchical image database
  • Deng, Jia; Dong, Wei; Socher, Richard
  • 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPR Workshops), 2009 IEEE Conference on Computer Vision and Pattern Recognition https://doi.org/10.1109/CVPR.2009.5206848
conference June 2009
Multi-view Convolutional Neural Networks for 3D Shape Recognition conference December 2015
From sequence to enzyme mechanism using multi-label machine learning journal May 2014
3D deep convolutional neural networks for amino acid environment similarity analysis journal June 2017
Formation of Unstable and very Reactive Chemical Species Catalyzed by Metalloenzymes: A Mechanistic Overview journal July 2019
Artificial Metalloenzymes: From Selective Chemical Transformations to Biochemical Applications journal June 2020

Similar Records

Classifying Metal-Binding Sites with Neural Networks
Dataset · Wed Sep 07 00:00:00 EDT 2022 · OSTI ID:1923007

Related Subjects