DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: AF2Complex predicts direct physical interactions in multimeric proteins with deep learning

Journal Article · Nature Communications

Accurate descriptions of protein-protein interactions are essential for understanding biological systems. Remarkably accurate atomic structures have been recently computed for individual proteins by AlphaFold2 (AF2). Here, we demonstrate that the same neural network models from AF2 developed for single protein sequences can be adapted to predict the structures of multimeric protein complexes without retraining. In contrast to common approaches, our method, AF2Complex, does not require paired multiple sequence alignments. It achieves higher accuracy than some complex protein-protein docking strategies and provides a significant improvement over AF-Multimer, a development of AlphaFold for multimeric proteins. Moreover, we introduce metrics for predicting direct protein-protein interactions between arbitrary protein pairs and validate AF2Complex on some challenging benchmark sets and the E. coli proteome. Lastly, using the cytochrome c biogenesis system I as an example, we present high-confidence models of three sought-after assemblies formed by eight members of this system.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER); National Institutes of Health (NIH); USDOE
Grant/Contract Number:
AC05-00OR22725; SC0021303; R35GM118039
OSTI ID:
1860537
Alternate ID(s):
OSTI ID: 1862138
Journal Information:
Nature Communications, Vol. 13, Issue 1; ISSN 2041-1723
Publisher:
Nature Publishing Group
Country of Publication:
United States
Language:
English
