DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: A deep dilated convolutional residual network for predicting interchain contacts of protein homodimers

Journal Article · · Bioinformatics

Abstract Motivation Deep learning has revolutionized protein tertiary structure prediction recently. The cutting-edge deep learning methods such as AlphaFold can predict high-accuracy tertiary structures for most individual protein chains. However, the accuracy of predicting quaternary structures of protein complexes consisting of multiple chains is still relatively low due to lack of advanced deep learning methods in the field. Because interchain residue–residue contacts can be used as distance restraints to guide quaternary structure modeling, here we develop a deep dilated convolutional residual network method (DRCon) to predict interchain residue–residue contacts in homodimers from residue–residue co-evolutionary signals derived from multiple sequence alignments of monomers, intrachain residue–residue contacts of monomers extracted from true/predicted tertiary structures or predicted by deep learning, and other sequence and structural features. Results Tested on three homodimer test datasets (Homo_std dataset, DeepHomo dataset and CASP-CAPRI dataset), the precision of DRCon for top L/5 interchain contact predictions (L: length of monomer in a homodimer) is 43.46%, 47.10% and 33.50% respectively at 6 Å contact threshold, which is substantially better than DeepHomo and DNCON2_inter and similar to Glinter. Moreover, our experiments demonstrate that using predicted tertiary structure or intrachain contacts of monomers in the unbound state as input, DRCon still performs well, even though its accuracy is lower than using true tertiary structures in the bound state are used as input. Finally, our case study shows that good interchain contact predictions can be used to build high-accuracy quaternary structure models of homodimers. Availability and implementation The source code of DRCon is available at https://github.com/jianlin-cheng/DRCon. The datasets are available at https://zenodo.org/record/5998532#.YgF70vXMKsB. Supplementary information Supplementary data are available at Bioinformatics online.

Research Organization:
Donald Danforth Plant Science Center, St. Louis, MO (United States); University of Missouri, Columbia, MO (United States)
Sponsoring Organization:
National Institutes of Health; National Science Foundation; Thompson Missouri Distinguished Professorship; USDOE; USDOE Advanced Research Projects Agency - Energy (ARPA-E); USDOE Office of Science (SC); USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC05-00OR22725; AR0001213; SC0020400; SC0021303
OSTI ID:
1859808
Journal Information:
Bioinformatics, Journal Name: Bioinformatics Journal Issue: 7 Vol. 38; ISSN 1367-4803
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (41)

Scoring function for automated assessment of protein structure template quality journal January 2004
CASP prediction center infrastructure and evaluation measures in CASP10 and CASP ROLL: CASP Prediction Center journal October 2013
CONFOLD: Residue-residue contact-guided ab initio protein folding: Contact-Guided Protein Folding journal June 2015
Prediction of homoprotein and heteroprotein complexes by protein docking and template‐based modeling: A CASP‐CAPRI experiment journal June 2016
Critical assessment of methods of protein structure prediction: Progress and new directions in round XI: Progress in CASP XI journal June 2016
The challenge of modeling protein assemblies: the CASP12-CAPRI experiment journal November 2017
Critical assessment of methods of protein structure prediction (CASP)—Round XIII journal August 2019
Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13) journal August 2019
Assessment of protein model structure accuracy estimation in CASP14 : Old and new challenges journal August 2021
Prediction of protein assemblies, the next frontier: The CASP14‐CAPRI experiment journal September 2021
Protein–Protein Docking with Simultaneous Optimization of Rigid-body Displacement and Side-chain Conformations journal August 2003
HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment journal December 2011
Improved protein structure prediction using potentials from deep learning journal January 2020
Highly accurate protein structure prediction with AlphaFold journal July 2021
Highly accurate protein structure prediction for the human proteome journal July 2021
DNCON2_Inter: predicting interchain contacts for homodimeric and homomultimeric protein complexes using multiple sequence alignments of monomers and deep learning journal June 2021
Improved protein structure prediction using predicted interresidue orientations journal January 2020
Accurate prediction of inter-protein residue–residue contacts for homo-oligomeric protein complexes journal March 2021
InterPep2: global peptide–protein docking using interaction surface templates journal January 2020
Deep graph learning of inter-protein contacts journal November 2021
PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments journal November 2011
Predicting protein residue–residue contacts using deep networks and boosting journal October 2012
ZDOCK server: interactive docking prediction of protein-protein complexes and symmetric multimers journal February 2014
CCMpred—fast and precise prediction of protein residue–residue contacts from correlated mutations journal July 2014
UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches journal November 2014
DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins journal November 2019
SCRATCH: a protein structure and structural feature prediction server journal July 2005
TM-align: a protein structure alignment algorithm based on the TM-score journal April 2005
The RosettaDock server for local protein-protein docking journal May 2008
Protein-Protein Interaction Interface Residue Pair Prediction Based on Deep Learning Architecture journal September 2019
Accurate prediction of protein structures and interactions using a three-track neural network journal July 2021
Protein-protein docking using region-based 3D Zernike descriptors journal January 2009
Hidden Markov model speed heuristic and iterative HMM search procedure journal August 2010
Protein docking prediction using predicted protein-protein interface journal January 2012
DeepDist: real-value inter-residue distance prediction with deep residual convolutional network journal January 2021
3D Complex: A Structural Classification of Protein Complexes journal January 2006
Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model journal January 2017
DockQ: A Quality Measure for Protein-Protein Docking Models journal August 2016
DeepComplex: A Web Server of Predicting Protein Complex Structures by Deep Learning Inter-chain Contact Prediction and Distance-Based Modelling journal August 2021
Robust and accurate prediction of residue–residue interactions across protein interfaces using evolutionary information journal May 2014
Sequence co-evolution gives 3D contacts and structures of protein complexes journal September 2014