Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Robust deep learning–based protein sequence design using ProteinMPNN

Journal Article · · Science
 [1];  [1];  [1];  [1];  [1];  [1];  [1];  [1];  [2];  [1];  [1];  [1];  [1];  [1];  [1];  [1];  [1];  [1];  [3];  [1] more »;  [1];  [1] « less
  1. Univ. of Washington, Seattle, WA (United States)
  2. Wageningen Univ. (Netherlands)
  3. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Although deep learning has revolutionized protein structure prediction, almost all experimentally characterized de novo protein designs have been generated using physically based approaches such as Rosetta. Here, we describe a deep learning–based protein sequence design method, ProteinMPNN, that has outstanding performance in both in silico and experimental tests. On native protein backbones, ProteinMPNN has a sequence recovery of 52.4% compared with 32.9% for Rosetta. The amino acid sequence at different positions can be coupled between single or multiple chains, enabling application to a wide range of current protein design challenges. We demonstrate the broad utility and high accuracy of ProteinMPNN using x-ray crystallography, cryo–electron microscopy, and functional studies by rescuing previously failed designs, which were made using Rosetta or AlphaFold, of protein monomers, cyclic homo-oligomers, tetrahedral nanoparticles, and target-binding proteins.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
2470608
Journal Information:
Science, Journal Name: Science Journal Issue: 6615 Vol. 378; ISSN 0036-8075
Publisher:
AAASCopyright Statement
Country of Publication:
United States
Language:
English

References (26)

MolProbity: More and better reference data for improved all-atom structure validation: PROTEIN SCIENCE.ORG journal November 2017
ProDCoNN: Protein design using a convolutional neural network journal January 2020
CATH – a hierarchic classification of protein domain structures journal August 1997
Induction of Potent Neutralizing Antibody Responses by a Designed Protein Nanoparticle Vaccine for Respiratory Syncytial Virus journal March 2019
Elicitation of Potent Neutralizing Antibody Responses by Designed Protein Nanoparticle Vaccines for SARS-CoV-2 journal November 2020
Fast and Flexible Protein Design Using Deep Graph Neural Networks journal October 2020
DenseCPD: Improving the Accuracy of Neural-Network-Based Computational Protein Sequence Design with DenseNet journal March 2020
Accurate design of co-assembling multi-component protein nanomaterials journal May 2014
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets journal October 2017
Protein sequence design with a learned potential journal February 2022
Quadrivalent influenza nanoparticle vaccines induce broad protection journal March 2021
Highly accurate protein structure prediction with AlphaFold journal July 2021
Design of protein-binding proteins from the target structure alone journal March 2022
SNAC-tag for sequence-specific chemical protein cleavage journal March 2019
Macromolecular modeling and design in Rosetta: recent methods and frameworks journal June 2020
lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests journal August 2013
TM-align: a protein structure alignment algorithm based on the TM-score journal April 2005
Learning inverse folding from millions of predicted structures preprint September 2022
Phaser crystallographic software journal July 2007
Coot model-building tools for molecular graphics journal November 2004
XDS journal January 2010
PHENIX: a comprehensive Python-based system for macromolecular structure solution journal January 2010
Overview of the CCP 4 suite and current developments journal March 2011
Rethinking the Inception Architecture for Computer Vision conference June 2016
Accurate prediction of protein structures and interactions using a three-track neural network journal July 2021
Hallucinating symmetric protein assemblies journal October 2022

Similar Records

Rapid and automated design of two-component protein nanomaterials using ProteinMPNN
Journal Article · Mon Mar 18 20:00:00 EDT 2024 · Proceedings of the National Academy of Sciences of the United States of America · OSTI ID:2372865

Protein sequence design with a learned potential
Journal Article · Mon Feb 07 19:00:00 EST 2022 · Nature Communications · OSTI ID:1869814

De novo design of protein structure and function with RFdiffusion
Journal Article · Tue Jul 11 00:00:00 EDT 2023 · Nature (London) · OSTI ID:2420884

Related Subjects