Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

A generalized platform for artificial intelligence-powered autonomous enzyme engineering

Journal Article · · Nature Communications
Proteins are the molecular machines of life with numerous applications in energy, health, and sustainability. However, engineering proteins with desired functions for practical applications remains slow, expensive, and specialist-dependent. Here we report a generally applicable platform for autonomous enzyme engineering that integrates machine learning and large language models with biofoundry automation to eliminate the need for human intervention, judgement, and domain expertise. Requiring only an input protein sequence and a quantifiable way to measure fitness, this automated platform can be applied to engineer a wide array of proteins. As a proof of concept, we engineer Arabidopsis thaliana halide methyltransferase (AtHMT) for a 90-fold improvement in substrate preference and 16-fold improvement in ethyltransferase activity, along with developing a Yersinia mollaretii phytase (YmPhytase) variant with 26-fold improvement in activity at neutral pH. This is accomplished in four rounds over 4 weeks, while requiring construction and characterization of fewer than 500 variants for each enzyme. This platform for autonomous experimentation paves the way for rapid advancements across diverse industries, from medicine and biotechnology to renewable energy and sustainable chemistry.
Research Organization:
University of Illinois Urbana-Champaign, IL (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
SC0018420
OSTI ID:
2574786
Journal Information:
Nature Communications, Journal Name: Nature Communications Journal Issue: 1 Vol. 16; ISSN 2041-1723
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English

References (50)

Profound Methyl Effects in Drug Discovery and a Call for New CH Methylation Reactions journal October 2013
Directed Evolution: Bringing New Chemistry to Life journal November 2017
Directed Evolution of a Halide Methyltransferase Enables Biocatalytic Synthesis of Diverse SAM Analogs journal November 2020
From Natural Methylation to Versatile Alkylations Using Halide Methyltransferases journal May 2021
Biotechnological production and applications of phytases journal July 2005
Directed evolution of a highly active Yersinia mollaretii phytase journal December 2011
Directed evolution of an acid Yersinia mollaretii phytase for broadened activity at neutral pH journal August 2018
Directed evolution of industrial enzymes journal April 1999
Catalyzing the future: recent advances in chemical synthesis using enzymes journal December 2024
Informed training set design enables efficient machine learning-assisted directed protein evolution journal August 2021
In vitro continuous protein evolution empowered by machine learning and automation journal August 2023
Self-resistance-gene-guided, high-throughput automated genome mining of bioactive natural products from Streptomyces journal March 2025
Perspectives for self-driving labs in synthetic biology journal February 2023
Engineering of halide methyltransferases for synthesis of SAE and its application in biosynthesis of ethyl vanillin journal November 2023
Machine learning for metabolic engineering: A review journal January 2021
Directed Evolution: Methodologies and Applications journal July 2021
Self-Driving Laboratories for Chemistry and Materials Science journal August 2024
Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering journal February 2024
Enabling Biocatalysis by High-Throughput Protein Engineering Using Droplet Microfluidics Coupled to Mass Spectrometry journal February 2018
Physical Laboratory Automation in Synthetic Biology journal November 2023
Artificial Intelligence Methods and Models for Retro-Biosynthesis: A Scoping Review journal July 2024
Fully Automated One-Step Synthesis of Single-Transcript TALEN Pairs Using a Biological Foundry journal January 2017
Opportunities at the Intersection of Synthetic Biology, Machine Learning, and Automation journal July 2019
An operating system for the biology lab journal September 2019
Mutation effects predicted from sequence co-variation journal January 2017
Automated multiplex genome-scale engineering in yeast journal May 2017
Exploring protein fitness landscapes by directed evolution journal December 2009
Protein engineering 20 years on journal December 2002
Towards a fully automated algorithm driven platform for biosystems design journal November 2019
A machine learning Automated Recommendation Tool for synthetic biology journal September 2020
PlasmidMaker is a versatile, automated, and high throughput end-to-end platform for plasmid construction journal May 2022
A scalable platform to discover antimicrobials of ribosomal origin journal October 2022
A mobile robotic chemist journal July 2020
An autonomous laboratory for the accelerated synthesis of novel materials journal November 2023
Autonomous chemical research with large language models journal December 2023
Accurate structure prediction of biomolecular interactions with AlphaFold 3 journal May 2024
Autonomous mobile robots for exploratory synthetic chemistry journal November 2024
Learning protein fitness models from evolutionary and assay-labeled data journal January 2022
Enabling high-throughput enzyme discovery and engineering with a low-cost, robot-assisted pipeline journal June 2024
Bayesian statistics and modelling journal January 2021
Self-driving laboratories to autonomously navigate the protein fitness landscape journal January 2024
Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences journal April 2021
Protein engineering via Bayesian optimization-guided evolutionary algorithm and robotic experiments journal December 2022
Expert-guided protein language models enable accurate and blazingly fast fitness prediction journal November 2024
The EVcouplings Python framework for coevolutionary sequence analysis journal October 2018
Language models enable zero-shot prediction of the effects of mutations on protein function preprint November 2021
Self-driving laboratory for accelerated discovery of thin-film materials journal May 2020
The Automation of Science journal April 2009
Evolutionary-scale prediction of atomic-level protein structure with a language model journal March 2023
Comparison of High-Dimensional Bayesian Optimization Algorithms on BBOB
  • Santoni, Maria Laura; Raponi, Elena; Leone, Renato De
  • ACM Transactions on Evolutionary Learning and Optimization, Vol. 4, Issue 3 https://doi.org/10.1145/3670683
journal July 2024

Similar Records

Self-driving laboratories to autonomously navigate the protein fitness landscape
Journal Article · Wed Jan 10 19:00:00 EST 2024 · Nature Chemical Engineering · OSTI ID:2281045

Enzyme property prediction using artificial intelligence
Journal Article · Sun Dec 21 19:00:00 EST 2025 · Current Opinion in Chemical Engineering · OSTI ID:3028996

Artificial Intelligence for Autonomous Molecular Design: A Perspective
Journal Article · Mon Nov 08 19:00:00 EST 2021 · Molecules · OSTI ID:1829517