DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Towards a fully automated algorithm driven platform for biosystems design

Journal Article · · Nature Communications
ORCiD logo [1];  [1]; ORCiD logo [2]; ORCiD logo [3];  [4]; ORCiD logo [5]
  1. Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Chemical and Biomolecular Engineering and Carl R. Woese Inst. for Genomic Biology; LifeFoundry Inc., Champaign, IL (United States)
  2. Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Biochemistry
  3. Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Chemical and Biomolecular Engineering; Zhejiang Univ., Hangzhou (China). Key Lab. of Biomass Chemical Engineering of Ministry of Education, College of Chemical and Biological Engineering
  4. Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Carl R. Woese Inst. for Genomic Biology and Dept. of Computer Science
  5. Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Chemical and Biomolecular Engineering, Carl R. Woese Inst. for Genomic Biology and Dept. of Chemistry and Bioengineering

Large-scale data acquisition and analysis are often required in the successful implementation of the design, build, test, and learn (DBTL) cycle in biosystems design. However, it has long been hindered by experimental cost, variability, biases, and missed insights from traditional analysis methods. Here, we report the application of an integrated robotic system coupled with machine learning algorithms to fully automate the DBTL process for biosystems design. As proof of concept, we have demonstrated its capacity by optimizing the lycopene biosynthetic pathway. This fully-automated robotic platform, BioAutomata, evaluates less than 1% of possible variants while outperforming random screening by 77%. A paired predictive model and Bayesian algorithm select experiments which are performed by Illinois Biological Foundry for Advanced Biomanufacturing (iBioFAB). BioAutomata excels with black-box optimization problems, where experiments are expensive and noisy and the success of the experiment is not dependent on extensive prior knowledge of biological mechanisms.

Research Organization:
Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER); USDOE
Grant/Contract Number:
SC0018420
OSTI ID:
1619591
Alternate ID(s):
OSTI ID: 1575393; OSTI ID: 1575394
Journal Information:
Nature Communications, Vol. 10, Issue 1; ISSN 2041-1723
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 73 works
Citation information provided by
Web of Science

References (106)

Learned protein embeddings for machine learning journal March 2018
Functional genomic hypothesis generation and experimentation by a robot scientist journal January 2004
Efficient hyperparameter optimization by using Bayesian optimization for drug-target interaction prediction
  • Ban, Tomohiro; Ohue, Masahito; Akiyama, Yutaka
  • 2017 IEEE 7th International Conference on Computational Advances in Bio- and Medical Sciences (ICCABS), 2017 IEEE 7th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS) https://doi.org/10.1109/ICCABS.2017.8114299
conference October 2017
Rapid protein-folding assay using green fluorescent protein journal July 1999
Customized optimization of metabolic pathways by combinatorial transcriptional engineering journal June 2012
Heteroscedastic Gaussian process regression conference January 2005
Automated multiplex genome-scale engineering in yeast journal May 2017
Engineering microbial factories for synthesis of value-added products journal April 2011
Modular optimization of multi-gene pathways for fatty acids production in E. coli journal January 2013
Controlling the Metabolic Flux through the Carotenoid Pathway Using Directed mRNA Processing and Stabilization journal October 2001
A New Method of Locating the Maximum Point of an Arbitrary Multipeak Curve in the Presence of Noise journal March 1964
Navigating the protein fitness landscape with Gaussian processes journal December 2012
Advances in metabolic pathway and strain engineering paving the way for sustainable production of chemical building blocks journal December 2013
Improving lycopene production in Escherichia coli by engineering metabolic control journal May 2000
Exploring Synthetic and Systems Biology at the University of Edinburgh journal June 2016
Expression-level optimization of a multi-enzyme pathway in the absence of a high-throughput assay journal September 2013
Engineering the third wave of biocatalysis journal May 2012
The Synthetic Biology Open Language (SBOL) provides a community standard for communicating designs in synthetic biology journal June 2014
High level production of tyrosinase in recombinant Escherichia coli journal February 2013
Combinatorial engineering of intergenic regions in operons tunes expression of multiple genes journal July 2006
Modular control of multiple pathways using engineered orthogonal T7 polymerases journal June 2012
Efficient search, mapping, and optimization of multi-protein genetic systems in diverse bacteria journal June 2014
A Taxonomy of Global Optimization Methods Based on Response Surfaces journal December 2001
Enzymatic assembly of DNA molecules up to several hundred kilobases journal April 2009
CIDAR MoClo: Improved MoClo Assembly Standard and New E. coli Part Library Enable Rapid Combinatorial Design for Synthetic and Traditional Biology journal November 2015
Lipid engineering combined with systematic metabolic engineering of Saccharomyces cerevisiae for high-yield production of lycopene journal March 2019
Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization journal October 2017
Host and Pathway Engineering for Enhanced Lycopene Biosynthesis in Yarrowia lipolytica journal November 2017
Regression on manifolds: Estimation of the exterior derivative journal February 2011
Highly Efficient Single-Pot Scarless Golden Gate Assembly journal April 2019
Reinforcement Learning Based Adaptive Sampling: REAPing Rewards by Exploring Protein Conformational Landscapes journal August 2018
A method for efficient Bayesian optimization of self-assembly systems from scattering data journal June 2018
Engineering biological systems using automated biofoundries journal July 2017
Robust optimization of SVM hyperparameters in the classification of bioactive compounds journal August 2015
Application of Bayesian approach to numerical methods of global and stochastic optimization journal June 1994
Improving Metabolic Pathway Efficiency by Statistical Model-Based Multivariate Regulatory Metabolic Engineering journal August 2016
Lycopene overproduction and in situ extraction in organic-aqueous culture systems using a metabolically engineered Escherichia coli journal September 2015
Automated design of synthetic ribosome binding sites to control protein expression journal October 2009
Phoenics: A Bayesian Optimizer for Chemistry journal August 2018
Metabolic engineering of the nonmevalonate isopentenyl diphosphate synthesis pathway inEscherichia coli enhances lycopene production journal January 2001
Toward metabolic engineering in the context of system biology and synthetic biology: advances and prospects journal December 2014
A Highly Characterized Yeast Toolkit for Modular, Multipart Assembly journal April 2015
Novel reference genes for quantifying transcriptional responses of Escherichia coli to protein overexpression by quantitative PCR journal January 2011
Application of Bayesian Optimization for Pharmaceutical Product Development journal March 2019
Sharing Structure and Function in Biological Design with SBOL 2.0 journal May 2016
Lessons from Two Design–Build–Test–Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning journal May 2019
Construction of lycopene-overproducing E. coli strains by combining systematic and combinatorial gene knockout targets journal April 2005
Bayesian optimization for genomic selection: a method for discovering the best genotype among a large number of candidates journal October 2017
Building biological foundries for next-generation synthetic biology journal May 2015
Gaussian Processes for Machine Learning book January 2005
SBOL Visual: A Graphical Language for Genetic Designs journal December 2015
The Automation of Science journal April 2009
A Survey on Transfer Learning journal October 2010
Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision journal May 2018
Metabolic pathway optimization using ribosome binding site variants and combinatorial gene assembly journal November 2013
Combinatorial pathway engineering for optimized production of the anti-malarial FR900098: Combinatorial Engineering of FR900098 Biosynthetic Pathway journal September 2015
Production of lycopene by metabolically-engineered Escherichia coli journal May 2014
High-Throughput Metabolic Engineering: Advances in Small-Molecule Screening and Selection journal June 2010
FairyTALE: A High-Throughput TAL Effector Synthesis Platform journal September 2013
Construction of plasmids with tunable copy numbers in Saccharomyces cerevisiae and their applications in pathway optimization and multiplex genome integration : Plasmid Copy Number Engineering journal June 2016
Global transcription machinery engineering: A new approach for improving cellular phenotype journal May 2007
Engineering Cellular Metabolism journal March 2016
Fully Automated One-Step Synthesis of Single-Transcript TALEN Pairs Using a Biological Foundry journal January 2017
Learned protein embeddings for machine learning journal June 2018
REinforcement learning based Adaptive samPling: REAPing Rewards by Exploring Protein Conformational Landscapes preprint January 2017
Combinatorial pathway engineering for optimized production of the anti-malarial FR900098: Combinatorial Engineering of FR900098 Biosynthetic Pathway journal September 2015
Construction of plasmids with tunable copy numbers in Saccharomyces cerevisiae and their applications in pathway optimization and multiplex genome integration : Plasmid Copy Number Engineering journal June 2016
Controlling the Metabolic Flux through the Carotenoid Pathway Using Directed mRNA Processing and Stabilization journal October 2001
Metabolic pathway optimization using ribosome binding site variants and combinatorial gene assembly journal November 2013
Toward metabolic engineering in the context of system biology and synthetic biology: advances and prospects journal December 2014
Engineering microbial factories for synthesis of value-added products journal April 2011
Production of lycopene by metabolically-engineered Escherichia coli journal May 2014
Building biological foundries for next-generation synthetic biology journal May 2015
Engineering Cellular Metabolism journal March 2016
Advances in metabolic pathway and strain engineering paving the way for sustainable production of chemical building blocks journal December 2013
Global transcription machinery engineering: A new approach for improving cellular phenotype journal May 2007
Engineering biological systems using automated biofoundries journal July 2017
Expression of prokaryotic 1-deoxy- d -xylulose-5-phosphatases in Escherichia coli increases carotenoid and ubiquinone biosynthesis journal April 1999
Reinforcement Learning Based Adaptive Sampling: REAPing Rewards by Exploring Protein Conformational Landscapes journal August 2018
Phoenics: A Bayesian Optimizer for Chemistry journal August 2018
CIDAR MoClo: Improved MoClo Assembly Standard and New E. coli Part Library Enable Rapid Combinatorial Design for Synthetic and Traditional Biology journal November 2015
Improving Metabolic Pathway Efficiency by Statistical Model-Based Multivariate Regulatory Metabolic Engineering journal August 2016
Fully Automated One-Step Synthesis of Single-Transcript TALEN Pairs Using a Biological Foundry journal January 2017
Lessons from Two Design–Build–Test–Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning journal May 2019
FairyTALE: A High-Throughput TAL Effector Synthesis Platform journal September 2013
Rapid protein-folding assay using green fluorescent protein journal July 1999
Improving lycopene production in Escherichia coli by engineering metabolic control journal May 2000
Functional genomic hypothesis generation and experimentation by a robot scientist journal January 2004
Engineering the third wave of biocatalysis journal May 2012
Automated design of synthetic ribosome binding sites to control protein expression journal October 2009
Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision journal May 2018
Combinatorial engineering of intergenic regions in operons tunes expression of multiple genes journal July 2006
Automated multiplex genome-scale engineering in yeast journal May 2017
Modular optimization of multi-gene pathways for fatty acids production in E. coli journal January 2013
Machine learning discovery of missing links that mediate alternative branches to plant alkaloids journal March 2022
Growth of E. coli on formate and methanol via the reductive glycine pathway journal February 2020
Learned protein embeddings for machine learning journal June 2018
Customized optimization of metabolic pathways by combinatorial transcriptional engineering journal June 2012
Modular control of multiple pathways using engineered orthogonal T7 polymerases journal June 2012
Expression-level optimization of a multi-enzyme pathway in the absence of a high-throughput assay journal September 2013
A New Method of Locating the Maximum Point of an Arbitrary Multipeak Curve in the Presence of Noise journal March 1964
The Automation of Science journal April 2009
High-Throughput Metabolic Engineering: Advances in Small-Molecule Screening and Selection journal June 2010
A method for efficient Bayesian optimization of self-assembly systems from scattering data journal June 2018
Robust optimization of SVM hyperparameters in the classification of bioactive compounds journal August 2015
SBOL Visual: A Graphical Language for Genetic Designs journal December 2015

Cited By (4)

Role of Digital Microfluidics in Enabling Access to Laboratory Automation and Making Biology Programmable journal June 2020
Homology-dependent recombination of large synthetic pathways into E. coli genome via λ-Red and CRISPR/Cas9 dependent selection methodology journal May 2020
Computational Tools for Discovering and Engineering Natural Product Biosynthetic Pathways journal January 2020
A machine learning Automated Recommendation Tool for synthetic biology journal September 2020