DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Towards a fully automated algorithm driven platform for biosystems design

Abstract

Large-scale data acquisition and analysis are often required in the successful implementation of the design, build, test, and learn (DBTL) cycle in biosystems design. However, it has long been hindered by experimental cost, variability, biases, and missed insights from traditional analysis methods. Here, we report the application of an integrated robotic system coupled with machine learning algorithms to fully automate the DBTL process for biosystems design. As proof of concept, we have demonstrated its capacity by optimizing the lycopene biosynthetic pathway. This fully-automated robotic platform, BioAutomata, evaluates less than 1% of possible variants while outperforming random screening by 77%. A paired predictive model and Bayesian algorithm select experiments which are performed by Illinois Biological Foundry for Advanced Biomanufacturing (iBioFAB). BioAutomata excels with black-box optimization problems, where experiments are expensive and noisy and the success of the experiment is not dependent on extensive prior knowledge of biological mechanisms.

Authors:
ORCiD logo; ; ORCiD logo; ORCiD logo; ; ORCiD logo
Publication Date:
Research Org.:
Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
OSTI Identifier:
1619591
Alternate Identifier(s):
OSTI ID: 1575393; OSTI ID: 1575394
Grant/Contract Number:  
SC0018420
Resource Type:
Published Article
Journal Name:
Nature Communications
Additional Journal Information:
Journal Name: Nature Communications Journal Volume: 10 Journal Issue: 1; Journal ID: ISSN 2041-1723
Publisher:
Nature Publishing Group
Country of Publication:
United Kingdom
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; 97 MATHEMATICS AND COMPUTING

Citation Formats

HamediRad, Mohammad, Chao, Ran, Weisberg, Scott, Lian, Jiazhang, Sinha, Saurabh, and Zhao, Huimin. Towards a fully automated algorithm driven platform for biosystems design. United Kingdom: N. p., 2019. Web. doi:10.1038/s41467-019-13189-z.
HamediRad, Mohammad, Chao, Ran, Weisberg, Scott, Lian, Jiazhang, Sinha, Saurabh, & Zhao, Huimin. Towards a fully automated algorithm driven platform for biosystems design. United Kingdom. https://doi.org/10.1038/s41467-019-13189-z
HamediRad, Mohammad, Chao, Ran, Weisberg, Scott, Lian, Jiazhang, Sinha, Saurabh, and Zhao, Huimin. Wed . "Towards a fully automated algorithm driven platform for biosystems design". United Kingdom. https://doi.org/10.1038/s41467-019-13189-z.
@article{osti_1619591,
title = {Towards a fully automated algorithm driven platform for biosystems design},
author = {HamediRad, Mohammad and Chao, Ran and Weisberg, Scott and Lian, Jiazhang and Sinha, Saurabh and Zhao, Huimin},
abstractNote = {Large-scale data acquisition and analysis are often required in the successful implementation of the design, build, test, and learn (DBTL) cycle in biosystems design. However, it has long been hindered by experimental cost, variability, biases, and missed insights from traditional analysis methods. Here, we report the application of an integrated robotic system coupled with machine learning algorithms to fully automate the DBTL process for biosystems design. As proof of concept, we have demonstrated its capacity by optimizing the lycopene biosynthetic pathway. This fully-automated robotic platform, BioAutomata, evaluates less than 1% of possible variants while outperforming random screening by 77%. A paired predictive model and Bayesian algorithm select experiments which are performed by Illinois Biological Foundry for Advanced Biomanufacturing (iBioFAB). BioAutomata excels with black-box optimization problems, where experiments are expensive and noisy and the success of the experiment is not dependent on extensive prior knowledge of biological mechanisms.},
doi = {10.1038/s41467-019-13189-z},
journal = {Nature Communications},
number = 1,
volume = 10,
place = {United Kingdom},
year = {2019},
month = {11}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1038/s41467-019-13189-z

Citation Metrics:
Cited by: 53 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Learned protein embeddings for machine learning
journal, March 2018


Functional genomic hypothesis generation and experimentation by a robot scientist
journal, January 2004

  • King, Ross D.; Whelan, Kenneth E.; Jones, Ffion M.
  • Nature, Vol. 427, Issue 6971
  • DOI: 10.1038/nature02236

Efficient hyperparameter optimization by using Bayesian optimization for drug-target interaction prediction
conference, October 2017

  • Ban, Tomohiro; Ohue, Masahito; Akiyama, Yutaka
  • 2017 IEEE 7th International Conference on Computational Advances in Bio- and Medical Sciences (ICCABS), 2017 IEEE 7th International Conference on Computational Advances in Bio and Medical Sciences (ICCABS)
  • DOI: 10.1109/ICCABS.2017.8114299

Rapid protein-folding assay using green fluorescent protein
journal, July 1999

  • Waldo, Geoffrey S.; Standish, Blake M.; Berendzen, Joel
  • Nature Biotechnology, Vol. 17, Issue 7, p. 691-695
  • DOI: 10.1038/10904

Customized optimization of metabolic pathways by combinatorial transcriptional engineering
journal, June 2012

  • Du, Jing; Yuan, Yongbo; Si, Tong
  • Nucleic Acids Research, Vol. 40, Issue 18
  • DOI: 10.1093/nar/gks549

Heteroscedastic Gaussian process regression
conference, January 2005

  • Le, Quoc V.; Smola, Alex J.; Canu, Stéphane
  • Proceedings of the 22nd international conference on Machine learning - ICML '05
  • DOI: 10.1145/1102351.1102413

Automated multiplex genome-scale engineering in yeast
journal, May 2017

  • Si, Tong; Chao, Ran; Min, Yuhao
  • Nature Communications, Vol. 8, Issue 1
  • DOI: 10.1038/ncomms15187

Engineering microbial factories for synthesis of value-added products
journal, April 2011

  • Du, Jing; Shao, Zengyi; Zhao, Huimin
  • Journal of Industrial Microbiology & Biotechnology, Vol. 38, Issue 8
  • DOI: 10.1007/s10295-011-0970-3

Modular optimization of multi-gene pathways for fatty acids production in E. coli
journal, January 2013

  • Xu, Peng; Gu, Qin; Wang, Wenya
  • Nature Communications, Vol. 4, Issue 1
  • DOI: 10.1038/ncomms2425

Controlling the Metabolic Flux through the Carotenoid Pathway Using Directed mRNA Processing and Stabilization
journal, October 2001

  • Smolke, Christina D.; Martin, Vincent J. J.; Keasling, Jay D.
  • Metabolic Engineering, Vol. 3, Issue 4, p. 313-321
  • DOI: 10.1006/mben.2001.0194

A New Method of Locating the Maximum Point of an Arbitrary Multipeak Curve in the Presence of Noise
journal, March 1964

  • Kushner, H. J.
  • Journal of Basic Engineering, Vol. 86, Issue 1
  • DOI: 10.1115/1.3653121

Navigating the protein fitness landscape with Gaussian processes
journal, December 2012

  • Romero, P. A.; Krause, A.; Arnold, F. H.
  • Proceedings of the National Academy of Sciences, Vol. 110, Issue 3
  • DOI: 10.1073/pnas.1215251110

Advances in metabolic pathway and strain engineering paving the way for sustainable production of chemical building blocks
journal, December 2013


Improving lycopene production in Escherichia coli by engineering metabolic control
journal, May 2000

  • Farmer, William R.; Liao, James C.
  • Nature Biotechnology, Vol. 18, Issue 5
  • DOI: 10.1038/75398

Exploring Synthetic and Systems Biology at the University of Edinburgh
journal, June 2016

  • Fletcher, Liz; Rosser, Susan; Elfick, Alistair
  • Biochemical Society Transactions, Vol. 44, Issue 3
  • DOI: 10.1042/BST20160006

Expression-level optimization of a multi-enzyme pathway in the absence of a high-throughput assay
journal, September 2013

  • Lee, Michael E.; Aswani, Anil; Han, Audrey S.
  • Nucleic Acids Research, Vol. 41, Issue 22
  • DOI: 10.1093/nar/gkt809

Engineering the third wave of biocatalysis
journal, May 2012

  • Bornscheuer, U. T.; Huisman, G. W.; Kazlauskas, R. J.
  • Nature, Vol. 485, Issue 7397
  • DOI: 10.1038/nature11117

The Synthetic Biology Open Language (SBOL) provides a community standard for communicating designs in synthetic biology
journal, June 2014

  • Galdzicki, Michal; Clancy, Kevin P.; Oberortner, Ernst
  • Nature Biotechnology, Vol. 32, Issue 6
  • DOI: 10.1038/nbt.2891

High level production of tyrosinase in recombinant Escherichia coli
journal, February 2013


Combinatorial engineering of intergenic regions in operons tunes expression of multiple genes
journal, July 2006

  • Pfleger, Brian F.; Pitera, Douglas J.; Smolke, Christina D.
  • Nature Biotechnology, Vol. 24, Issue 8
  • DOI: 10.1038/nbt1226

Modular control of multiple pathways using engineered orthogonal T7 polymerases
journal, June 2012

  • Temme, Karsten; Hill, Rena; Segall-Shapiro, Thomas H.
  • Nucleic Acids Research, Vol. 40, Issue 17
  • DOI: 10.1093/nar/gks597

Efficient search, mapping, and optimization of multi-protein genetic systems in diverse bacteria
journal, June 2014

  • Farasat, I.; Kushwaha, M.; Collens, J.
  • Molecular Systems Biology, Vol. 10, Issue 6, p. 731-731
  • DOI: 10.15252/msb.20134955

Enzymatic assembly of DNA molecules up to several hundred kilobases
journal, April 2009

  • Gibson, Daniel G.; Young, Lei; Chuang, Ray-Yuan
  • Nature Methods, Vol. 6, Issue 5, p. 343-345
  • DOI: 10.1038/nmeth.1318

CIDAR MoClo: Improved MoClo Assembly Standard and New E. coli Part Library Enable Rapid Combinatorial Design for Synthetic and Traditional Biology
journal, November 2015


Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
journal, October 2017


Host and Pathway Engineering for Enhanced Lycopene Biosynthesis in Yarrowia lipolytica
journal, November 2017


Regression on manifolds: Estimation of the exterior derivative
journal, February 2011

  • Aswani, Anil; Bickel, Peter; Tomlin, Claire
  • The Annals of Statistics, Vol. 39, Issue 1
  • DOI: 10.1214/10-AOS823

Highly Efficient Single-Pot Scarless Golden Gate Assembly
journal, April 2019


Reinforcement Learning Based Adaptive Sampling: REAPing Rewards by Exploring Protein Conformational Landscapes
journal, August 2018

  • Shamsi, Zahra; Cheng, Kevin J.; Shukla, Diwakar
  • The Journal of Physical Chemistry B, Vol. 122, Issue 35
  • DOI: 10.1021/acs.jpcb.8b06521

A method for efficient Bayesian optimization of self-assembly systems from scattering data
journal, June 2018


Engineering biological systems using automated biofoundries
journal, July 2017


Robust optimization of SVM hyperparameters in the classification of bioactive compounds
journal, August 2015

  • Czarnecki, Wojciech M.; Podlewska, Sabina; Bojarski, Andrzej J.
  • Journal of Cheminformatics, Vol. 7, Issue 1
  • DOI: 10.1186/s13321-015-0088-0

Application of Bayesian approach to numerical methods of global and stochastic optimization
journal, June 1994


Improving Metabolic Pathway Efficiency by Statistical Model-Based Multivariate Regulatory Metabolic Engineering
journal, August 2016


Lycopene overproduction and in situ extraction in organic-aqueous culture systems using a metabolically engineered Escherichia coli
journal, September 2015


Automated design of synthetic ribosome binding sites to control protein expression
journal, October 2009

  • Salis, Howard M.; Mirsky, Ethan A.; Voigt, Christopher A.
  • Nature Biotechnology, Vol. 27, Issue 10, p. 946-950
  • DOI: 10.1038/nbt.1568

Phoenics: A Bayesian Optimizer for Chemistry
journal, August 2018


Toward metabolic engineering in the context of system biology and synthetic biology: advances and prospects
journal, December 2014

  • Liu, Yanfeng; Shin, Hyun-dong; Li, Jianghua
  • Applied Microbiology and Biotechnology, Vol. 99, Issue 3
  • DOI: 10.1007/s00253-014-6298-y

A Highly Characterized Yeast Toolkit for Modular, Multipart Assembly
journal, April 2015

  • Lee, Michael E.; DeLoache, William C.; Cervantes, Bernardo
  • ACS Synthetic Biology, Vol. 4, Issue 9
  • DOI: 10.1021/sb500366v

Novel reference genes for quantifying transcriptional responses of Escherichia coli to protein overexpression by quantitative PCR
journal, January 2011


Application of Bayesian Optimization for Pharmaceutical Product Development
journal, March 2019


Sharing Structure and Function in Biological Design with SBOL 2.0
journal, May 2016


Lessons from Two Design–Build–Test–Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning
journal, May 2019


Construction of lycopene-overproducing E. coli strains by combining systematic and combinatorial gene knockout targets
journal, April 2005

  • Alper, Hal; Miyaoku, Kohei; Stephanopoulos, Gregory
  • Nature Biotechnology, Vol. 23, Issue 5
  • DOI: 10.1038/nbt1083

Bayesian optimization for genomic selection: a method for discovering the best genotype among a large number of candidates
journal, October 2017


Building biological foundries for next-generation synthetic biology
journal, May 2015


Gaussian Processes for Machine Learning
book, January 2005


SBOL Visual: A Graphical Language for Genetic Designs
journal, December 2015


The Automation of Science
journal, April 2009


A Survey on Transfer Learning
journal, October 2010

  • Pan, Sinno Jialin; Yang, Qiang
  • IEEE Transactions on Knowledge and Data Engineering, Vol. 22, Issue 10
  • DOI: 10.1109/TKDE.2009.191

Genome-scale engineering of Saccharomyces cerevisiae with single-nucleotide precision
journal, May 2018

  • Bao, Zehua; HamediRad, Mohammad; Xue, Pu
  • Nature Biotechnology, Vol. 36, Issue 6
  • DOI: 10.1038/nbt.4132

Metabolic pathway optimization using ribosome binding site variants and combinatorial gene assembly
journal, November 2013

  • Nowroozi, Farnaz F.; Baidoo, Edward E. K.; Ermakov, Simon
  • Applied Microbiology and Biotechnology, Vol. 98, Issue 4
  • DOI: 10.1007/s00253-013-5361-4

Combinatorial pathway engineering for optimized production of the anti-malarial FR900098: Combinatorial Engineering of FR900098 Biosynthetic Pathway
journal, September 2015

  • Freestone, Todd S.; Zhao, Huimin
  • Biotechnology and Bioengineering, Vol. 113, Issue 2
  • DOI: 10.1002/bit.25719

Production of lycopene by metabolically-engineered Escherichia coli
journal, May 2014


High-Throughput Metabolic Engineering: Advances in Small-Molecule Screening and Selection
journal, June 2010


FairyTALE: A High-Throughput TAL Effector Synthesis Platform
journal, September 2013

  • Liang, Jing; Chao, Ran; Abil, Zhanar
  • ACS Synthetic Biology, Vol. 3, Issue 2
  • DOI: 10.1021/sb400109p

Global transcription machinery engineering: A new approach for improving cellular phenotype
journal, May 2007


Engineering Cellular Metabolism
journal, March 2016


Fully Automated One-Step Synthesis of Single-Transcript TALEN Pairs Using a Biological Foundry
journal, January 2017