Towards a fully automated algorithm driven platform for biosystems design
Abstract
Successful implementation of the design, build, test, and learn (DBTL) cycle in biosystems design often requires large-scale data acquisition and analysis. However, the cycle has long been hindered by experimental cost, variability, biases, and insights missed by traditional analysis methods. Here, we report the application of an integrated robotic system coupled with machine learning algorithms to fully automate the DBTL process for biosystems design. As proof of concept, we demonstrate its capacity by optimizing the lycopene biosynthetic pathway. This fully automated robotic platform, BioAutomata, evaluates fewer than 1% of possible variants while outperforming random screening by 77%. A paired predictive model and Bayesian algorithm select experiments, which are performed by the Illinois Biological Foundry for Advanced Biomanufacturing (iBioFAB). BioAutomata excels at black-box optimization problems, where experiments are expensive and noisy and success does not depend on extensive prior knowledge of biological mechanisms.
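The pairing of a predictive model with a Bayesian selection algorithm can be illustrated with a minimal sketch. It is not the BioAutomata implementation: it assumes a Gaussian process with a squared-exponential kernel as the predictive model and expected improvement as the selection rule, and the objective `toy_titer`, the 21 x 21 grid of two hypothetical expression levels, and all parameter values are invented for illustration only.

```python
import math
import numpy as np

def rbf_kernel(a, b, length_scale=0.2):
    """Squared-exponential kernel between two sets of row vectors."""
    sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-0.5 * sq_dist / length_scale ** 2)

def gp_posterior(x_obs, y_obs, x_new, noise=1e-2):
    """Posterior mean and std of a zero-mean, unit-variance GP at x_new."""
    k_oo = rbf_kernel(x_obs, x_obs) + noise * np.eye(len(x_obs))
    k_on = rbf_kernel(x_obs, x_new)
    # Solve once for both the mean and the variance terms.
    solved = np.linalg.solve(k_oo, np.column_stack([y_obs, k_on]))
    mean = k_on.T @ solved[:, 0]
    var = 1.0 - np.einsum("ij,ij->j", k_on, solved[:, 1:])
    return mean, np.sqrt(np.clip(var, 1e-12, None))

def expected_improvement(mean, std, best):
    """Expected improvement over `best` for a maximization problem."""
    z = (mean - best) / std
    cdf = 0.5 * (1.0 + np.vectorize(math.erf)(z / math.sqrt(2.0)))
    pdf = np.exp(-0.5 * z ** 2) / math.sqrt(2.0 * math.pi)
    return (mean - best) * cdf + std * pdf

def bayes_opt(measure, candidates, n_init=4, n_rounds=12, seed=0):
    """Pick experiments one at a time from a finite set of candidates."""
    rng = np.random.default_rng(seed)
    tested = list(rng.choice(len(candidates), size=n_init, replace=False))
    scores = [measure(candidates[i]) for i in tested]
    for _ in range(n_rounds):
        y = np.array(scores)
        centered = y - y.mean()  # the GP above assumes a zero mean
        mean, std = gp_posterior(candidates[tested], centered, candidates)
        ei = expected_improvement(mean, std, centered.max())
        ei[tested] = -np.inf  # never repeat an experiment
        nxt = int(np.argmax(ei))
        tested.append(nxt)
        scores.append(measure(candidates[nxt]))
    return tested, scores

# Hypothetical noisy objective standing in for a measured product titer.
noise_rng = np.random.default_rng(1)

def toy_titer(x):
    signal = math.exp(-((x[0] - 0.7) ** 2 + (x[1] - 0.3) ** 2) / 0.05)
    return signal + 0.05 * noise_rng.normal()

axis = np.linspace(0.0, 1.0, 21)
grid = np.array([(a, b) for a in axis for b in axis])
tested, scores = bayes_opt(toy_titer, grid)
best_design = grid[tested[int(np.argmax(scores))]]
```

In this sketch each round refits the model to all measurements so far and then tests the single candidate with the highest expected improvement, which is what lets such a loop sample only a small part of the design space. A batch-oriented robotic workflow would select several candidates per round; that extension is omitted here.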
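- Authors:
- HamediRad, Mohammad; Chao, Ran; Weisberg, Scott; Lian, Jiazhang; Sinha, Saurabh; Zhao, Huimin
- Publication Date:
- November 2019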
- Research Org.:
- Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC), Biological and Environmental Research (BER)
- OSTI Identifier:
- 1619591
- Alternate Identifier(s):
- OSTI ID: 1575393; OSTI ID: 1575394
- Grant/Contract Number:
- SC0018420
- Resource Type:
- Published Article
- Journal Name:
- Nature Communications
- Additional Journal Information:
- Journal Name: Nature Communications; Journal Volume: 10; Journal Issue: 1; Journal ID: ISSN 2041-1723
- Publisher:
- Nature Publishing Group
- Country of Publication:
- United Kingdom
- Language:
- English
- Subject:
- 59 BASIC BIOLOGICAL SCIENCES; 97 MATHEMATICS AND COMPUTING
Citation Formats
HamediRad, Mohammad, Chao, Ran, Weisberg, Scott, Lian, Jiazhang, Sinha, Saurabh, and Zhao, Huimin. Towards a fully automated algorithm driven platform for biosystems design. United Kingdom: N. p., 2019. Web. doi:10.1038/s41467-019-13189-z.
@article{osti_1619591,
title = {Towards a fully automated algorithm driven platform for biosystems design},
author = {HamediRad, Mohammad and Chao, Ran and Weisberg, Scott and Lian, Jiazhang and Sinha, Saurabh and Zhao, Huimin},
doi = {10.1038/s41467-019-13189-z},
journal = {Nature Communications},
number = 1,
volume = 10,
place = {United Kingdom},
year = {2019},
month = {11}
}