DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Lessons from Two Design–Build–Test–Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning

Abstract

The Design–Build–Test–Learn (DBTL) cycle, facilitated by exponentially improving capabilities in synthetic biology, is an increasingly adopted metabolic engineering framework that represents a more systematic and efficient approach to strain development than historical efforts in biofuels and biobased products. Here, we report on implementation of two DBTL cycles to optimize 1-dodecanol production from glucose using 60 engineered Escherichia coli MG1655 strains. The first DBTL cycle employed a simple strategy to learn efficiently from a relatively small number of strains (36), wherein only the choice of ribosomebinding sites and an acyl-ACP/acyl-CoA reductase were modulated in a single pathway operon including genes encoding a thioesterase (UcFatB1), an acyl-ACP/acyl-CoA reductase (Maqu_2507, Maqu_2220, or Acr1), and an acyl-CoA synthetase (FadD). Measured variables included concentrations of dodecanol and all proteins in the engineered pathway. We used the data produced in the first DBTL cycle to train several machine-learning algorithms and to suggest protein profiles for the second DBTL cycle that would increase production. These strategies resulted in a 21% increase in dodecanol titer in Cycle 2 (up to 0.83 g/L, which is more than 6-fold greater than previously reported batch values for minimal medium). Beyond specific lessons learned about optimizing dodecanol titer in E. coli, thismore » study had findings of broader relevance across synthetic biology applications, such as the importance of sequencing checks on plasmids in production strains as well as in cloning strains, and the critical need for more accurate protein expression predictive tools.« less

Authors:
 [1];  [2];  [3];  [2];  [2];  [2];  [2]; ORCiD logo [4]; ORCiD logo [5];  [6];  [7];  [2];  [2]; ORCiD logo [8]; ORCiD logo [9]; ORCiD logo [1]
  1. Joint BioEnergy Institute (JBEI), Emeryville, California 94608, United States, Biological Systems & Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
  2. Joint BioEnergy Institute (JBEI), Emeryville, California 94608, United States, Biological Systems & Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States, DOE Agile BioFoundry, Emeryville, California 94608, United States
  3. Research Institute for Bioscience Product & Fine Chemicals, Ajinomoto Co., Inc., Kawasaki 210-8680, Japan
  4. Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States, DOE Joint Genome Institute, Walnut Creek, California 94598, United States
  5. Joint BioEnergy Institute (JBEI), Emeryville, California 94608, United States, Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States, DOE Joint Genome Institute, Walnut Creek, California 94598, United States
  6. Sandia National Laboratories, Livermore, California 94550, United States
  7. DOE Joint Genome Institute, Walnut Creek, California 94598, United States
  8. Joint BioEnergy Institute (JBEI), Emeryville, California 94608, United States, Biological Systems & Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States, DOE Agile BioFoundry, Emeryville, California 94608, United States, DOE Joint Genome Institute, Walnut Creek, California 94598, United States
  9. Joint BioEnergy Institute (JBEI), Emeryville, California 94608, United States, Biological Systems & Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States, DOE Agile BioFoundry, Emeryville, California 94608, United States, BCAM, Basque Center for Applied Mathematics, 48009 Bilbao, Spain
Publication Date:
Research Org.:
Joint BioEnergy Institute (JBEI), Emeryville, CA (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER); USDOE Office of Energy Efficiency and Renewable Energy (EERE); USDOE Office of Energy Efficiency and Renewable Energy (EERE), Transportation Office. Bioenergy Technologies Office
OSTI Identifier:
1515590
Alternate Identifier(s):
OSTI ID: 1529116; OSTI ID: 1877605
Grant/Contract Number:  
AC02-05CH11231
Resource Type:
Published Article
Journal Name:
ACS Synthetic Biology
Additional Journal Information:
Journal Name: ACS Synthetic Biology Journal Volume: 8 Journal Issue: 6; Journal ID: ISSN 2161-5063
Publisher:
American Chemical Society
Country of Publication:
United States
Language:
English
Subject:
37 INORGANIC, ORGANIC, PHYSICAL, AND ANALYTICAL CHEMISTRY; 59 BASIC BIOLOGICAL SCIENCES; DBTL; machine learning; dodecanol; proteomics; synthetic biology

Citation Formats

Opgenorth, Paul, Costello, Zak, Okada, Takuya, Goyal, Garima, Chen, Yan, Gin, Jennifer, Benites, Veronica, de Raad, Markus, Northen, Trent R., Deng, Kai, Deutsch, Samuel, Baidoo, Edward E. K., Petzold, Christopher J., Hillson, Nathan J., Garcia Martin, Hector, and Beller, Harry R. Lessons from Two Design–Build–Test–Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning. United States: N. p., 2019. Web. doi:10.1021/acssynbio.9b00020.
Opgenorth, Paul, Costello, Zak, Okada, Takuya, Goyal, Garima, Chen, Yan, Gin, Jennifer, Benites, Veronica, de Raad, Markus, Northen, Trent R., Deng, Kai, Deutsch, Samuel, Baidoo, Edward E. K., Petzold, Christopher J., Hillson, Nathan J., Garcia Martin, Hector, & Beller, Harry R. Lessons from Two Design–Build–Test–Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning. United States. https://doi.org/10.1021/acssynbio.9b00020
Opgenorth, Paul, Costello, Zak, Okada, Takuya, Goyal, Garima, Chen, Yan, Gin, Jennifer, Benites, Veronica, de Raad, Markus, Northen, Trent R., Deng, Kai, Deutsch, Samuel, Baidoo, Edward E. K., Petzold, Christopher J., Hillson, Nathan J., Garcia Martin, Hector, and Beller, Harry R. Thu . "Lessons from Two Design–Build–Test–Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning". United States. https://doi.org/10.1021/acssynbio.9b00020.
@article{osti_1515590,
title = {Lessons from Two Design–Build–Test–Learn Cycles of Dodecanol Production in Escherichia coli Aided by Machine Learning},
author = {Opgenorth, Paul and Costello, Zak and Okada, Takuya and Goyal, Garima and Chen, Yan and Gin, Jennifer and Benites, Veronica and de Raad, Markus and Northen, Trent R. and Deng, Kai and Deutsch, Samuel and Baidoo, Edward E. K. and Petzold, Christopher J. and Hillson, Nathan J. and Garcia Martin, Hector and Beller, Harry R.},
abstractNote = {The Design–Build–Test–Learn (DBTL) cycle, facilitated by exponentially improving capabilities in synthetic biology, is an increasingly adopted metabolic engineering framework that represents a more systematic and efficient approach to strain development than historical efforts in biofuels and biobased products. Here, we report on implementation of two DBTL cycles to optimize 1-dodecanol production from glucose using 60 engineered Escherichia coli MG1655 strains. The first DBTL cycle employed a simple strategy to learn efficiently from a relatively small number of strains (36), wherein only the choice of ribosomebinding sites and an acyl-ACP/acyl-CoA reductase were modulated in a single pathway operon including genes encoding a thioesterase (UcFatB1), an acyl-ACP/acyl-CoA reductase (Maqu_2507, Maqu_2220, or Acr1), and an acyl-CoA synthetase (FadD). Measured variables included concentrations of dodecanol and all proteins in the engineered pathway. We used the data produced in the first DBTL cycle to train several machine-learning algorithms and to suggest protein profiles for the second DBTL cycle that would increase production. These strategies resulted in a 21% increase in dodecanol titer in Cycle 2 (up to 0.83 g/L, which is more than 6-fold greater than previously reported batch values for minimal medium). Beyond specific lessons learned about optimizing dodecanol titer in E. coli, this study had findings of broader relevance across synthetic biology applications, such as the importance of sequencing checks on plasmids in production strains as well as in cloning strains, and the critical need for more accurate protein expression predictive tools.},
doi = {10.1021/acssynbio.9b00020},
journal = {ACS Synthetic Biology},
number = 6,
volume = 8,
place = {United States},
year = {Thu May 09 00:00:00 EDT 2019},
month = {Thu May 09 00:00:00 EDT 2019}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1021/acssynbio.9b00020

Citation Metrics:
Cited by: 62 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

Fatty alcohol production in engineered E. coli expressing Marinobacter fatty acyl-CoA reductases
journal, June 2013

  • Liu, Aiqiu; Tan, Xiaoming; Yao, Lun
  • Applied Microbiology and Biotechnology, Vol. 97, Issue 15
  • DOI: 10.1007/s00253-013-5027-2

Standard Flow Liquid Chromatography for Shotgun Proteomics in Bioenergy Research
journal, April 2015

  • González Fernández-Niño, Susana M.; Smith-Moritz, A. Michelle; Chan, Leanne Jade G.
  • Frontiers in Bioengineering and Biotechnology, Vol. 3
  • DOI: 10.3389/fbioe.2015.00044

Production of medium chain length fatty alcohols from glucose in Escherichia coli
journal, November 2013


Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites
journal, November 2013

  • Espah Borujeni, Amin; Channarasappa, Anirudh S.; Salis, Howard M.
  • Nucleic Acids Research, Vol. 42, Issue 4
  • DOI: 10.1093/nar/gkt1139

Principal component analysis of proteomics (PCAP) as a tool to direct metabolic engineering
journal, March 2015


A specific acyl-ACP thioesterase implicated in medium-chain fatty acid production in immature cotyledons of Umbellularia californica
journal, February 1991

  • Pollard, Michael R.; Anderson, Lana; Fan, Calvin
  • Archives of Biochemistry and Biophysics, Vol. 284, Issue 2, p. 306-312
  • DOI: 10.1016/0003-9861(91)90300-8

Panorama: A Targeted Proteomics Knowledge Base
journal, July 2014

  • Sharma, Vagisha; Eckels, Josh; Taylor, Greg K.
  • Journal of Proteome Research, Vol. 13, Issue 9
  • DOI: 10.1021/pr5006636

Improving Metabolic Pathway Efficiency by Statistical Model-Based Multivariate Regulatory Metabolic Engineering
journal, August 2016


Natural products as biofuels and bio-based chemicals: fatty acids and isoprenoids
journal, January 2015

  • Beller, Harry R.; Lee, Taek Soon; Katz, Leonard
  • Natural Product Reports, Vol. 32, Issue 10
  • DOI: 10.1039/C5NP00068H

Characterization of a Fatty Acyl-CoA Reductase from Marinobacter aquaeolei VT8: A Bacterial Enzyme Catalyzing the Reduction of Fatty Acyl-CoA to Fatty Alcohol
journal, December 2011

  • Willis, Robert M.; Wahlen, Bradley D.; Seefeldt, Lance C.
  • Biochemistry, Vol. 50, Issue 48, p. 10550-10558
  • DOI: 10.1021/bi2008646

DeviceEditor visual biological CAD canvas
journal, January 2012


Rapid Kinetic Characterization of Glycosyl Hydrolases Based on Oxime Derivatization and Nanostructure-Initiator Mass Spectrometry (NIMS)
journal, May 2014

  • Deng, Kai; Takasuka, Taichi E.; Heins, Richard
  • ACS Chemical Biology, Vol. 9, Issue 7
  • DOI: 10.1021/cb5000289

Automated design of synthetic ribosome binding sites to control protein expression
journal, October 2009

  • Salis, Howard M.; Mirsky, Ethan A.; Voigt, Christopher A.
  • Nature Biotechnology, Vol. 27, Issue 10, p. 946-950
  • DOI: 10.1038/nbt.1568

Substantial improvements in methyl ketone production in E. coli and insights on the pathway from in vitro studies
journal, November 2014


j5 DNA Assembly Design Automation Software
journal, December 2011

  • Hillson, Nathan J.; Rosengarten, Rafael D.; Keasling, Jay D.
  • ACS Synthetic Biology, Vol. 1, Issue 1, p. 14-21
  • DOI: 10.1021/sb2000116

Isolation and characterization of novel mutations in the pSC101 origin that increase copy number
journal, January 2018


Golden Gate Shuffling: A One-Pot DNA Shuffling Method Based on Type IIs Restriction Enzymes
journal, May 2009


Predictable tuning of protein expression in bacteria
journal, January 2016

  • Bonde, Mads T.; Pedersen, Margit; Klausen, Michael S.
  • Nature Methods, Vol. 13, Issue 3
  • DOI: 10.1038/nmeth.3727

Improving methyl ketone production in Escherichia coli by heterologous expression of NADH-dependent FabG
journal, March 2018

  • Goh, Ee-Been; Chen, Yan; Petzold, Christopher J.
  • Biotechnology and Bioengineering, Vol. 115, Issue 5
  • DOI: 10.1002/bit.26558

Microbial production of fatty-acid-derived fuels and chemicals from plant biomass
journal, January 2010

  • Steen, Eric J.; Kang, Yisheng; Bokinsky, Gregory
  • Nature, Vol. 463, Issue 7280, p. 559-562
  • DOI: 10.1038/nature08721

Carboxylic acid reductase is a versatile enzyme for the conversion of fatty acids into fuels and chemical commodities
journal, December 2012

  • Akhtar, M. K.; Turner, N. J.; Jones, P. R.
  • Proceedings of the National Academy of Sciences, Vol. 110, Issue 1, p. 87-92
  • DOI: 10.1073/pnas.1216516110

Comprehensive analysis of glucose and xylose metabolism in Escherichia coli under aerobic and anaerobic conditions by 13C metabolic flux analysis
journal, January 2017

  • Gonzalez, Jacqueline E.; Long, Christopher P.; Antoniewicz, Maciek R.
  • Metabolic Engineering, Vol. 39
  • DOI: 10.1016/j.ymben.2016.11.003

A prokaryotic acyl-CoA reductase performing reduction of fatty acyl-CoA to fatty alcohol
journal, October 2011


Cell-free synthetic biology: Thinking outside the cell
journal, May 2012


An automated Design-Build-Test-Learn pipeline for enhanced microbial production of fine chemicals
journal, June 2018

  • Carbonell, Pablo; Jervis, Adrian J.; Robinson, Christopher J.
  • Communications Biology, Vol. 1, Issue 1
  • DOI: 10.1038/s42003-018-0076-9

Machine Learning of Designed Translational Control Allows Predictive Pathway Optimization in Escherichia coli
journal, December 2018


SBOL Visual: A Graphical Language for Genetic Designs
journal, December 2015


A Survey on Transfer Learning
journal, October 2010

  • Pan, Sinno Jialin; Yang, Qiang
  • IEEE Transactions on Knowledge and Data Engineering, Vol. 22, Issue 10
  • DOI: 10.1109/TKDE.2009.191

Development of an orthogonal fatty acid biosynthesis system in E. coli for oleochemical production
journal, July 2015


Identification of long chain specific aldehyde reductase and its use in enhanced fatty alcohol production in E. coli
journal, September 2016


Microbial Biosynthesis of Alkanes
journal, July 2010


Design, implementation and practice of JBEI-ICE: an open source biological part registry platform and tools
journal, June 2012

  • Ham, T. S.; Dmytriv, Z.; Plahar, H.
  • Nucleic Acids Research, Vol. 40, Issue 18
  • DOI: 10.1093/nar/gks531

The Experiment Data Depot: A Web-Based Software Tool for Biological Experimental Data Storage, Sharing, and Visualization
journal, September 2017

  • Morrell, William C.; Birkel, Garrett W.; Forrer, Mark
  • ACS Synthetic Biology, Vol. 6, Issue 12
  • DOI: 10.1021/acssynbio.7b00204

OpenMSI Arrayed Analysis Toolkit: Analyzing Spatially Defined Samples Using Mass Spectrometry Imaging
journal, May 2017


A One Pot, One Step, Precision Cloning Method with High Throughput Capability
journal, November 2008


Enzymatic assembly of DNA molecules up to several hundred kilobases
journal, April 2009

  • Gibson, Daniel G.; Young, Lei; Chuang, Ray-Yuan
  • Nature Methods, Vol. 6, Issue 5, p. 343-345
  • DOI: 10.1038/nmeth.1318

Nanostructure-initiator mass spectrometry: a protocol for preparing and applying NIMS surfaces for high-sensitivity mass analysis
journal, July 2008

  • Woo, Hin-Koon; Northen, Trent R.; Yanes, Oscar
  • Nature Protocols, Vol. 3, Issue 8
  • DOI: 10.1038/nprot.2008.110

Random Forests
journal, January 2001


Algorithmic co-optimization of genetic constructs and growth conditions: application to 6-ACA, a potential nylon-6 precursor
journal, October 2015


Optimization of fatty alcohol biosynthesis pathway for selectively enhanced production of C12/14 and C16/18 fatty alcohols in engineered Escherichia coli
journal, January 2012

  • Zheng, Yan-Ning; Li, Ling-Ling; Liu, Qiang
  • Microbial Cell Factories, Vol. 11, Article No. 65
  • DOI: 10.1186/1475-2859-11-65