DOE PAGES · U.S. Department of Energy
Office of Scientific and Technical Information

Title: Engineering of increased L-Threonine production in bacteria by combinatorial cloning and machine learning

Journal Article · Metabolic Engineering Communications
 [1];  [2];  [3];  [1];  [4];  [1];  [1];  [1];  [1];  [1];  [5];  [1]
  1. Argonne National Laboratory (ANL), Argonne, IL (United States)
  2. University of Chicago, IL (United States)
  3. BSMI, Northbrook, IL (United States)
  4. Columbia University, New York, NY (United States)
  5. Argonne National Laboratory (ANL), Argonne, IL (United States); University of Chicago, IL (United States)

The goal of this study is to develop a general strategy for bacterial engineering using an integrated synthetic biology and machine learning (ML) approach. This strategy was developed in the context of increasing L-threonine production in Escherichia coli ATCC 21277. A set of 16 genes was initially selected based on metabolic pathway relevance to threonine biosynthesis and used for combinatorial cloning to construct a set of 385 strains to generate training data (i.e., a range of L-threonine titers linked to each of the specific gene combinations). Hybrid (regression/classification) deep learning (DL) models were developed and used to predict additional gene combinations in subsequent rounds of combinatorial cloning for increased L-threonine production based on the training data. As a result, E. coli strains built after just three rounds of iterative combinatorial cloning and model prediction generated higher L-threonine titers (from 2.7 g/L to 8.4 g/L) than those of patented L-threonine strains being used as controls (4-5 g/L). Interesting combinations of genes in L-threonine production included deletions of the tdh, metL, dapA, and dhaM genes as well as overexpression of the pntAB, ppc, and aspC genes. Mechanistic analysis of the metabolic system constraints for the best performing constructs offers ways to improve the models by adjusting weights for specific gene combinations. Graph theory analysis of pairwise gene modifications and corresponding levels of L-threonine production also suggests additional rules that can be incorporated into future ML models.
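As a rough illustration of the data representation the abstract implies, the sketch below encodes a strain design as a vector of gene modifications and averages titers over pairs of modifications, in the spirit of the pairwise graph analysis mentioned above. It is not the authors' code: the function names, the -1/0/+1 encoding, and the titer values are placeholders assumed for illustration, and only the seven genes named in the abstract are used because the full 16-gene set is not listed there.

```python
from collections import defaultdict
from itertools import combinations

# Genes named in the abstract; the full 16-gene set is not given there.
GENES = ["tdh", "metL", "dapA", "dhaM", "pntAB", "ppc", "aspC"]


def encode(design):
    """Encode {gene: 'del' | 'oe'} as a vector (assumed scheme):
    -1 = deletion, +1 = overexpression, 0 = unmodified."""
    code = {"del": -1, "oe": 1}
    return [code.get(design.get(gene), 0) for gene in GENES]


def pairwise_mean_titer(strains):
    """Return the mean titer over all strains carrying both modifications
    of each pair; pairs can serve as weighted edges of a graph."""
    sums = defaultdict(lambda: [0.0, 0])  # pair -> [titer total, strain count]
    for design, titer in strains:
        mods = sorted(design.items())  # stable ordering so pairs are canonical
        for a, b in combinations(mods, 2):
            sums[(a, b)][0] += titer
            sums[(a, b)][1] += 1
    return {pair: total / count for pair, (total, count) in sums.items()}


# Placeholder designs and titers (g/L), invented for illustration only;
# these are NOT measurements from the study.
strains = [
    ({"tdh": "del", "pntAB": "oe"}, 4.0),
    ({"tdh": "del", "pntAB": "oe", "ppc": "oe"}, 6.0),
    ({"metL": "del", "ppc": "oe"}, 3.0),
]

features = [encode(design) for design, _ in strains]
edges = pairwise_mean_titer(strains)
```

Vectors like `features` could feed a regression or classification model, while `edges` gives one simple way to rank pairs of modifications by the titers of the strains that contain them.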

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Laboratory Directed Research and Development (LDRD) Program; USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1988321
Alternate ID(s):
OSTI ID: 2475507
Journal Information:
Metabolic Engineering Communications, Vol. 17; ISSN 2214-0301
Publisher:
Elsevier
Country of Publication:
United States
Language:
English
