DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Computational Tools for Discovering and Engineering Natural Product Biosynthetic Pathways

Abstract

Natural products (NPs), also known as secondary metabolites, are produced in bacteria, fungi, and plants. NPs represent a rich source of antibacterial, antifungal, and anticancer agents. Recent advances in DNA sequencing technologies and bioinformatics unveiled nature’s great potential for synthesizing numerous NPs that may confer unprecedented structural and biological features. However, discovering novel bioactive NPs by genome mining remains a challenge. Moreover, even with interesting bioactivity, the low productivity of many NPs significantly limits their practical applications. Here we discuss the progress in developing bioinformatics tools for efficient discovery of bioactive NPs. In addition, we highlight computational methods for optimizing the productivity of NPs with pharmaceutical importance.

Authors:
; ; ORCiD logo
Publication Date:
Research Org.:
Center for Advanced Bioenergy and Bioproducts Innovation (CABBI), Urbana, IL (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
OSTI Identifier:
1581825
Alternate Identifier(s):
OSTI ID: 1581188
Grant/Contract Number:  
SC0018420; SC0018260; GM077596; AI144967
Resource Type:
Published Article
Journal Name:
iScience
Additional Journal Information:
Journal Name: iScience Journal Volume: 23 Journal Issue: 1; Journal ID: ISSN 2589-0042
Publisher:
Elsevier
Country of Publication:
Netherlands
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; biosynthesis; natural products; biosynthetic gene clusters; biosynthetic pathways; synthetic biology

Citation Formats

Ren, Hengqian, Shi, Chengyou, and Zhao, Huimin. Computational Tools for Discovering and Engineering Natural Product Biosynthetic Pathways. Netherlands: N. p., 2020. Web. doi:10.1016/j.isci.2019.100795.
Ren, Hengqian, Shi, Chengyou, & Zhao, Huimin. Computational Tools for Discovering and Engineering Natural Product Biosynthetic Pathways. Netherlands. https://doi.org/10.1016/j.isci.2019.100795
Ren, Hengqian, Shi, Chengyou, and Zhao, Huimin. Wed . "Computational Tools for Discovering and Engineering Natural Product Biosynthetic Pathways". Netherlands. https://doi.org/10.1016/j.isci.2019.100795.
@article{osti_1581825,
title = {Computational Tools for Discovering and Engineering Natural Product Biosynthetic Pathways},
author = {Ren, Hengqian and Shi, Chengyou and Zhao, Huimin},
abstractNote = {Natural products (NPs), also known as secondary metabolites, are produced in bacteria, fungi, and plants. NPs represent a rich source of antibacterial, antifungal, and anticancer agents. Recent advances in DNA sequencing technologies and bioinformatics unveiled nature’s great potential for synthesizing numerous NPs that may confer unprecedented structural and biological features. However, discovering novel bioactive NPs by genome mining remains a challenge. Moreover, even with interesting bioactivity, the low productivity of many NPs significantly limits their practical applications. Here we discuss the progress in developing bioinformatics tools for efficient discovery of bioactive NPs. In addition, we highlight computational methods for optimizing the productivity of NPs with pharmaceutical importance.},
doi = {10.1016/j.isci.2019.100795},
journal = {iScience},
number = 1,
volume = 23,
place = {Netherlands},
year = {2020},
month = {1}
}

Works referenced in this record:

Antibiotic resistance–mediated isolation of scaffold-specific natural product producers
journal, May 2014

  • Thaker, Maulik N.; Waglechner, Nicholas; Wright, Gerry D.
  • Nature Protocols, Vol. 9, Issue 6
  • DOI: 10.1038/nprot.2014.093

A review of computational tools for design and reconstruction of metabolic pathways
journal, December 2017


PRISM 3: expanded prediction of natural product chemical structures from microbial genomes
journal, April 2017

  • Skinnider, Michael A.; Merwin, Nishanth J.; Johnston, Chad W.
  • Nucleic Acids Research, Vol. 45, Issue W1
  • DOI: 10.1093/nar/gkx320

Identification of Thiotetronic Acid Antibiotic Biosynthetic Pathways by Target-directed Genome Mining
journal, October 2015


Predicting synonymous codon usage and optimizing the heterologous gene for expression in E. coli
journal, August 2017


The antiSMASH database version 2: a comprehensive resource on secondary metabolite biosynthetic gene clusters
journal, November 2018

  • Blin, Kai; Pascal Andreu, Victòria; de los Santos, Emmanuel L. C.
  • Nucleic Acids Research, Vol. 47, Issue D1
  • DOI: 10.1093/nar/gky1060

Data access for the 1,000 Plants (1KP) project
journal, October 2014


KEGG: Kyoto Encyclopedia of Genes and Genomes
journal, January 2000

  • Kanehisa, Minoru; Goto, Susumu
  • Nucleic Acids Research, Vol. 28, Issue 1, p. 27-30
  • DOI: 10.1093/nar/28.1.27

A New Golden Age of Natural Products Drug Discovery
journal, December 2015


Computational tools for enzyme improvement: why everyone can – and should – use them
journal, April 2017


WGCNA: an R package for weighted correlation network analysis
journal, December 2008


CLUSEAN: A computer-based framework for the automated analysis of bacterial secondary metabolite biosynthetic gene clusters
journal, March 2009


Complete biosynthesis of cannabinoids and their unnatural analogues in yeast
journal, February 2019


TATA is a modular component of synthetic promoters
journal, July 2010


OPTIMIZER: a web server for optimizing the codon usage of DNA sequences
journal, May 2007

  • Puigbo, P.; Guzman, E.; Romeu, A.
  • Nucleic Acids Research, Vol. 35, Issue Web Server
  • DOI: 10.1093/nar/gkm219

SMURF: Genomic mapping of fungal secondary metabolite clusters
journal, September 2010

  • Khaldi, Nora; Seifuddin, Fayaz T.; Turner, Geoff
  • Fungal Genetics and Biology, Vol. 47, Issue 9
  • DOI: 10.1016/j.fgb.2010.06.003

Discovery of microbial natural products by activation of silent biosynthetic gene clusters
journal, June 2015

  • Rutledge, Peter J.; Challis, Gregory L.
  • Nature Reviews Microbiology, Vol. 13, Issue 8
  • DOI: 10.1038/nrmicro3496

An introduction to the medicinal plant genome project
journal, June 2011


JCat: a novel tool to adapt codon usage of a target gene to its potential expression host
journal, July 2005

  • Grote, A.; Hiller, K.; Scheer, M.
  • Nucleic Acids Research, Vol. 33, Issue Web Server
  • DOI: 10.1093/nar/gki376

antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences
journal, June 2011

  • Medema, Marnix H.; Blin, Kai; Cimermancic, Peter
  • Nucleic Acids Research, Vol. 39, Issue suppl_2
  • DOI: 10.1093/nar/gkr466

Natural Products as Sources of New Drugs from 1981 to 2014
journal, October 2015


antiSMASH 4.0—improvements in chemistry prediction and gene cluster boundary identification
journal, April 2017

  • Blin, Kai; Wolf, Thomas; Chevrette, Marc G.
  • Nucleic Acids Research, Vol. 45, Issue W1
  • DOI: 10.1093/nar/gkx319

Direct cloning of large genomic sequences
journal, May 2012

  • Cobb, Ryan E.; Zhao, Huimin
  • Nature Biotechnology, Vol. 30, Issue 5
  • DOI: 10.1038/nbt.2207

Visual gene developer: a fully programmable bioinformatics software for synthetic gene optimization
journal, August 2011


“CodonWizard” – An intuitive software tool with graphical user interface for customizable codon optimization in protein expression efforts
journal, August 2019

  • Rehbein, Peter; Berz, Jannik; Kreisel, Patrick
  • Protein Expression and Purification, Vol. 160
  • DOI: 10.1016/j.pep.2019.03.018

Assembly-Line Enzymology for Polyketide and Nonribosomal Peptide Antibiotics:  Logic, Machinery, and Mechanisms
journal, August 2006

  • Fischbach, Michael A.; Walsh, Christopher T.
  • Chemical Reviews, Vol. 106, Issue 8
  • DOI: 10.1021/cr0503097

MIBiG 2.0: a repository for biosynthetic gene clusters of known function
journal, October 2019

  • Kautsar, Satria A.; Blin, Kai; Shaw, Simon
  • Nucleic Acids Research
  • DOI: 10.1093/nar/gkz882

Automated genome mining for natural products
journal, January 2009


Phylogenomic Analysis of Natural Products Biosynthetic Gene Clusters Allows Discovery of Arseno-Organic Metabolites in Model Streptomycetes
journal, June 2016

  • Cruz-Morales, Pablo; Kopp, Johannes Florian; Martínez-Guerrero, Christian
  • Genome Biology and Evolution, Vol. 8, Issue 6
  • DOI: 10.1093/gbe/evw125

Design of computational retrobiosynthesis tools for the design of de novo synthetic pathways
journal, October 2015


Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature
journal, January 2013

  • Arnison, Paul G.; Bibb, Mervyn J.; Bierbaum, Gabriele
  • Nat. Prod. Rep., Vol. 30, Issue 1
  • DOI: 10.1039/C2NP20085F

CoExpNetViz: Comparative Co-Expression Networks Construction and Visualization Tool
journal, January 2016


De novo design of bioactive protein switches
journal, July 2019


A Review of the Microbial Production of Bioactive Natural Products and Biologics
journal, June 2019

  • Pham, Janette V.; Yilma, Mariamawit A.; Feliz, Adriana
  • Frontiers in Microbiology, Vol. 10
  • DOI: 10.3389/fmicb.2019.01404

Computational codon optimization of synthetic gene for protein expression
journal, January 2012


Natural products against Alzheimer's disease: Pharmaco-therapeutics and biotechnological interventions
journal, March 2017


Presyncodon, a Web Server for Gene Design with the Evolutionary Information of the Expression Hosts
journal, December 2018

  • Tian, Jian; Li, Qingbin; Chu, Xiaoyu
  • International Journal of Molecular Sciences, Vol. 19, Issue 12
  • DOI: 10.3390/ijms19123872

Computer-aided re-engineering of nonribosomal peptide and polyketide biosynthetic assembly lines
journal, January 2019

  • Alanjary, Mohammad; Cano-Prieto, Carolina; Gross, Harald
  • Natural Product Reports, Vol. 36, Issue 9
  • DOI: 10.1039/C9NP00021F

Codon Bias as a Means to Fine-Tune Gene Expression
journal, July 2015


Insights into Secondary Metabolism from a Global Analysis of Prokaryotic Biosynthetic Gene Clusters
journal, July 2014


Virtual Footprint and PRODORIC: an integrative framework for regulon prediction in prokaryotes
journal, August 2005


ClustScan : an integrated program package for the semi-automatic annotation of modular biosynthetic gene clusters and in silico prediction of novel chemical structures
journal, October 2008

  • Starcevic, Antonio; Zucko, Jurica; Simunkovic, Jurica
  • Nucleic Acids Research, Vol. 36, Issue 21
  • DOI: 10.1093/nar/gkn685

antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters
journal, May 2015

  • Weber, Tilmann; Blin, Kai; Duddela, Srikanth
  • Nucleic Acids Research, Vol. 43, Issue W1
  • DOI: 10.1093/nar/gkv437

Complete biosynthesis of opioids in yeast
journal, August 2015


BacPP: Bacterial promoter prediction—A tool for accurate sigma-factor specific assignment in enterobacteria
journal, October 2011

  • de Avila e. Silva, Scheila; Echeverrigaray, Sergio; Gerhardt, Günther J. L.
  • Journal of Theoretical Biology, Vol. 287
  • DOI: 10.1016/j.jtbi.2011.07.017

Using RNA Sequence and Structure for the Prediction of Riboswitch Aptamer: A Comprehensive Review of Available Software and Tools
journal, January 2018

  • Antunes, Deborah; Jorge, Natasha A. N.; Caffarena, Ernesto R.
  • Frontiers in Genetics, Vol. 8
  • DOI: 10.3389/fgene.2017.00231

EvoMining reveals the origin and fate of natural product biosynthetic enzymes
journal, December 2019

  • Sélem-Mojica, Nelly; Aguilar, César; Gutiérrez-García, Karina
  • Microbial Genomics, Vol. 5, Issue 12
  • DOI: 10.1099/mgen.0.000260

Synthetic biology strategies for microbial biosynthesis of plant natural products
journal, May 2019


RetroPath2.0: A retrosynthesis workflow for metabolic engineers
journal, January 2018


Synthetic promoter libraries – tuning of gene expression
journal, February 2006


RibEx: a web server for locating riboswitches and other conserved bacterial regulatory elements
journal, July 2005

  • Abreu-Goodger, C.; Merino, E.
  • Nucleic Acids Research, Vol. 33, Issue Web Server
  • DOI: 10.1093/nar/gki445

Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles 1 1Edited by J. Doudna
journal, November 2001

  • Gautheret, Daniel; Lambert, André
  • Journal of Molecular Biology, Vol. 313, Issue 5
  • DOI: 10.1006/jmbi.2001.5102

The Antibiotic Resistant Target Seeker (ARTS), an exploration engine for antibiotic cluster prioritization and novel drug target discovery
journal, May 2017

  • Alanjary, Mohammad; Kronmiller, Brent; Adamek, Martina
  • Nucleic Acids Research, Vol. 45, Issue W1
  • DOI: 10.1093/nar/gkx360

Defining and combating antibiotic resistance from One Health and Global Health perspectives
journal, August 2019

  • Hernando-Amado, Sara; Coque, Teresa M.; Baquero, Fernando
  • Nature Microbiology, Vol. 4, Issue 9
  • DOI: 10.1038/s41564-019-0503-9

Improving heterologous membrane protein production in Escherichia coli by combining transcriptional tuning and codon usage algorithms
journal, September 2017


eSNaPD: A Versatile, Web-Based Bioinformatics Platform for Surveying and Mining Natural Product Biosynthetic Diversity from Metagenomes
journal, August 2014


COStar: A D-star Lite-based dynamic search algorithm for codon optimization
journal, March 2014


Advances in protein structure prediction and design
journal, August 2019


A high-throughput screening and computation platform for identifying synthetic promoters with enhanced cell-state specificity (SPECS)
journal, June 2019


Evolution of the Cannabinoid and Terpene Content during the Growth of Cannabis sativa Plants from Different Chemotypes
journal, January 2016


BAGEL3: automated identification of genes encoding bacteriocins and (non-)bactericidal posttranslationally modified peptides
journal, May 2013

  • van Heel, Auke J.; de Jong, Anne; Montalbán-López, Manuel
  • Nucleic Acids Research, Vol. 41, Issue W1
  • DOI: 10.1093/nar/gkt391

The Eukaryotic Promoter Database: expansion of EPDnew and new promoter analysis tools
journal, November 2014

  • Dreos, René; Ambrosini, Giovanna; Périer, Rouayda Cavin
  • Nucleic Acids Research, Vol. 43, Issue D1
  • DOI: 10.1093/nar/gku1111

A new cyanogenic metabolite in Arabidopsis required for inducible pathogen defence
journal, September 2015

  • Rajniak, Jakub; Barco, Brenden; Clay, Nicole K.
  • Nature, Vol. 525, Issue 7569
  • DOI: 10.1038/nature14907

GeMS: an advanced software package for designing synthetic genes
journal, May 2005


Quantifying Absolute Protein Synthesis Rates Reveals Principles Underlying Allocation of Cellular Resources
journal, April 2014


Synthetic biology to access and expand nature's chemical diversity
journal, February 2016

  • Smanski, Michael J.; Zhou, Hui; Claesen, Jan
  • Nature Reviews Microbiology, Vol. 14, Issue 3
  • DOI: 10.1038/nrmicro.2015.24

antiSMASH 2.0—a versatile platform for genome mining of secondary metabolite producers
journal, May 2013

  • Blin, Kai; Medema, Marnix H.; Kazempour, Daniyal
  • Nucleic Acids Research, Vol. 41, Issue W1
  • DOI: 10.1093/nar/gkt449

WebGeSTer DB—a transcription terminator database
journal, November 2010

  • Mitra, Anirban; Kesarwani, Anil K.; Pal, Debnath
  • Nucleic Acids Research, Vol. 39, Issue suppl_1
  • DOI: 10.1093/nar/gkq971

Efficient search, mapping, and optimization of multi-protein genetic systems in diverse bacteria
journal, June 2014

  • Farasat, I.; Kushwaha, M.; Collens, J.
  • Molecular Systems Biology, Vol. 10, Issue 6, p. 731-731
  • DOI: 10.15252/msb.20134955

Modular and tunable biological feedback control using a de novo protein switch
journal, July 2019


Codon and Codon-Pair Usage Tables (CoCoPUTs): Facilitating Genetic Variation Analyses and Recombinant Gene Design
journal, June 2019

  • Alexaki, Aikaterini; Kames, Jacob; Holcomb, David D.
  • Journal of Molecular Biology, Vol. 431, Issue 13
  • DOI: 10.1016/j.jmb.2019.04.021

Rationally reduced libraries for combinatorial pathway optimization minimizing experimental effort
journal, March 2016

  • Jeschek, Markus; Gerngross, Daniel; Panke, Sven
  • Nature Communications, Vol. 7, Issue 1
  • DOI: 10.1038/ncomms11163

Machine-learning-guided directed evolution for protein engineering
journal, July 2019


Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites
journal, November 2013

  • Espah Borujeni, Amin; Channarasappa, Anirudh S.; Salis, Howard M.
  • Nucleic Acids Research, Vol. 42, Issue 4
  • DOI: 10.1093/nar/gkt1139

plantiSMASH: automated identification, annotation and expression analysis of plant biosynthetic gene clusters
journal, April 2017

  • Kautsar, Satria A.; Suarez Duran, Hernando G.; Blin, Kai
  • Nucleic Acids Research, Vol. 45, Issue W1
  • DOI: 10.1093/nar/gkx305

UpGene: Application of a Web-Based DNA Codon Optimization Algorithm
journal, January 2004

  • Gao, Wentao; Rzewski, Alexis; Sun, Huijie
  • Biotechnology Progress, Vol. 20, Issue 2
  • DOI: 10.1021/bp0300467

SBSPKS: structure based sequence analysis of polyketide synthases
journal, May 2010

  • Anand, Swadha; Prasad, M. V. R.; Yadav, Gitanjali
  • Nucleic Acids Research, Vol. 38, Issue suppl_2
  • DOI: 10.1093/nar/gkq340

Automated design of synthetic ribosome binding sites to control protein expression
journal, October 2009

  • Salis, Howard M.; Mirsky, Ethan A.; Voigt, Christopher A.
  • Nature Biotechnology, Vol. 27, Issue 10, p. 946-950
  • DOI: 10.1038/nbt.1568

A new genome-mining tool redefines the lasso peptide biosynthetic landscape
journal, February 2017

  • Tietz, Jonathan I.; Schwalen, Christopher J.; Patel, Parth S.
  • Nature Chemical Biology, Vol. 13, Issue 5
  • DOI: 10.1038/nchembio.2319

High Guanine and Cytosine Content Increases mRNA Levels in Mammalian Cells
journal, May 2006


Engineering of cell factories for the production of natural products
journal, January 2019


Six enzymes from mayapple that complete the biosynthetic pathway to the etoposide aglycone
journal, September 2015


Codon Optimization OnLine (COOL): a web-based multi-objective optimization platform for synthetic gene design
journal, April 2014


Predicting effects of noncoding variants with deep learning–based sequence model
journal, August 2015

  • Zhou, Jian; Troyanskaya, Olga G.
  • Nature Methods, Vol. 12, Issue 10
  • DOI: 10.1038/nmeth.3547

antiSMASH 5.0: updates to the secondary metabolite genome mining pipeline
journal, April 2019

  • Blin, Kai; Shaw, Simon; Steinke, Katharina
  • Nucleic Acids Research, Vol. 47, Issue W1
  • DOI: 10.1093/nar/gkz310

EuGene: maximizing synthetic gene design for heterologous expression
journal, July 2012


ClusterCAD: a computational platform for type I modular polyketide synthase design
journal, October 2017

  • Eng, Clara H.; Backman, Tyler W. H.; Bailey, Constance B.
  • Nucleic Acids Research, Vol. 46, Issue D1
  • DOI: 10.1093/nar/gkx893

Multiparameter RNA and Codon Optimization: A Standardized Tool to Assess and Enhance Autologous Mammalian Gene Expression
journal, March 2011


NRPSpredictor2—a web server for predicting NRPS adenylation domain specificity
journal, May 2011

  • Röttig, Marc; Medema, Marnix H.; Blin, Kai
  • Nucleic Acids Research, Vol. 39, Issue suppl_2
  • DOI: 10.1093/nar/gkr323

Towards a fully automated algorithm driven platform for biosystems design
journal, November 2019


Expansion of Biological Pathways Based on Evolutionary Inference
journal, July 2014


PePPER: a webserver for prediction of prokaryote promoter elements and regulons
journal, January 2012