DOE PAGES, U.S. Department of Energy
Office of Scientific and Technical Information

Title: HypoRiPPAtlas as an Atlas of hypothetical natural products for mass spectrometry database search

Journal Article · Nature Communications

Abstract: Recent analyses of public microbial genomes have found over a million biosynthetic gene clusters, most of whose natural products remain unknown. Additionally, GNPS harbors billions of mass spectra of natural products without known structures and biosynthetic genes. We bridge the gap between large-scale genome mining and mass spectral datasets for natural product discovery by developing HypoRiPPAtlas, an Atlas of hypothetical natural product structures, which is ready to use for in silico database search of tandem mass spectra. HypoRiPPAtlas is constructed by mining genomes using seq2ripp, a machine-learning tool for the prediction of ribosomally synthesized and post-translationally modified peptides (RiPPs). In HypoRiPPAtlas, we identify RiPPs in microbes and plants. HypoRiPPAtlas could be extended to other natural product classes in the future by implementing corresponding biosynthetic logic. This study paves the way for large-scale explorations of biosynthetic pathways and chemical structures of microbial and plant RiPP classes.

Sponsoring Organization:
USDOE
Grant/Contract Number:
NONE; SC0021340
OSTI ID:
1989754
Journal Information:
Nature Communications, Vol. 14, Issue 1; ISSN 2041-1723
Publisher:
Nature Publishing Group
Country of Publication:
United Kingdom
Language:
English
