DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: The secondary metabolism collaboratory: a database and web discussion portal for secondary metabolite biosynthetic gene clusters

Journal Article · · Nucleic Acids Research

Secondary metabolites are small molecules produced by all corners of life, often with specialized bioactive functions with clinical and environmental relevance. Secondary metabolite biosynthetic gene clusters (BGCs) can often be identified within DNA sequences by various sequence similarity tools, but determining the exact functions of genes in the pathway and predicting their chemical products can often only be done by careful, manual comparative analysis. To facilitate this, we report the first release of the secondary metabolism collaboratory (SMC), which aims to provide a comprehensive, tool-agnostic repository of BGC sequence data drawn from all publicly available and user-submitted bacterial and archaeal genome and contig sources. On the website, users are provided a searchable catalog of putative BGCs identified from each source, along with visualizations of gene and domain annotations derived from multiple sequence analysis tools. SMC’s data is also available through publicly-accessible application programming interface (API) endpoints to facilitate programmatic access. Users are encouraged to share their findings (and search for others’) through comment posts on BGC and source pages. At the time of writing, SMC is the largest repository of BGC information, holding 13.1M BGC regions from 1.3M source sequences and growing, and can be found at https://smc.jgi.doe.gov.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); USDOE Joint Genome Institute (JGI), Berkeley, CA (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES). Scientific User Facilities (SUF); USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
2481474
Journal Information:
Nucleic Acids Research, Journal Name: Nucleic Acids Research Journal Issue: D1 Vol. 53; ISSN 0305-1048
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United States
Language:
English

References (21)

Natural product discovery: past, present, and future journal January 2016
Basic local alignment search tool journal October 1990
Renaissance in antibacterial discovery from actinomycetes journal October 2008
Secondary metabolic gene clusters: evolutionary toolkits for chemical innovation journal October 2010
Module-Based Polyketide Synthase Engineering for de Novo Polyketide Biosynthesis journal October 2023
Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking journal August 2016
A community-sourced glossary of open scholarship terms journal February 2022
Ancient defensive terpene biosynthetic gene clusters in the soft corals journal May 2022
Exploring and retrieving sequence and metadata for species across the tree of life with NCBI Datasets journal July 2024
Natural products in soil microbe interactions and evolution journal January 2015
Genome mining methods to discover bioactive natural products journal January 2021
InterProScan 5: genome-scale protein function classification journal January 2014
The Natural Products Atlas 2.0: a database of microbially-derived natural products journal October 2021
MIBiG 3.0: a community-driven effort to annotate experimentally validated biosynthetic gene clusters journal November 2022
The conserved domain database in 2023 journal December 2022
The IMG/M data management and analysis system v.7: content updates and new features journal November 2022
antiSMASH 7.0: new and improved predictions for detection, regulation, chemical structures and visualisation journal May 2023
The antiSMASH database version 4: additional genomes and BGCs, new sequence-based searches and more journal October 2023
A deep learning genome-mining strategy for biosynthetic gene cluster prediction journal August 2019
Prodigal: prokaryotic gene recognition and translation initiation site identification journal March 2010
A standardized workflow for submitting data to the Minimum Information about a Biosynthetic Gene cluster (MIBiG) repository: prospects for research-based educational experiences journal July 2018