DOE JGI Metagenome Workflow
- Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
The DOE Joint Genome Institute (JGI) Metagenome Workflow performs metagenome data processing, including assembly; structural, functional, and taxonomic annotation; and binning of metagenomic data sets that are subsequently included into the Integrated Microbial Genomes and Microbiomes (IMG/M) (I.-M. A. Chen, K. Chu, K. Palaniappan, A. Ratner, et al., Nucleic Acids Res, 49:D751–D763, 2021, https://doi.org/10.1093/nar/gkaa939) comparative analysis system and provided for download via the JGI data portal (https://genome.jgi.doe.gov/portal/). This workflow scales to run on thousands of metagenome samples per year, which can vary by the complexity of microbial communities and sequencing depth. Here, we describe the different tools, databases, and parameters used at different steps of the workflow to help with the interpretation of metagenome data available in IMG and to enable researchers to apply this workflow to their own data. We use 20 publicly available sediment metagenomes to illustrate the computing requirements for the different steps and highlight the typical results of data processing. The workflow modules for read filtering and metagenome assembly are available as a workflow description language (WDL) file (https://code.jgi.doe.gov/BFoster/jgi_meta_wdl). The workflow modules for annotation and binning are provided as a service to the user community at https://img.jgi.doe.gov/submit and require filling out the project and associated metadata descriptions in the Genomes OnLine Database (GOLD) (S. Mukherjee, D. Stamatis, J. Bertsch, G. Ovchinnikova, et al., Nucleic Acids Res, 49:D723–D733, 2021, https://doi.org/10.1093/nar/gkaa983).
- Research Organization:
- Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
- Sponsoring Organization:
- U.S. Department of Energy; USDOE Office of Science (SC)
- Grant/Contract Number:
- AC02-05CH11231
- OSTI ID:
- 1826557
- Alternate ID(s):
- OSTI ID: 1828334
- Journal Information:
- mSystems, Journal Name: mSystems Journal Issue: 3 Vol. 6; ISSN 2379-5077
- Publisher:
- American Society for MicrobiologyCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
IMG/M v.5.0: an integrated data management and comparative analysis system for microbial genomes and microbiomes
IMG/M 4 version of the integrated metagenome comparative analysis system
IMG/M: integrated genome and metagenome comparative data analysis system
Journal Article
·
Thu Oct 04 20:00:00 EDT 2018
· Nucleic Acids Research
·
OSTI ID:1542357
IMG/M 4 version of the integrated metagenome comparative analysis system
Journal Article
·
Tue Oct 15 20:00:00 EDT 2013
· Nucleic Acids Research
·
OSTI ID:1625530
IMG/M: integrated genome and metagenome comparative data analysis system
Journal Article
·
Wed Oct 12 20:00:00 EDT 2016
· Nucleic Acids Research
·
OSTI ID:1379657