DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Coassembly and binning of a twenty-year metagenomic time-series from Lake Mendota

Journal Article · · Scientific Data
; ; ; ; ORCiD logo; ; ; ; ; ; ; ; ORCiD logo; ; ORCiD logo; ; ORCiD logo; ORCiD logo; ORCiD logo; ORCiD logo more »; ORCiD logo; ; ORCiD logo; ORCiD logo; ; ; ORCiD logo; ; ; ; ORCiD logo; ; ORCiD logo; ORCiD logo « less

Abstract The North Temperate Lakes Long-Term Ecological Research (NTL-LTER) program has been extensively used to improve understanding of how aquatic ecosystems respond to environmental stressors, climate fluctuations, and human activities. Here, we report on the metagenomes of samples collected between 2000 and 2019 from Lake Mendota, a freshwater eutrophic lake within the NTL-LTER site. We utilized the distributed metagenome assembler MetaHipMer to coassemble over 10 terabases (Tbp) of data from 471 individual Illumina-sequenced metagenomes. A total of 95,523,664 contigs were assembled and binned to generate 1,894 non-redundant metagenome-assembled genomes (MAGs) with ≥50% completeness and ≤10% contamination. Phylogenomic analysis revealed that the MAGs were nearly exclusively bacterial, dominated by Pseudomonadota (Proteobacteria, N = 623) and Bacteroidota (N = 321). Nine eukaryotic MAGs were identified by eukCC with six assigned to the phylum Chlorophyta. Additionally, 6,350 high-quality viral sequences were identified by geNomad with the majority classified in the phylum Uroviricota. This expansive coassembled metagenomic dataset provides an unprecedented foundation to advance understanding of microbial communities in freshwater ecosystems and explore temporal ecosystem dynamics.

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
USDOE; USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science (BSS)
Grant/Contract Number:
89233218CNA000001; AC02-05CH11231; AC05-00OR22725; AC05-76RL01830
OSTI ID:
2439608
Journal Information:
Scientific Data, Journal Name: Scientific Data Journal Issue: 1 Vol. 11; ISSN 2052-4463
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United Kingdom
Language:
English

References (15)

Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea journal August 2017
CheckV assesses the quality and completeness of metagenome-assembled viral genomes journal December 2020
Identification of mobile genetic elements with geNomad journal September 2023
Terabase-scale metagenome coassembly with MetaHipMer journal July 2020
Species invasions shift microbial phenology in a two-decade freshwater time series journal March 2023
GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database journal November 2019
PhycoCosm, a comparative algal genomics resource journal October 2020
Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation journal April 2021
Twenty-five years of Genomes OnLine Database (GOLD): data updates and new features in v.9 journal November 2022
The IMG/M data management and analysis system v.7: content updates and new features journal November 2022
CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes journal May 2015
DOE JGI Metagenome Workflow journal June 2021
Estimating the quality of eukaryotic genomes recovered from metagenomic analysis with EukCC journal September 2020
Freshwater carbon and nutrient cycles revealed through reconstructed population genomes journal January 2018
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies journal January 2019