U.S. Department of Energy
Office of Scientific and Technical Information

Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses

Journal Article · · mSystems

ABSTRACT

Although long-read sequencing has enabled the recovery of high-quality, complete genomes from metagenomes, many challenges remain in completely decomposing a metagenome into its constituent prokaryotic and viral genomes. This study focuses on decomposing an estuarine metagenome to obtain a more accurate estimate of microbial diversity. To achieve this, we developed a new bead-based DNA extraction method and a novel bin refinement method, and obtained 150 Gbp of Nanopore sequencing data. We estimate that there are ~500 bacterial and archaeal species in our sample and obtained 68 high-quality bins (>90% complete, <5% contamination, ≤5 contigs, contig length of >100 kbp, and all ribosomal and tRNA genes). We also obtained many contigs of picoeukaryotes, environmental DNA of larger eukaryotes such as mammals, and complete mitochondrial and chloroplast genomes, and detected ~40,000 viral populations. Our analysis indicates that only a few strains account for most of the species abundances.

IMPORTANCE

Ocean and estuarine microbiomes play critical roles in global element cycling and ecosystem function. Despite the importance of these microbial communities, many species still have not been cultured in the lab. Environmental sequencing is the primary way to study the function and population dynamics of these communities. Long-read sequencing provides an avenue to overcome the limitations of short-read technologies and obtain complete microbial genomes, but it comes with its own technical challenges, such as the sequencing depth needed and the difficulty of obtaining high-quality DNA. We present here new sampling and bioinformatics methods for attempting to decompose an estuarine microbiome into its constituent genomes. Our results suggest that only a few strains account for most of the species abundances, from viruses to picoeukaryotes, and that fully decomposing a metagenome of this diversity requires 1 Tbp of long-read sequencing. We anticipate that as long-read sequencing technologies continue to improve, less sequencing will be needed.
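The high-quality bin criteria listed in the abstract (>90% complete, <5% contamination, ≤5 contigs, contig length of >100 kbp, and all ribosomal and tRNA genes) can be read as a simple filter. The following is a minimal sketch of that reading, not the authors' pipeline. The `Bin` record, its field names, and the example values are hypothetical, and the sketch assumes that the length threshold applies to every contig in a bin and that gene presence has already been reduced to two boolean flags.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Bin:
    """Hypothetical summary record for one metagenomic bin."""
    name: str
    completeness: float           # percent, e.g. from a genome quality tool
    contamination: float          # percent
    contig_lengths_bp: List[int]  # length of each contig in the bin
    has_all_rrna: bool            # assumed flag: all ribosomal RNA genes found
    has_all_trna: bool            # assumed flag: all tRNA genes found


def is_high_quality(b: Bin) -> bool:
    """Apply the thresholds quoted in the abstract.

    Assumption: ">100 kbp" is interpreted as a per-contig minimum length.
    """
    return (
        b.completeness > 90.0
        and b.contamination < 5.0
        and 0 < len(b.contig_lengths_bp) <= 5
        and all(length > 100_000 for length in b.contig_lengths_bp)
        and b.has_all_rrna
        and b.has_all_trna
    )


# Made-up example bins, for illustration only.
bins = [
    Bin("bin_a", 98.2, 0.7, [2_400_000, 310_000], True, True),
    Bin("bin_b", 95.0, 1.2, [900_000, 80_000], True, True),  # one short contig
    Bin("bin_c", 88.5, 0.3, [3_100_000], True, True),        # too incomplete
    Bin("bin_d", 97.1, 6.4, [1_800_000], True, False),       # too contaminated
]

high_quality = [b.name for b in bins if is_high_quality(b)]
print(high_quality)
```

Keeping each threshold as a separate clause makes it straightforward to adjust one criterion, for example if the length requirement is instead meant to apply to the total bin size.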

Sponsoring Organization:
USDOE
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
2432477
Journal Information:
Journal Name: mSystems; Journal Volume: 9; Journal Issue: 9; ISSN 2379-5077
Publisher:
American Society for Microbiology
Country of Publication:
United States
Language:
English
