Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Consistent Metagenome-Derived Metrics Verify and Delineate Bacterial Species Boundaries

Journal Article · · mSystems
 [1];  [1];  [1];  [1];  [1];  [2]
  1. Univ. of California, Berkeley, CA (United States)
  2. Univ. of California, Berkeley, CA (United States); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States); Chan Zuckerberg Biohub, San Francisco, CA (United States)
Longstanding questions relate to the existence of naturally distinct bacterial species and genetic approaches to distinguish them. Bacterial genomes in public databases form distinct groups, but these databases are subject to isolation and deposition biases. To avoid these biases, we compared 5,203 bacterial genomes from 1,457 environmental metagenomic samples to test for distinct clouds of diversity and evaluated metrics that could be used to define the species boundary. Bacterial genomes from the human gut, soil, and the ocean all exhibited gaps in whole-genome average nucleotide identities (ANI) near the previously suggested species threshold of 95% ANI. While genome-wide ratios of nonsynonymous and synonymous nucleotide differences (dN/dS) decrease until ANI values approach ~98%, two methods for estimating homologous recombination approached zero at ~95% ANI, supporting breakdown of recombination due to sequence divergence as a species-forming force. We evaluated 107 genome-based metrics for their ability to distinguish species when full genomes are not recovered. Full-length 16S rRNA genes were least useful, in part because they were underrecovered from metagenomes. However, many ribosomal proteins displayed both high metagenomic recoverability and species discrimination power. Taken together, our results verify the existence of sequence-discrete microbial species in metagenome-derived genomes and highlight the usefulness of ribosomal genes for gene-level species discrimination.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
Alfred P. Sloan Foundation; National Institutes of Health (NIH); National Science Foundation (NSF); USDOE Office of Science (SC), Biological and Environmental Research (BER)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1615290
Journal Information:
mSystems, Journal Name: mSystems Journal Issue: 1 Vol. 5; ISSN 2379-5077
Publisher:
American Society for MicrobiologyCopyright Statement
Country of Publication:
United States
Language:
English

References (55)

16S rRNA Gene Copy Number Normalization Does Not Provide More Reliable Conclusions in Metataxonomic Surveys journal August 2020
Biases during DNA extraction of activated sludge samples revealed by high throughput sequencing journal July 2012
Extensive Unexplored Human Microbiome Diversity Revealed by Over 150,000 Genomes from Metagenomes Spanning Age, Geography, and Lifestyle journal January 2019
A Reverse Ecology Approach Based on a Biological Definition of Microbial Populations journal August 2019
Systematics: The Cohesive Nature of Bacterial Species Taxa journal March 2019
Comparisons of dN/dS are time dependent for closely related bacterial genomes journal March 2006
Genomic patterns of recombination, clonal divergence and environment in marine microbial populations journal June 2008
Individual genome assembly from complex community short-read metagenomic datasets journal October 2011
dRep: a tool for fast and accurate genomic comparisons that enables improved genome recovery from metagenomes through de-replication journal July 2017
Community structure and metabolism through reconstruction of microbial genomes from the environment journal February 2004
Unusual biology across a group comprising more than 15% of domain Bacteria journal June 2015
Mobile genes in the human microbiome are structured from global to individual scales journal July 2016
Genome sequences of rare, uncultured bacteria obtained by differential coverage binning of multiple metagenomes journal May 2013
A new view of the tree of life journal April 2016
Re-evaluating prokaryotic species journal August 2005
High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries journal November 2018
Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life journal September 2017
Differential depth distribution of microbial function and putative symbionts through sediment-hosted aquifers in the deep terrestrial subsurface journal January 2018
Mediterranean grassland soil C–N compound turnover is dependent on rainfall and depth, and is mediated by genomically divergent microorganisms journal May 2019
Genome-centric view of carbon processing in thawing permafrost journal July 2018
New insights from uncultivated genomes of the global human gut microbiome journal March 2019
Microbial Dysbiosis During Simian Immunodeficiency Virus Infection is Partially Reverted with Combination Anti-retroviral Therapy journal April 2020
The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans journal January 2018
Genomic insights that advance the species definition for prokaryotes journal February 2005
Molecular keys to speciation: DNA polymorphism and the control of genetic exchange in enterobacteria journal September 1997
Biopython: freely available Python tools for computational molecular biology and bioinformatics journal March 2009
genoPlotR: comparative gene and genome visualization in R journal July 2010
Search and clustering orders of magnitude faster than BLAST journal August 2010
NSimScan: DNA comparison tool with increased speed, sensitivity and accuracy journal March 2016
Updating the 97% identity threshold for 16S ribosomal RNA OTUs journal February 2018
DNA Sequence Similarity Requirements for Interspecific Recombination in Bacillus journal December 1999
ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data journal February 2016
Ribosomal Database Project: data and tools for high throughput rRNA analysis journal November 2013
DNA–DNA hybridization values and their relationship to whole-genome sequence similarities journal January 2007
The Reconstruction of 2,631 Draft Metagenome-Assembled Genomes from the Global Oceans posted_content July 2017
Microbial Speciation journal September 2015
CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes journal May 2015
Matplotlib: A 2D Graphics Environment journal January 2007
Metagenomic analysis of a high carbon dioxide subsurface microbial community populated by chemolithoautotrophs and bacteria and archaea from candidate phyla: High CO journal April 2015
Bacterial species may exist, metagenomics reveal: Bacterial species may exist journal December 2011
The dynamic genetic repertoire of microbial communities journal January 2009
Necrotizing enterocolitis is preceded by increased gut bacterial replication, Klebsiella , and fimbriae-encoding bacteria journal December 2019
Recombination and the Nature of Bacterial Speciation journal January 2007
Population Genomics of Early Events in the Ecological Differentiation of Bacteria journal April 2012
Introducing mothur: Open-Source, Platform-Independent, Community-Supported Software for Describing and Comparing Microbial Communities journal October 2009
Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB journal July 2006
Towards a Genome-Based Taxonomy for Prokaryotes journal September 2005
What are Bacterial Species? journal October 2002
Prodigal: prokaryotic gene recognition and translation initiation site identification journal March 2010
Marker genes that are less conserved in their sequences are useful for predicting genome-wide similarity levels between closely related prokaryotic strains journal May 2016
Patterns of Gene Flow Define Species of Thermophilic Archaea journal February 2012
Back to Basics – The Influence of DNA Extraction and Primer Choice on Phylogenetic Analysis of Activated Sludge Communities journal July 2015
Genetic Exchange Across a Species Boundary in the Archaeal Genus Ferroplasma journal July 2007
Carbon and Sulfur Cycling below the Chemocline in a Meromictic Lake and the Identification of a Novel Taxonomic Lineage in the FCB Superphylum, Candidatus Aegiribacteria journal April 2016
The Reconstruction of 2,631 Draft Metagenome-Assembled Genomes from the Global Oceans dataset January 2017

Cited By (1)

Genomic diversity affects the accuracy of bacterial single-nucleotide polymorphism–calling pipelines journal February 2020

Figures / Tables (4)


Similar Records

Biases in genome reconstruction from metagenomic data
Journal Article · Thu Oct 29 20:00:00 EDT 2020 · PeerJ · OSTI ID:1693794