DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Marker genes that are less conserved in their sequences are useful for predicting genome-wide similarity levels between closely related prokaryotic strains

Abstract

The 16s rRNA gene is so far the most widely used marker for taxonomical classification and separation of prokaryotes. Since it is universally conserved among prokaryotes, it is possible to use this gene to classify a broad range of prokaryotic organisms. At the same time, it has often been noted that the 16s rRNA gene is too conserved to separate between prokaryotes at finer taxonomic levels. In this paper, we examine how well levels of similarity of 16s rRNA and 73 additional universal or nearly universal marker genes correlate with genome-wide levels of gene sequence similarity. We demonstrate that the percent identity of 16s rRNA predicts genome-wide levels of similarity very well for distantly related prokaryotes, but not for closely related ones. In closely related prokaryotes, we find that there are many other marker genes for which levels of similarity are much more predictive of genome-wide levels of gene sequence similarity. Finally, we show that the identities of the markers that are most useful for predicting genome-wide levels of similarity within closely related prokaryotic lineages vary greatly between lineages. However, the most useful markers are always those that are least conserved in their sequences within each lineage. In conclusion, ourmore » results show that by choosing markers that are less conserved in their sequences within a lineage of interest, it is possible to better predict genome-wide gene sequence similarity between closely related prokaryotes than is possible using the 16s rRNA gene. We point readers towards a database we have created (POGO-DB) that can be used to easily establish which markers show lowest levels of sequence conservation within different prokaryotic lineages.« less

Authors:
; ;
Publication Date:
Research Org.:
Drexel Univ., Philadelphia, PA (United States)
Sponsoring Org.:
USDOE Office of Science (SC), Biological and Environmental Research (BER)
OSTI Identifier:
1618942
Alternate Identifier(s):
OSTI ID: 1282888
Grant/Contract Number:  
SC0004335
Resource Type:
Published Article
Journal Name:
Microbiome
Additional Journal Information:
Journal Name: Microbiome Journal Volume: 4 Journal Issue: 1; Journal ID: ISSN 2049-2618
Publisher:
Springer Science + Business Media
Country of Publication:
United Kingdom
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; Less-conserved genes; Lineage-specific; Marker genes; Genome-wide similarity; ribosomal-rna gene; microbial communities; gut microbiome; database; diversity; bacterial; metagenomics; reconstruction; identification; accurate

Citation Formats

Lan, Yemin, Rosen, Gail, and Hershberg, Ruth. Marker genes that are less conserved in their sequences are useful for predicting genome-wide similarity levels between closely related prokaryotic strains. United Kingdom: N. p., 2016. Web. doi:10.1186/s40168-016-0162-5.
Lan, Yemin, Rosen, Gail, & Hershberg, Ruth. Marker genes that are less conserved in their sequences are useful for predicting genome-wide similarity levels between closely related prokaryotic strains. United Kingdom. https://doi.org/10.1186/s40168-016-0162-5
Lan, Yemin, Rosen, Gail, and Hershberg, Ruth. Tue . "Marker genes that are less conserved in their sequences are useful for predicting genome-wide similarity levels between closely related prokaryotic strains". United Kingdom. https://doi.org/10.1186/s40168-016-0162-5.
@article{osti_1618942,
title = {Marker genes that are less conserved in their sequences are useful for predicting genome-wide similarity levels between closely related prokaryotic strains},
author = {Lan, Yemin and Rosen, Gail and Hershberg, Ruth},
abstractNote = {The 16s rRNA gene is so far the most widely used marker for taxonomical classification and separation of prokaryotes. Since it is universally conserved among prokaryotes, it is possible to use this gene to classify a broad range of prokaryotic organisms. At the same time, it has often been noted that the 16s rRNA gene is too conserved to separate between prokaryotes at finer taxonomic levels. In this paper, we examine how well levels of similarity of 16s rRNA and 73 additional universal or nearly universal marker genes correlate with genome-wide levels of gene sequence similarity. We demonstrate that the percent identity of 16s rRNA predicts genome-wide levels of similarity very well for distantly related prokaryotes, but not for closely related ones. In closely related prokaryotes, we find that there are many other marker genes for which levels of similarity are much more predictive of genome-wide levels of gene sequence similarity. Finally, we show that the identities of the markers that are most useful for predicting genome-wide levels of similarity within closely related prokaryotic lineages vary greatly between lineages. However, the most useful markers are always those that are least conserved in their sequences within each lineage. In conclusion, our results show that by choosing markers that are less conserved in their sequences within a lineage of interest, it is possible to better predict genome-wide gene sequence similarity between closely related prokaryotes than is possible using the 16s rRNA gene. We point readers towards a database we have created (POGO-DB) that can be used to easily establish which markers show lowest levels of sequence conservation within different prokaryotic lineages.},
doi = {10.1186/s40168-016-0162-5},
journal = {Microbiome},
number = 1,
volume = 4,
place = {United Kingdom},
year = {Tue May 03 00:00:00 EDT 2016},
month = {Tue May 03 00:00:00 EDT 2016}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record
https://doi.org/10.1186/s40168-016-0162-5

Citation Metrics:
Cited by: 42 works
Citation information provided by
Web of Science

Figures / Tables:

Fig. 1 Fig. 1: Box and whisker plot of 16s rRNA percent identities between genomes of the same taxa (species/genus/etc.). The whiskers represent boundaries within 1.5 interquartile range of the lower and upper quartiles. Although the 16s rRNA genes are more similar at finer taxonomic levels, there are still large overlaps betweenmore » taxonomic classification levels and outliers within each level. Therefore, the 16s rRNA gene is sometimes an ambiguous marker in taxonomic classification« less

Save / Share:

Works referenced in this record:

Oligotyping: differentiating between closely related microbial taxa using 16S rRNA gene data
journal, October 2013

  • Eren, A. Murat; Maignien, Loïs; Sul, Woo Jun
  • Methods in Ecology and Evolution, Vol. 4, Issue 12
  • DOI: 10.1111/2041-210X.12114

Enterotypes of the human gut microbiome
journal, April 2011

  • Arumugam, Manimozhiyan; Raes, Jeroen; Pelletier, Eric
  • Nature, Vol. 473, Issue 7346
  • DOI: 10.1038/nature09944

Phylogenomic analysis of bacterial and archaeal sequences with AMPHORA2
journal, February 2012


Metagenomics: Genomic Analysis of Microbial Communities
journal, December 2004


The Human Microbiome Project: A Community Resource for the Healthy Human Microbiome
journal, August 2012


Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB
journal, July 2006

  • DeSantis, T. Z.; Hugenholtz, P.; Larsen, N.
  • Applied and Environmental Microbiology, Vol. 72, Issue 7, p. 5069-5072
  • DOI: 10.1128/AEM.03006-05

The Variability of the 16S rRNA Gene in Bacterial Genomes and Its Consequences for Bacterial Community Analyses
journal, February 2013


A Comparison of rpoB and 16S rRNA as Markers in Pyrosequencing Studies of Bacterial Diversity
journal, February 2012


Identification of common molecular subsequences
journal, March 1981


Using the Basic Local Alignment Search Tool (BLAST)
journal, July 2007

  • Mount, David W.
  • Cold Spring Harbor Protocols, Vol. 2007, Issue 7
  • DOI: 10.1101/pdb.top17

TOPD/FMTS: a new software to compare phylogenetic trees
journal, April 2007


Assessing diversity of the female urine microbiota by high throughput sequencing of 16S rDNA amplicons
journal, January 2011

  • Siddiqui, Huma; Nederbragt, Alexander J.; Lagesen, Karin
  • BMC Microbiology, Vol. 11, Issue 1
  • DOI: 10.1186/1471-2180-11-244

Deep Sequencing of the Oral Microbiome Reveals Signatures of Periodontal Disease
journal, June 2012


cpnDB: A Chaperonin Sequence Database
journal, August 2004


Toward Automatic Reconstruction of a Highly Resolved Tree of Life
journal, March 2006


Species Identification and Profiling of Complex Microbial Communities Using Shotgun Illumina Sequencing of 16S rRNA Amplicon Sequences
journal, April 2013


A human gut microbial gene catalogue established by metagenomic sequencing
journal, March 2010

  • Qin, Junjie; Li, Ruiqiang; Raes, Jeroen
  • Nature, Vol. 464, Issue 7285
  • DOI: 10.1038/nature08821

Predicting relatedness of bacterial genomes using the chaperonin-60 universal target (cpn60 UT): Application to Thermoanaerobacter species
journal, May 2011

  • Verbeke, Tobin J.; Sparling, Richard; Hill, Janet E.
  • Systematic and Applied Microbiology, Vol. 34, Issue 3
  • DOI: 10.1016/j.syapm.2010.11.019

Bioinformatics for Whole-Genome Shotgun Sequencing of Microbial Communities
journal, January 2005


EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data
journal, May 2011

  • Miller, Christopher S.; Baker, Brett J.; Thomas, Brian C.
  • Genome Biology, Vol. 12, Issue 5
  • DOI: 10.1186/gb-2011-12-5-r44

The Earth Microbiome project: successes and aspirations
journal, August 2014


Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
journal, January 2011


Genomic variation landscape of the human gut microbiome
journal, December 2012

  • Schloissnig, Siegfried; Arumugam, Manimozhiyan; Sunagawa, Shinichi
  • Nature, Vol. 493, Issue 7430
  • DOI: 10.1038/nature11711

Metagenomics Using Next-Generation Sequencing
book, January 2014


Photosynthetic and Phylogenetic Primers for Detection of Anoxygenic Phototrophs in Natural Environments
journal, July 2001


Use of 16S rRNA and rpoB Genes as Molecular Markers for Microbial Ecology Studies
journal, October 2006

  • Case, R. J.; Boucher, Y.; Dahllof, I.
  • Applied and Environmental Microbiology, Vol. 73, Issue 1
  • DOI: 10.1128/AEM.01177-06

Toward an Efficient Method of Identifying Core Genes for Evolutionary and Functional Microbial Phylogenies
journal, September 2011


A core gut microbiome in obese and lean twins
journal, November 2008

  • Turnbaugh, Peter J.; Hamady, Micah; Yatsunenko, Tanya
  • Nature, Vol. 457, Issue 7228
  • DOI: 10.1038/nature07540

Revisiting bacterial phylogeny: Natural and experimental evidence for horizontal gene transfer of 16S rRNA
journal, January 2013

  • Kitahara, Kei; Miyazaki, Kentaro
  • Mobile Genetic Elements, Vol. 3, Issue 1
  • DOI: 10.4161/mge.24210

Metagenomics: DNA sequencing of environmental samples
journal, October 2005

  • Tringe, Susannah Green; Rubin, Edward M.
  • Nature Reviews Genetics, Vol. 6, Issue 11
  • DOI: 10.1038/nrg1709

The SILVA ribosomal RNA gene database project: improved data processing and web-based tools
journal, November 2012

  • Quast, Christian; Pruesse, Elmar; Yilmaz, Pelin
  • Nucleic Acids Research, Vol. 41, Issue D1
  • DOI: 10.1093/nar/gks1219

The Ribosomal Database Project: improved alignments and new tools for rRNA analysis
journal, January 2009

  • Cole, J. R.; Wang, Q.; Cardenas, E.
  • Nucleic Acids Research, Vol. 37, Issue Database
  • DOI: 10.1093/nar/gkn879

GenBank
journal, December 2004

  • Benson, D. A.
  • Nucleic Acids Research, Vol. 33, Issue Database issue
  • DOI: 10.1093/nar/gki063

PhylOTU: A High-Throughput Procedure Quantifies Microbial Community Diversity and Resolves Novel Taxa from Metagenomic Data
journal, January 2011

  • Sharpton, Thomas J.; Riesenfeld, Samantha J.; Kembel, Steven W.
  • PLoS Computational Biology, Vol. 7, Issue 1
  • DOI: 10.1371/journal.pcbi.1001061

Towards a Genome-Based Taxonomy for Prokaryotes
journal, September 2005


Ribosomal RNA diversity predicts genome diversity in gut bacteria and their relatives
journal, March 2010

  • Zaneveld, Jesse R.; Lozupone, Catherine; Gordon, Jeffrey I.
  • Nucleic Acids Research, Vol. 38, Issue 12
  • DOI: 10.1093/nar/gkq066

Metagenomic species profiling using universal phylogenetic marker genes
journal, October 2013

  • Sunagawa, Shinichi; Mende, Daniel R.; Zeller, Georg
  • Nature Methods, Vol. 10, Issue 12
  • DOI: 10.1038/nmeth.2693

Saliva microbiomes distinguish caries-active from healthy human populations
journal, June 2011


A meta-analysis of changes in bacterial and archaeal communities with time
journal, April 2013

  • Shade, Ashley; Gregory Caporaso, J.; Handelsman, Jo
  • The ISME Journal, Vol. 7, Issue 8
  • DOI: 10.1038/ismej.2013.54

A simple, fast, and accurate method of phylogenomic inference
journal, January 2008


A general method applicable to the search for similarities in the amino acid sequence of two proteins
journal, March 1970


The Chaperonin-60 Universal Target Is a Barcode for Bacteria That Enables De Novo Assembly of Metagenomic Sequence Data
journal, November 2012


Reduced selection leads to accelerated gene loss in Shigella
journal, January 2007


PhyloSift: phylogenetic analysis of genomes and metagenomes
journal, January 2014

  • Darling, Aaron E.; Jospin, Guillaume; Lowe, Eric
  • PeerJ, Vol. 2
  • DOI: 10.7717/peerj.243

WGSQuikr: Fast Whole-Genome Shotgun Metagenomic Classification
journal, March 2014


The COG database: an updated version includes eukaryotes
journal, January 2003

  • Tatusov, Roman L.; Fedorova, Natalie D.; Jackson, John D.
  • BMC Bioinformatics, Vol. 4, Article No. 41
  • DOI: 10.1186/1471-2105-4-41

Rare Events of Intragenus and Intraspecies Horizontal Transfer of the 16S rRNA Gene
journal, July 2015

  • Tian, Ren-Mao; Cai, Lin; Zhang, Wei-Peng
  • Genome Biology and Evolution, Vol. 7, Issue 8
  • DOI: 10.1093/gbe/evv143

16S rRNA Gene Sequencing for Bacterial Identification in the Diagnostic Laboratory: Pluses, Perils, and Pitfalls
journal, July 2007

  • Janda, J. M.; Abbott, S. L.
  • Journal of Clinical Microbiology, Vol. 45, Issue 9
  • DOI: 10.1128/JCM.01228-07

Metagenomic microbial community profiling using unique clade-specific marker genes
journal, June 2012

  • Segata, Nicola; Waldron, Levi; Ballarini, Annalisa
  • Nature Methods, Vol. 9, Issue 8
  • DOI: 10.1038/nmeth.2066

Gene sequences useful for predicting relatedness of whole genomes in bacteria
journal, November 2003

  • Zeigler, D. R.
  • INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY, Vol. 53, Issue 6
  • DOI: 10.1099/ijs.0.02713-0

Human gut microbes associated with obesity
journal, December 2006

  • Ley, Ruth E.; Turnbaugh, Peter J.; Klein, Samuel
  • Nature, Vol. 444, Issue 7122
  • DOI: 10.1038/4441022a

PhyloSift: Phylogenetic analysis of genomes and metagenomes
dataset, January 2013


Using QIIME to Analyze 16S rRNA Gene Sequences from Microbial Communities
journal, December 2011

  • Kuczynski, Justin; Stombaugh, Jesse; Walters, William Anton
  • Current Protocols in Bioinformatics, Vol. 36, Issue 1
  • DOI: 10.1002/0471250953.bi1007s36

Accurate and fast estimation of taxonomic profiles from metagenomic shotgun sequences
journal, September 2011


Works referencing / citing this record:

Rapid Genetic Adaptation during the First Four Months of Survival under Resource Exhaustion
journal, March 2017

  • Avrani, Sarit; Bolotin, Evgeni; Katz, Sophia
  • Molecular Biology and Evolution, Vol. 34, Issue 7
  • DOI: 10.1093/molbev/msx118

Iroki: automatic customization and visualization of phylogenetic trees
journal, September 2019

  • Moore, Ryan M.; Harrison, Amelia O.; McAllister, Sean M.
  • PeerJ
  • DOI: 10.1101/106138

Understanding and overcoming the pitfalls and biases of next-generation sequencing (NGS) methods for use in the routine clinical microbiological diagnostic laboratory
journal, March 2019

  • Boers, Stefan A.; Jansen, Ruud; Hays, John P.
  • European Journal of Clinical Microbiology & Infectious Diseases, Vol. 38, Issue 6
  • DOI: 10.1007/s10096-019-03520-3

Glutamine synthetase type I (glnAI) represents a rewarding molecular marker in the classification of bifidobacteria and related genera
journal, May 2019


Unveiling Plant-Beneficial Function as Seen in Bacteria Genes from Termite Mound Soil
journal, January 2020

  • Enagbonma, Ben Jesuorsemwen; Babalola, Olubukola Oluranti
  • Journal of Soil Science and Plant Nutrition, Vol. 20, Issue 2
  • DOI: 10.1007/s42729-019-00124-w

The vaginal microbiome of pregnant women is less rich and diverse, with lower prevalence of Mollicutes, compared to non-pregnant women
journal, August 2017


A comparative assessment of conventional and molecular methods, including MinION nanopore sequencing, for surveying water quality
journal, October 2019


Generating amplicon reads for microbial community assessment with next‐generation sequencing
journal, August 2019

  • Gołębiewski, M.; Tretyn, A.
  • Journal of Applied Microbiology, Vol. 128, Issue 2
  • DOI: 10.1111/jam.14380

Comprehensive benchmarking and ensemble approaches for metagenomic classifiers
journal, September 2017


Absolute quantitation of microbiota abundance in environmental samples
journal, June 2018


A Literature Review of Metagenomics and Culturomics of the Peri-implant Microbiome: Current Evidence and Future Perspectives
journal, September 2019

  • Martellacci, Leonardo; Quaranta, Gianluca; Patini, Romeo
  • Materials, Vol. 12, Issue 18
  • DOI: 10.3390/ma12183010

Figures/Tables have been extracted from DOE-funded journal article accepted manuscripts.