Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Large-scale 16S gene assembly using metagenomics shotgun sequences

Journal Article · · Bioinformatics
 [1];  [2];  [3];  [4];  [5]
  1. Xiamen Univ. (China)
  2. Tsinghua Univ., Beijing (China)
  3. Department of Automation, Xiamen University, Xiamen, Fujian, China
  4. Univ. of Oklahoma, Norman, OK (United States); Tsinghua Univ., Beijing (China); Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States)
  5. Tsinghua Univ., Beijing (China); Univ. of Southern California, Los Angeles, CA (United States)
Combining a 16S rRNA (16S) gene database with metagenomic shotgun sequences allows unbiased identification of known and novel microbes.To attain this, we herein report reference-based ribosome assembly (RAMBL), a computational pipeline, which integrates taxonomic tree search and Dirichlet process clustering to reconstruct full-length 16S gene sequences from metagenomic sequencing data with high accuracy. By benchmarking against the synthetic and real shotgun sequences, we demonstrated that full-length 16S gene assemblies of RAMBL were a good proxy for known and putative microbes, including Candidate Phyla Radiation. Here, we found that 30-40% of bacteria genera in the terrestrial and intestinal biomes have no closely related genome sequences. We also observed that RAMBL was able to generate a more accurate determination of environmental microbial diversity and yield better disease classification, suggesting that full-length 16S gene assemblies are a powerful alternative to marker gene set and 16S short reads. RAMBL first realizes the access to full-length 16S gene sequences in the near-terabase-scale metagenomic shotgun sequences, which markedly improve metagenomic data analysis and interpretation.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
Sponsoring Organization:
National Natural Science Foundation of China (NNSFC); USDOE Office of Science (SC)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1567080
Journal Information:
Bioinformatics, Journal Name: Bioinformatics Journal Issue: 10 Vol. 33; ISSN 1367-4803
Publisher:
Oxford University PressCopyright Statement
Country of Publication:
United States
Language:
English

References (40)

High-Definition Reconstruction of Clonal Composition in Cancer journal June 2014
High-resolution phylogenetic microbial community profiling journal February 2016
A human gut microbial gene catalogue established by metagenomic sequencing journal March 2010
Cross-biome metagenomic analyses of soil microbial communities and their functional attributes journal December 2012
FastTree: Computing Large Minimum Evolution Trees with Profiles instead of a Distance Matrix journal April 2009
rrndb: the Ribosomal RNA Operon Copy Number Database journal January 2001
SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB journal November 2007
Mason text January 2010
Insights into the Role of Erysipelotrichaceae in the Human Host journal November 2015
A detailed analysis of 16S ribosomal RNA gene segments for the diagnosis of pathogenic bacteria journal May 2007
Diet is a major factor governing the fecal butyrate-producing community structure across Mammalia, Aves and Reptilia journal October 2014
A framework for human microbiome research journal June 2012
A metagenome-wide association study of gut microbiota in type 2 diabetes journal September 2012
Genomic variation landscape of the human gut microbiome journal December 2012
Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes journal July 2014
Fast gapped-read alignment with Bowtie 2 journal March 2012
Metagenomic species profiling using universal phylogenetic marker genes journal October 2013
MetaPhlAn2 for enhanced metagenomic taxonomic profiling journal September 2015
QIIME allows analysis of high-throughput community sequencing data journal April 2010
Metagenomics uncovers gaps in amplicon-based detection of microbial diversity journal February 2016
Sequencing and beyond: integrating molecular 'omics' for microbial community profiling journal April 2015
The diversity and biogeography of soil bacterial communities journal January 2006
A Greedy Algorithm for Aligning DNA Sequences journal February 2000
Probabilistic Inference of Viral Quasispecies Subject to Recombination journal February 2013
Multiple sequence alignment using partial order graphs journal March 2002
Infernal 1.0: inference of RNA alignments journal March 2009
The Sequence Alignment/Map format and SAMtools journal June 2009
Reconstructing 16S rRNA genes in metagenomic data journal June 2015
MUSCLE: multiple sequence alignment with high accuracy and high throughput journal March 2004
Ribosomal Database Project: data and tools for high throughput rRNA analysis journal November 2013
Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons journal January 2011
Genomes from Metagenomics journal November 2013
Structure and function of the global ocean microbiome journal May 2015
Naive Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy journal June 2007
Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB journal July 2006
High-Throughput Metagenomic Technologies for Complex Microbial Community Analysis: Open and Closed Formats journal January 2015
EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data journal May 2011
The “Most Wanted” Taxa from the Human Microbiome for Whole Genome Sequencing journal July 2012
Parallel-META 2.0: Enhanced Metagenomic Data Analysis with Functional Annotation, High Performance Computing and Advanced Visualization journal March 2014
of EMIRGE: reconstruction of full-length ribosomal genes from microbial community short read sequencing data image January 2019

Cited By (3)

An Integrated Metagenome Catalog Reveals New Insights into the Murine Gut Microbiome text January 2020
An integrated metagenome catalog reveals novel insights into the murine gut microbiome posted_content January 2019
Sequence and cultivation study of Muribaculaceae reveals novel species, host preference, and functional potential of this yet undescribed family journal February 2019

Figures / Tables (5)


Similar Records

Improved Microbial Community Characterization of 16S rRNA via Metagenome Hybridization Capture Enrichment
Journal Article · Mon Apr 26 20:00:00 EDT 2021 · Frontiers in Microbiology · OSTI ID:1849075

High-resolution phylogenetic microbial community profiling
Conference · Mon Mar 17 00:00:00 EDT 2014 · OSTI ID:1241181

Related Subjects