Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Metagenomic compendium of 189,680 DNA viruses from the human gut microbiome

Journal Article · · Nature Microbiology
 [1];  [2];  [2];  [3];  [4];  [2];  [5];  [6];  [7];  [3];  [2]
  1. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint BioEnergy Institute and Environmental Genomics and Systems Biology Division; USDOE Joint Genome Institute (JGI), Berkeley, CA (United States); OSTI
  2. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint BioEnergy Institute and Environmental Genomics and Systems Biology Division; USDOE Joint Genome Institute (JGI), Berkeley, CA (United States)
  3. Univ. of Queensland, Brisbane, QLD (Australia). School of Chemistry and Molecular Biosciences. Australian Centre for Ecogenomics
  4. Stanford Univ., CA (United States). Dept. of Medicine (Hematology); Stanford Univ., CA (United States). Dept. of Genetics
  5. PolyBio Research Foundation, Kenmore, WA (United States)
  6. Stanford Univ., CA (United States). Dept. of Bioengineering; Stanford Univ., CA (United States). Dept. of Microbiology and Immunology; Stanford Univ., CA (United States). ChEM-H Inst.; Chan Zuckerberg Biohub, San Francisco, CA (United States)
  7. Stanford Univ., CA (United States). Dept. of Medicine (Hematology); Stanford Univ., CA (United States). Dept. of Genetics

Bacteriophages have important roles in the ecology of the human gut microbiome but are under-represented in reference databases. To address this problem, we assembled the Metagenomic Gut Virus catalogue that comprises 189,680 viral genomes from 11,810 publicly available human stool metagenomes. Over 75% of genomes represent double-stranded DNA phages that infect members of the Bacteroidia and Clostridia classes. Based on sequence clustering we identified 54,118 candidate viral species, 92% of which were not found in existing databases. The Metagenomic Gut Virus catalogue improves detection of viruses in stool metagenomes and accounts for nearly 40% of CRISPR spacers found in human gut Bacteria and Archaea. We also produced a catalogue of 459,375 viral protein clusters to explore the functional potential of the gut virome. This revealed tens of thousands of diversity-generating retroelements, which use error-prone reverse transcription to mutate target genes and may be involved in the molecular arms race between phages and their bacterial hosts.

Sponsoring Organization:
USDOE Office of Science (SC); Autoimmunity Research Foundation; Australian Research Council Laureate Fellowship; National Institutes of Health (NIH)
Grant/Contract Number:
AC02-05CH11231
OSTI ID:
1815925
Journal Information:
Nature Microbiology, Journal Name: Nature Microbiology Journal Issue: 7 Vol. 6; ISSN 2058-5276
Publisher:
Nature Publishing GroupCopyright Statement
Country of Publication:
United States
Language:
English

References (91)

Extensive Unexplored Human Microbiome Diversity Revealed by Over 150,000 Genomes from Metagenomes Spanning Age, Geography, and Lifestyle journal January 2019
Massive expansion of human gut bacteriophage diversity journal February 2021
Biology and Taxonomy of crAss-like Bacteriophages, the Most Abundant Virus in the Human Gut journal November 2018
Expansion of Bacteriophages Is Linked to Aggravated Intestinal Inflammation and Colitis journal February 2019
CRISPR-Cas System of a Prevalent Human Gut Bacterium Reveals Hyper-targeting against Phages in a Human Virome Catalog journal September 2019
The Human Gut Virome Is Highly Diverse, Stable, and Individual Specific journal October 2019
Whole-Virome Analysis Sheds Light on Viral Dark Matter in Inflammatory Bowel Disease journal December 2019
The Gut Virome Database Reveals Age-Dependent Patterns of Virome Diversity in the Human Gut journal November 2020
Ig-Like Domains on Bacteriophages: A Tale of Promiscuity and Deceit journal June 2006
Embracing the enemy: the diversification of microbial gene repertoires by phage-mediated horizontal gene transfer journal August 2017
Antibiotic resistance and extended spectrum beta-lactamases: Types, epidemiology and treatment journal January 2015
Improved annotation of antibiotic resistance determinants reveals microbial resistomes cluster by ecology journal July 2014
Phages rarely encode antibiotic resistance genes: a cautionary tale for virome analyses journal June 2016
A human gut microbial gene catalogue established by metagenomic sequencing journal March 2010
Viruses in the faecal microbiota of monozygotic twins and their mothers journal July 2010
Enterotypes of the human gut microbiome journal April 2011
A framework for human microbiome research journal June 2012
Uncovering Earth’s virome journal August 2016
An integrated catalog of reference genes in the human gut microbiome journal July 2014
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea journal August 2017
A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life journal August 2018
Minimum Information about an Uncultivated Virus Genome (MIUViG) journal December 2018
Major bacterial lineages are essentially devoid of CRISPR-Cas viral defence systems journal February 2016
Genome signature-based dissection of human gut metagenomes to extract subliminal viral sequences journal September 2013
A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes journal July 2014
Fast and sensitive protein alignment using DIAMOND journal November 2014
Explaining microbial population genomics through phage predation journal November 2009
ΦCrAss001 represents the most abundant bacteriophage family in the human gut and infects Bacteroides intestinalis journal November 2018
Analysis of metagenome-assembled viral genomes from the human gut reveals diverse putative CrAss-like phages with unique genomic features journal February 2021
Megaphages infect Prevotella and variants are widespread in gut microbiomes journal January 2019
Evaluation of a concatenated protein phylogeny for classification of tailed double-stranded DNA viruses belonging to the order Caudovirales journal May 2019
A new genomic blueprint of the human gut microbiota journal February 2019
New insights from uncultivated genomes of the global human gut microbiome journal March 2019
Clades of huge phages from across Earth’s ecosystems journal February 2020
Detecting contamination in viromes using ViromeQC journal November 2019
CheckV assesses the quality and completeness of metagenome-assembled viral genomes journal December 2020
A unified catalog of 204,938 reference genomes from the human gut microbiome journal July 2020
Modular approach to customise sample preparation procedures for viral metagenomics: a reproducible protocol for virome analysis journal November 2015
The Human Intestinal Microbiome in Health and Disease journal December 2016
Rapid evolution of the human gut virome journal July 2013
Pervasive domestication of defective prophages by bacteria journal August 2014
trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses journal June 2009
Gene and translation initiation site prediction in metagenomic sequences journal July 2012
MMseqs software suite for fast and deep clustering and searching of large protein sequence sets journal January 2016
DeepGOPlus: improved protein function prediction from sequence journal July 2019
GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database journal November 2019
Single-stranded DNA phages: from early molecular biology tools to recent revolutions in environmental microbiology journal February 2016
Computational approaches to predict bacteriophage–host relationships journal December 2015
KEGG: Kyoto Encyclopedia of Genes and Genomes journal January 2000
AcrFinder: genome mining anti-CRISPR operons in prokaryotes and their viruses journal May 2020
The TIGRFAMs database of protein families journal January 2003
BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata journal December 2011
IMG/VR v.2.0: an integrated data management and analysis system for cultivated and environmental viral genomes journal November 2018
The Pfam protein families database in 2019 journal October 2018
Interactive Tree Of Life (iTOL) v4: recent updates and new developments journal April 2019
Enteric Virome and Bacterial Microbiota in Children With Ulcerative Colitis and Crohn Disease journal January 2019
Assessment of viral community functional potential from viral metagenomes may be hampered by contamination with cellular sequences journal December 2013
The human gut virome: Inter-individual variation and dynamic response to diet journal August 2011
Reverse Transcriptase-Mediated Tropism Switching in Bordetella Bacteriophage journal March 2002
Stop codon reassignments in the wild journal May 2014
Direct CRISPR spacer acquisition from RNA by a natural reverse transcriptase-Cas1 fusion protein journal February 2016
Validating the AMRFinder Tool and Resistance Gene Database by Using Antimicrobial Resistance Genotype-Phenotype Correlations in a Collection of Isolates journal August 2019
Amplification Methods Bias Metagenomic Libraries of Uncultured Single-Stranded and Double-Stranded DNA Viruses journal September 2011
Identifying Active Phage Lysins through Functional Viral Metagenomics journal September 2010
Metagenomic Analyses of an Uncultured Viral Community from Human Feces journal October 2003
Prophage Genomics journal June 2003
BLAST+: architecture and applications journal January 2009
PILER-CR: Fast and accurate identification of CRISPR repeats journal January 2007
CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats journal June 2007
Analysis of 1321 Eubacterium rectale genomes from metagenomes uncovers complex phylogeographic population structure and subspecies functional adaptations journal June 2020
VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data journal July 2017
A human gut phage catalog correlates the gut phageome with type 2 diabetes journal February 2018
Reproducible protocols for metagenomic analysis of human faecal phageomes journal April 2018
Phages infecting Faecalibacterium prausnitzii belong to novel viral genera that help to decipher intestinal viromes journal April 2018
Evaluation of bias induced by viral enrichment and random amplification protocols in metagenomic surveys of saliva DNA viruses journal June 2018
Tracing mother-infant transmission of bacteriophages by means of a novel analytical tool for shotgun metagenomic datasets: METAnnotatorX journal August 2018
A diversity-generating retroelement encoded by a globally ubiquitous Bacteroides phage journal October 2018
Accelerated Profile HMM Searches journal October 2011
MUMmer4: A fast and versatile genome alignment system journal January 2018
FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments journal March 2010
Fast and Sensitive Alignment of Microbial Whole Genome Sequencing Reads to Large Sequence Datasets on a Desktop PC: Application to Metagenomic Datasets and Pathogen Identification journal July 2014
Identification of Diversity-Generating Retroelements in Human Microbiomes journal August 2014
Phage therapy: An alternative to antibiotics in the age of multi-drug resistance journal January 2017
Phages infecting Faecalibacterium prausnitzii belong to novel viral genera that help to decipher intestinal viromes collection January 2018
Evaluation of bias induced by viral enrichment and random amplification protocols in metagenomic surveys of saliva DNA viruses collection January 2018
Tracing mother-infant transmission of bacteriophages by means of a novel analytical tool for shotgun metagenomic datasets: METAnnotatorX collection January 2018
Analysis of 1321 Eubacterium rectale genomes from metagenomes uncovers complex phylogeographic population structure and subspecies functional adaptations collection January 2020
CRISPR-Cas system of a prevalent human gut bacterium reveals hyper-targeting against phages in a human virome catalog dataset January 2021
BACPHLIP: predicting bacteriophage lifestyle from conserved protein domains journal January 2021
Towards optimized viral metagenomes for double-stranded and single-stranded DNA viruses from challenging soils journal January 2019
VirSorter: mining viral signal from microbial genomic data journal January 2015

Cited By (15)

Taxonomy-aware, sequence similarity ranking reliably predicts phage–host relationships journal October 2021
Does Intestine Morphology Still Have Secrets to Reveal? A Proposal about the “Ghost” Layer of the Bowel journal June 2022
Additional file 2 of Presence and role of viruses in anaerobic digestion of food waste under environmental variability dataset January 2023
Additional file 2 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 3 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 4 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 5 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 6 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 7 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 8 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 9 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 10 of Maast: genotyping thousands of microbial strains efficiently dataset January 2023
Additional file 1 of Characterizations of the multi-kingdom gut microbiota in Chinese patients with gouty arthritis dataset January 2023
Additional file 1 of A compendium of ruminant gastrointestinal phage genomes revealed a higher proportion of lytic phages than in any other environments dataset January 2024
Additional file 1 of Characterizing the gut phageome and phage-borne antimicrobial resistance genes in pigs dataset January 2024

Similar Records

A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes
Journal Article · Thu Jul 24 00:00:00 EDT 2014 · Nature Communications · OSTI ID:1623951

Bacterial and Archaeal Viruses of Himalayan Hot Springs at Manikaran Modulate Host Genomes
Journal Article · Thu Dec 13 23:00:00 EST 2018 · Frontiers in Microbiology · OSTI ID:1628173

Presence and Persistence of Putative Lytic and Temperate Bacteriophages in Vaginal Metagenomes from South African Adolescents
Journal Article · Mon Nov 22 23:00:00 EST 2021 · Viruses · OSTI ID:1895334