skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Computational approaches to predict bacteriophage–host relationships

Journal Article · · FEMS Microbiology Reviews
 [1];  [2];  [3];  [4];  [5]
  1. San Diego State Univ., CA (United States). Dept. of Computer Science; Federal Univ. of Rio de Janeiro (Brazil). Dept. of Marine Biology, Institute of Biology; Argonne National Lab. (ANL), Argonne, IL (United States). Division of Mathematics and Computer Science
  2. San Diego State Univ., CA (United States). Dept. of Computer Science
  3. Rega Institute KU Leuven, Leuven (Belgium). Dept. of Microbiology and Immunology; VIB Center for the Biology of Disease, Leuven (Belgium); rije Universiteit Brussel, Pleinlaan, Brussels (Belgium). Laboratory of Microbiology
  4. Rega Institute KU Leuven, Leuven (Belgium). Dept. of Microbiology and Immunology; VIB Center for the Biology of Disease, Leuven (Belgium); rije Universiteit Brussel, Pleinlaan, Brussels (Belgium). Laboratory of Microbiology
  5. Federal Univ. of Rio de Janeiro (Brazil). Dept. of Marine Biology, Institute of Biology; Univ. of Utrecht (Netherlands). Theoretical Biology and Bioinformatics; Radboud Univ., Medical Center, Nijmegen (Netherlands). Centre for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences

Metagenomics has changed the face of virus discovery by enabling the accurate identification of viral genome sequences without requiring isolation of the viruses. As a result, metagenomic virus discovery leaves the first and most fundamental question about any novel virus unanswered: What host does the virus infect? The diversity of the global virosphere and the volumes of data obtained in metagenomic sequencing projects demand computational tools for virus–host prediction. We focus on bacteriophages (phages, viruses that infect bacteria), the most abundant and diverse group of viruses found in environmental metagenomes. By analyzing 820 phages with annotated hosts, we review and assess the predictive power of in silico phage–host signals. Sequence homology approaches are the most effective at identifying known phage–host pairs. Compositional and abundance-based methods contain significant signal for phage–host classification, providing opportunities for analyzing the unknowns in viral metagenomes. Together, these computational approaches further our knowledge of the interactions between phages and their hosts. Importantly, we find that all reviewed signals significantly link phages to their hosts, illustrating how current knowledge and insights about the interaction mechanisms and ecology of coevolving phages and bacteria can be exploited to predict phage–host relationships, with potential relevance for medical and industrial applications.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE; National Science Foundation (NSF); Netherlands Organization for Scientific Research (NWO)
Grant/Contract Number:
AC02-06CH11357; CNS-1305112; MCB-1330800; 864.14.004
OSTI ID:
1625320
Journal Information:
FEMS Microbiology Reviews, Vol. 40, Issue 2; ISSN 1574-6976
Publisher:
Federation of European Microbiological SocietiesCopyright Statement
Country of Publication:
United States
Language:
English

References (119)

Virus discovery: are we scientists or genome collectors? journal May 2014
CRISPRTarget: Bioinformatic prediction and analysis of crRNA targets journal February 2013
Codon usage in bacteria: correlation with gene expressivity journal January 1982
NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy journal November 2011
Recent advances in genomic DNA sequencing of microbial species from single cells journal August 2014
From cultured to uncultured genome sequences: metagenomics and modeling microbial ecosystems journal August 2015
Probing Individual Environmental Bacteria for Viruses by Using Microfluidic Digital PCR journal June 2011
Bacteria-Phage Interactions across Time and Space: Merging Local Adaptation and Time-Shift Experiments to Understand Phage Evolution journal August 2014
Viral tagging reveals discrete populations in Synechococcus viral genome sequence space journal July 2014
Viral and microbial community dynamics in four aquatic environments journal February 2010
CRISPR Provides Acquired Resistance Against Viruses in Prokaryotes journal March 2007
The human microbiome: there is much left to do journal June 2022
Use of Fluorescently Labeled Phage in the Detection and Identification of Bacterial Species journal September 2003
Mechanisms of multi-strain coexistence in host–phage systems with nested infection networks journal September 2013
Phage-bacteria relationships and CRISPR elements revealed by a metagenomic survey of the rumen microbiome: Rumen viral metagenome journal October 2011
Strain- and plasmid-level deconvolution of a synthetic metagenome by sequencing proximity ligation products preprint February 2014
Patterns and ecological drivers of ocean viral communities journal May 2015
Exploring the Vast Diversity of Marine Viruses journal June 2007
Strain- and plasmid-level deconvolution of a synthetic metagenome by sequencing proximity ligation products preprint February 2014
Virus Population Dynamics and Acquired Virus Resistance in Natural Microbial Communities journal May 2008
Using CRISPRs as a metagenomic tool to identify microbial hosts of a diffuse flow hydrothermal vent viral assemblage: CRISPR spacers reveal hosts of marine vent viral assemblage journal April 2011
Viral metagenomics journal May 2005
CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats journal June 2007
CRISPI: a CRISPR interactive database journal October 2009
Antibiotic treatment expands the resistance reservoir and ecological network of the phage metagenome journal June 2013
Prophinder: a computational tool for prophage prediction in prokaryotic genomes journal January 2008
Beyond research: a primer for considerations on using viral metagenomics in the field and clinic journal March 2015
Degenerate target sites mediate rapid primed CRISPR adaptation journal April 2014
Increasing DNA Transfer Efficiency by Temporary Inactivation of Host Restriction journal May 1999
Integration sites for genetic elements in prokaryotic tRNA and tmRNA genes: sublocation preference of integrase subfamilies journal February 2002
Safety analysis of a Russian phage cocktail: From MetaGenomic analysis to oral application in healthy human subjects journal September 2013
PhiSpy: a novel algorithm for finding prophages in bacterial genomes that combines similarity- and composition-based strategies journal May 2012
Single-cell and population level viral infection dynamics revealed by phageFISH, a method to visualize intracellular and free viruses: phageFISH - visualizing intracellular and free viruses journal March 2013
Explaining microbial population genomics through phage predation journal July 2009
Contrasting Life Strategies of Viruses That Infect Photo- and Heterotrophic Bacteria, as Revealed by Viral Tagging journal March 2013
Exploring nucleo-cytoplasmic large DNA viruses in Tara Oceans microbial metagenomes journal April 2013
The cost of phage resistance in a plant pathogenic bacterium is context‐dependent journal April 2015
Understanding Bacteriophage Specificity in Natural Microbial Communities journal March 2013
Short-term observations of marine bacterial and viral communities: patterns, connections and resilience journal February 2013
The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats journal January 2007
Insights into the phylogeny and coding potential of microbial dark matter journal July 2013
Influenza Virus Evolution, Host Adaptation, and Pandemic Formation journal June 2010
Contrasting Life Strategies of Viruses that Infect Photo- and Heterotrophic Bacteria, as Revealed by Viral Tagging journal October 2012
A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea journal December 2009
Determinants of community structure in the global plankton interactome journal May 2015
PhiSiGns: an online tool to identify signature genes in phages and design PCR primers for examining phage diversity journal March 2012
Reconstruction of Novel Cyanobacterial Siphovirus Genomes from Mediterranean Metagenomic Fosmids journal November 2012
The metagenomics RAST server – a public resource for the automatic phylogenetic and functional analysis of metagenomes journal September 2008
Rapidly evolving CRISPRs implicated in acquired resistance of microorganisms to viruses journal September 2007
Reverse Transcriptase-Mediated Tropism Switching in Bordetella Bacteriophage journal March 2002
VirSorter: mining viral signal from microbial genomic data journal January 2015
A Greedy Algorithm for Aligning DNA Sequences journal February 2000
FOCUS: an alignment-free model to identify organisms in metagenomes using non-negative least squares journal January 2014
Distribution and Mechanism of the Type I CRISPR-Cas Systems book December 2012
Fluorescence In Situ Hybridization and Catalyzed Reporter Deposition for the Identification of Marine Bacteria journal June 2002
Superinfection exclusion by P22 prophage in lysogens of Salmonella typhimurium journal December 1974
The human gut virome: Inter-individual variation and dynamic response to diet journal August 2011
Genome signature-based dissection of human gut metagenomes to extract subliminal viral sequences journal September 2013
Metagenomics and future perspectives in virus discovery journal February 2012
Strain- and plasmid-level deconvolution of a synthetic metagenome by sequencing proximity ligation products journal January 2014
Superinfection exclusion by P22 prophage in lysogens of Salmonella typhimurium journal December 1974
A Broad-Host-Range, Generalized Transducing Phage (SN-T) Acquires 16S rRNA Genes from Different Genera of Bacteria journal December 2005
Characteristics of PRD1, a Plasmid-Dependent Broad Host Range DNA Bacteriophage journal January 1974
The Sorcerer II Global Ocean Sampling Expedition: Metagenomic Characterization of Viruses within Aquatic Microbial Samples journal January 2008
Unusual biology across a group comprising more than 15% of domain Bacteria journal June 2015
Top-down controls on bacterial community structure: microbial network analysis of bacteria, T4-like viruses and protists journal November 2013
Statistical structure of host-phage interactions journal June 2011
CRISPR/Cas, the Immune System of Bacteria and Archaea journal January 2010
Rapid evolution of the human gut virome journal July 2013
BLAST+: architecture and applications journal January 2009
Identification and assembly of genomes and genetic elements in complex metagenomic samples without using reference genomes journal July 2014
Structural Aspects of the Interaction of Dairy Phages with Their Host Bacteria journal August 2012
Effects of Plants on Metacommunities and Correlation Networks of Soil Microbial Groups in an Ecologically Restored Wetland journal October 2020
Linking environmental prokaryotic viruses and their host through CRISPRs journal May 2015
Analysis of streptococcal CRISPRs from human saliva reveals substantial sequence diversity within and between subjects over time journal December 2010
A fast, lock-free approach for efficient parallel counting of occurrences of k-mers journal January 2011
A highly abundant bacteriophage discovered in the unknown sequences of human faecal metagenomes journal July 2014
Viral dark matter and virus–host interactions resolved from publicly available microbial genomes journal July 2015
Ecology and evolution of viruses infecting uncultivated SUP05 bacteria as revealed by single-cell- and meta-genomics journal August 2014
Phage–host interplay: examples from tailed phages and Gram-negative bacterial pathogens journal August 2014
Molecular evolution of bacteriophages: evidence of selection against the recognition sites of host restriction enzymes. journal January 1986
Development of a high throughput assay for indirectly measuring phage growth using the OmniLog TM system journal July 2012
Bacteriophage resistance mechanisms journal March 2010
Multi-scale structure and geographic drivers of cross-infection within marine bacteria and phages journal November 2012
Single-cell genomics-based analysis of virus–host interactions in marine surface bacterioplankton journal April 2015
CRISPR targeting reveals a reservoir of common phages associated with the human gut microbiome journal June 2012
Phage–bacteria infection networks journal February 2013
Amelioration of Bacterial Genomes: Rates of Change and Exchange journal April 1997
The Molecular and Genetic Basis of Repeatable Coevolution between Escherichia coli and Bacteriophage T3 in a Laboratory Microcosm journal June 2015
Site-specific recombination by φC31 integrase and other large serine recombinases journal March 2010
Superinfection exclusion by P22 prophage in lysogens of Salmonella typhimurium journal September 1971
Invertible DNA determines host specificity of bacteriophage Mu journal July 1980
PILER: identification and classification of genomic repeats journal June 2005
Coevolutionary diversification creates nested-modular structure in phage–bacteria interaction networks journal December 2013
Gene swapping in the dead zone journal October 2014
Characterization of the ELPhiS Prophage from Salmonella enterica Serovar Enteritidis Strain LK5 journal January 2012
Bacteriophage–host interaction: from splendid isolation into a messy reality journal August 2013
Expanding the Marine Virosphere Using Metagenomics journal December 2013
Superinfection exclusion by P22 prophage in lysogens of Salmonella typhimurium journal September 1971
Metagenomic ventures into outer sequence space journal December 2014
Explaining microbial population genomics through phage predation journal November 2009
Oxygen minimum zones harbour novel viral communities with low diversity: Viral community characteristics of an oxygen minimum zone journal October 2012
Characterization of the Integrase Gene and Attachment Site for the Myxococcus xanthus Bacteriophage Mx9 journal November 2003
Species-Level Deconvolution of Metagenome Assemblies with Hi-C–Based Contact Probability Maps journal May 2014
CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats journal May 2007
Gnotobiotic mouse model of phage-bacterial host dynamics in the human gut journal November 2013
Basic Local Alignment Search Tool journal October 1990
Coriander Genomics Database: a genomic, transcriptomic, and metabolic database for coriander journal April 2020
Prevalence of Broad-Host-Range Lytic Bacteriophages of Sphaerotilus natans, Escherichia coli, andPseudomonas aeruginosa journal February 1998
Cyanophages infecting the oceanic cyanobacterium Prochlorococcus journal August 2003
Comparative metagenomics of microbial traits within oceanic viral communities journal February 2011
Complete Genome Sequence of Bacillus thuringiensis subsp. kurstaki Strain HD73 journal February 2013
Phage_Finder: Automated identification and classification of prophage regions in complete bacterial genome sequences journal October 2006
Reference-independent comparative metagenomics using cross-assembly: crAss journal October 2012
Comparative genomics of closely related salmonellae journal February 2002
Diversity in the serine recombinases: Diversity in the serine recombinases journal April 2002
Structure of the lambda att sites generated by int-dependent deletions. journal November 1978
Host Controlled Variation in Bacterial Viruses1 journal January 1953
Genetic Studies of Lysogenicity in Escherichia coli journal January 1953

Cited By (79)

Host Taxon Predictor - A Tool for Predicting Taxon of the Host of a Newly Discovered Virus journal March 2019
Marine viruses discovered via metagenomics shed light on viral strategies throughout the oceans journal July 2017
Insights of Phage-Host Interaction in Hypersaline Ecosystem through Metagenomics Analyses journal March 2017
Temporal dynamics of uncultured viruses: a new dimension in viral diversity journal October 2017
Computational prediction of inter-species relationships through omics data analysis and machine learning journal November 2018
Long-read metagenomic exploration of extrachromosomal mobile genetic elements in the human gut journal August 2019
Minimum Information about an Uncultivated Virus Genome (MIUViG) journal December 2018
Trends of Microdiversity Reveal Depth-Dependent Evolutionary Strategies of Viruses in the Mediterranean journal November 2019
The enemy from within: a prophage of Roseburia intestinalis systematically turns lytic in the mouse gut, driving bacterial adaptation by CRISPR spacer acquisition journal March 2019
Phage puppet masters of the marine microbial realm journal June 2018
Alignment-free $d_2^*$ oligonucleotide frequency dissimilarity measure improves prediction of hosts from metagenomically-derived viral sequences journal November 2016
PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning journal June 2019
Alignment-free genomic sequence comparison using FCGR and signal processing journal December 2019
The Human Gut Phage Community and Its Implications for Health and Disease journal June 2017
IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses journal October 2016
A viral reckoning: viruses emerge as essential manipulators of global ecosystems: Crystal ball journal October 2018
Genomic and ecological attributes of marine bacteriophages encoding bacterial virulence genes journal February 2020
Multiple mechanisms drive phage infection efficiency in nearly identical hosts journal March 2018
Phage-centric ecological interactions in aquatic ecosystems revealed through ultra-deep metagenomics journal October 2019
Linking Virus Genomes with Host Taxonomy journal March 2016
Towards optimized viral metagenomes for double-stranded and single-stranded DNA viruses from challenging soils journal January 2019
Genomic and Seasonal Variations among Aquatic Phages Infecting the Baltic Sea Gammaproteobacterium Rheinheimera sp. Strain BAL341 journal July 2019
Genome‐resolved viral and cellular metagenomes revealed potential key virus‐host interactions in a deep freshwater lake journal September 2019
Biogeography and environmental conditions shape bacteriophage-bacteria networks across the human microbiome journal April 2018
Intriguing Interaction of Bacteriophage-Host Association: An Understanding in the Era of Omics journal April 2017
MetaCRAST: Reference-guided extraction of CRISPR spacers from unassembled metagenomes preprint August 2017
Gene networks provide a high-resolution view of bacteriophage ecology posted_content June 2017
CRISPR analysis suggests that small circular single-stranded DNA smacoviruses infect Archaea instead of humans journal January 2019
A virus or more in (nearly) every cell: ubiquitous networks of virus–host interactions in extreme environments journal February 2018
Genomes of Abundant and Widespread Viruses from the Deep Ocean journal July 2016
Environmental drivers of viral community composition in Antarctic soils identified by viromics journal July 2017
Host-linked soil viral ecology along a permafrost thaw gradient journal July 2018
MARVEL, a Tool for Prediction of Bacteriophage Sequences in Metagenomic Bins journal August 2018
Evolutionary History of Bacteriophages in the Genus Paraburkholderia journal May 2018
Uncovering Earth’s virome journal August 2016
Using CAPTURE to detect spacer acquisition in native CRISPR arrays journal February 2019
Interpretable detection of novel human viruses from genome sequencing data journal February 2021
Diversities and potential biogeochemical impacts of mangrove soil viruses journal April 2019
The virome: a missing component of biological interaction networks in health and disease journal April 2016
Correction to: Phage-centric ecological interactions in aquatic ecosystems revealed through ultra-deep metagenomics journal March 2020
MVP: a microbe–phage interaction database journal November 2017
Cultivation and sequencing of rumen microbiome members from the Hungate1000 Collection journal March 2018
A Viral Ecogenomics Framework To Uncover the Secrets of Nature’s “Microbe Whisperers” journal May 2019
Gene Co-occurrence Networks Reflect Bacteriophage Ecology and Evolution journal March 2018
Reply to: Caution in inferring viral strategies from abundance correlations in marine metagenomes journal January 2019
Depth-related variability in viral communities in highly stratified sulfidic mine tailings journal June 2020
Murine colitis reveals a disease-associated bacteriophage community journal July 2018
Phage-centric ecological interactions in aquatic ecosystems revealed through ultra-deep metagenomics text January 2019
A Protocol for Extraction of Infective Viromes Suitable for Metagenomics Sequencing from Low Volume Fecal Samples journal July 2019
An Integrative Approach to Virus–Host Protein–Protein Interactions book January 2018
A Protocol for Extraction of Infective Viromes Suitable for Metagenomics Sequencing from Low Volume Fecal Samples preprint June 2019
Defining the human gut host–phage network through single-cell viral tagging journal August 2019
Discovery of several novel, widespread, and ecologically distinct marine Thaumarchaeota viruses that encode amoC nitrification genes journal October 2018
Lysogeny in nature: mechanisms, impact and ecology of temperate phages journal March 2017
Interpretable detection of novel human viruses from genome sequencing data text January 2021
Ultrastructure and Viral Metagenome of Bacteriophages from an Anaerobic Methane Oxidizing Methylomirabilis Bioreactor Enrichment Culture journal November 2016
Lysogeny is prevalent and widely distributed in the murine gut microbiota journal February 2018
Phages for Phage Therapy: Isolation, Characterization, and Host Range Breadth journal March 2019
Biological species in the viral world journal May 2018
Determining virus-host interactions and glycerol metabolism profiles in geographically diverse solar salterns with metagenomics preprint July 2016
Determining virus-host interactions and glycerol metabolism profiles in geographically diverse solar salterns with metagenomics journal January 2017
Prevalence of viral photosynthesis genes along a freshwater to saltwater transect in Southeast USA journal July 2019
Approaches to optimize therapeutic bacteriophage and bacteriophage-derived products to combat bacterial infections journal February 2020
Long-read based de novo assembly of low-complexity metagenome samples results in finished genomes and reveals insights into strain diversity and an active phage system journal June 2019
Global phylogeography and ancient evolution of the widespread human gut virus crAssphage journal July 2019
Scaffolding bacterial genomes and probing host-virus interactions in gut microbiome by proximity ligation (chromosome capture) assay journal February 2017
Rapid identification of human‐infecting viruses journal August 2019
The enemy from within: a prophage of Roseburia intestinalis systematically turns lytic in the mouse gut, driving bacterial adaptation by CRISPR spacer acquisition journal December 2019
Phages and Human Health: More Than Idle Hitchhikers journal June 2019
Nontargeted virus sequence discovery pipeline and virus clustering for metagenomic data journal July 2017
HostPhinder: A Phage Host Prediction Tool journal May 2016
Minimum information about an uncultivated virus genome (MIUVIG) text January 2019
“French Phage Network”—Second Meeting Report journal April 2017
VirFinder: a novel k-mer based tool for identifying viral sequences from assembled metagenomic data journal July 2017
Predicting virus-host association by Kernelized logistic matrix factorization and similarity network fusion journal December 2019
Comprehensive Insights Into Composition, Metabolic Potentials, and Interactions Among Archaeal, Bacterial, and Viral Assemblages in Meromictic Lake Shunet in Siberia journal August 2018
Ecogenomics and potential biogeochemical impacts of globally abundant ocean viruses journal September 2016
WIsH: who is the host? Predicting prokaryotic hosts from metagenomic phage contigs journal July 2017
Mini‐Metagenomics and Nucleotide Composition Aid the Identification and Host Association of Novel Bacteriophage Sequences journal May 2019

Figures / Tables (5)