U.S. Department of Energy
Office of Scientific and Technical Information

Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam

Journal Article · BMC Evolutionary Biology (Online)
 [1];  [1];  [1];  [2];  [3]
  1. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Computer Science and Mathematics Division
  2. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioSciences Division; Univ. of Tennessee, Knoxville, TN (United States). Joint Inst. for Biological Sciences
  3. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Computer Science and Mathematics Division; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioSciences Division

Phylogenetic studies have provided detailed knowledge of the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life are now covered by sequenced genomes in GenBank, which enables a large-scale functional phylogenomics study of computationally inferred cellular functions across all sequenced prokaryotes. In this study, a total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accurate comparison. The functional profile of each genome was represented by the biological process Gene Ontology (GO) terms in its annotation. GO term enrichment analysis differentiated the functional profiles of selected archaeal taxa. A total of 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways across the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways over evolutionary history. In conclusion, this functional phylogenomics analysis shows divergent functional profiles among taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. In contrast, many cellular functions are sparsely dispersed across many clades and have high parsimony scores. These different types of cellular functions have distinct evolutionary patterns, as reconstructed on the prokaryotic tree.
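The contrast the abstract draws between clade-specific pathways (low parsimony score) and sparsely dispersed pathways (high parsimony score) can be illustrated with a small sketch. It is not the authors' pipeline: it assumes a toy rooted binary tree of eight hypothetical genomes (A to H), treats a pathway as a binary presence/absence character, counts changes with Fitch parsimony, and computes the retention index as (g - s) / (g - m), where s is the observed number of changes, m the minimum possible, and g the maximum possible on any tree. The function names and the two example pathways are made up for illustration.

```python
def fitch_score(tree, states):
    """Fitch parsimony on a rooted binary tree given as nested 2-tuples.

    Leaves are genome names (str); `states` maps each leaf to 0 (pathway
    absent) or 1 (pathway present). Returns (candidate_states, changes).
    """
    if isinstance(tree, str):
        return {states[tree]}, 0
    left_set, left_changes = fitch_score(tree[0], states)
    right_set, right_changes = fitch_score(tree[1], states)
    common = left_set & right_set
    if common:
        # Children agree on at least one state: no extra change needed.
        return common, left_changes + right_changes
    # Children disagree: one gain or loss must occur below this node.
    return left_set | right_set, left_changes + right_changes + 1


def retention_index(tree, states):
    """Retention index (g - s) / (g - m) for a binary character.

    Returns None when the character is uninformative (g == m).
    """
    values = list(states.values())
    present = sum(values)
    absent = len(values) - present
    s = fitch_score(tree, states)[1]              # observed changes
    m = 1 if 0 < present < len(values) else 0     # minimum possible changes
    g = min(present, absent)                      # maximum possible changes
    if g == m:
        return None
    return (g - s) / (g - m)


# Hypothetical tree: two clades of four genomes each.
toy_tree = ((("A", "B"), ("C", "D")), (("E", "F"), ("G", "H")))

# Hypothetical pathway found only in the first clade.
clade_specific = {"A": 1, "B": 1, "C": 1, "D": 1, "E": 0, "F": 0, "G": 0, "H": 0}
# Hypothetical pathway scattered across both clades.
dispersed = {"A": 1, "B": 0, "C": 1, "D": 0, "E": 1, "F": 0, "G": 1, "H": 0}

clade_changes = fitch_score(toy_tree, clade_specific)[1]
dispersed_changes = fitch_score(toy_tree, dispersed)[1]
clade_ri = retention_index(toy_tree, clade_specific)
dispersed_ri = retention_index(toy_tree, dispersed)

print("clade-specific:", clade_changes, "change(s), RI =", clade_ri)
print("dispersed:     ", dispersed_changes, "change(s), RI =", dispersed_ri)
```

On this toy tree the clade-specific pathway needs a single change and has a retention index of 1.0, while the dispersed pathway needs four changes and has a retention index of 0.0, which mirrors the two patterns described in the abstract.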

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); BioEnergy Science Center (BESC)
Sponsoring Organization:
DOE Office of Science; USDOE; ORNL LDRD Director's R&D
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1286772
Journal Information:
BMC Evolutionary Biology (Online), Vol. 14, Issue 1; ISSN 1471-2148
Publisher:
BioMed Central
Country of Publication:
United States
Language:
English
