skip to main content

SciTech ConnectSciTech Connect

Title: Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam

To supply some background, phylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life have now been covered by sequenced genomes in GenBank. This enables a large-scale functional phylogenomics study of many computationally inferred cellular functions across all sequenced prokaryotes. Our results show a total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accurate comparison. The functional profile of a genome was represented by the biological process Gene Ontology (GO) terms in its annotation. The GO term enrichment analysis differentiated the functional profiles between selected archaeal taxa. 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways in the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways in evolutionary history. Inmore » conclusion, our functional phylogenomics analysis shows divergent functional profiles of taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. On the other hand, many cellular functions are sparsely dispersed across many clades with high parsimony scores. These different types of cellular functions have distinct evolutionary patterns reconstructed from the prokaryotic tree.« less
 [1] ;  [1] ;  [1] ;  [2] ;  [3]
  1. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Computer Science and Mathematics Division
  2. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioSciences Division; Univ. of Tennessee, Knoxville, TN (United States). Joint Inst. for Biological Sciences
  3. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Computer Science and Mathematics Division; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioSciences Division
Publication Date:
OSTI Identifier:
Grant/Contract Number:
Accepted Manuscript
Journal Name:
BMC Evolutionary Biology (Online)
Additional Journal Information:
Journal Name: BMC Evolutionary Biology (Online); Journal Volume: 14; Journal Issue: 1; Journal ID: ISSN 1471-2148
BioMed Central
Research Org:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); BioEnergy Science Center (BESC)
Sponsoring Org:
Country of Publication:
United States
59 BASIC BIOLOGICAL SCIENCES Prokaryotes; Cellular function; Pathway; Genomes; Evolution; Phylogenomics