skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Functional phylogenomics analysis of bacteria and archaea using consistent genome annotation with UniFam

Journal Article · · BMC Evolutionary Biology (Online)
 [1];  [1];  [1];  [2];  [3]
  1. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Computer Science and Mathematics Division
  2. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioSciences Division; Univ. of Tennessee, Knoxville, TN (United States). Joint Inst. for Biological Sciences
  3. Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Computer Science and Mathematics Division; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). BioSciences Division

To supply some background, phylogenetic studies have provided detailed knowledge on the evolutionary mechanisms of genes and species in Bacteria and Archaea. However, the evolution of cellular functions, represented by metabolic pathways and biological processes, has not been systematically characterized. Many clades in the prokaryotic tree of life have now been covered by sequenced genomes in GenBank. This enables a large-scale functional phylogenomics study of many computationally inferred cellular functions across all sequenced prokaryotes. Our results show a total of 14,727 GenBank prokaryotic genomes were re-annotated using a new protein family database, UniFam, to obtain consistent functional annotations for accurate comparison. The functional profile of a genome was represented by the biological process Gene Ontology (GO) terms in its annotation. The GO term enrichment analysis differentiated the functional profiles between selected archaeal taxa. 706 prokaryotic metabolic pathways were inferred from these genomes using Pathway Tools and MetaCyc. The consistency between the distribution of metabolic pathways in the genomes and the phylogenetic tree of the genomes was measured using parsimony scores and retention indices. The ancestral functional profiles at the internal nodes of the phylogenetic tree were reconstructed to track the gains and losses of metabolic pathways in evolutionary history. In conclusion, our functional phylogenomics analysis shows divergent functional profiles of taxa and clades. Such function-phylogeny correlation stems from a set of clade-specific cellular functions with low parsimony scores. On the other hand, many cellular functions are sparsely dispersed across many clades with high parsimony scores. These different types of cellular functions have distinct evolutionary patterns reconstructed from the prokaryotic tree.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). BioEnergy Science Center (BESC)
Sponsoring Organization:
USDOE Office of Science (SC); USDOE Laboratory Directed Research and Development (LDRD) Program
Grant/Contract Number:
AC05-00OR22725
OSTI ID:
1286772
Journal Information:
BMC Evolutionary Biology (Online), Vol. 14, Issue 1; ISSN 1471-2148
Publisher:
BioMed CentralCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 16 works
Citation information provided by
Web of Science

References (54)

Horizontal gene transfer, genome innovation and evolution journal August 2005
Prokka: rapid prokaryotic genome annotation journal March 2014
Comparison of phylogenetic trees journal February 1981
The MetaCyc Database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases journal December 2007
Enzyme Recruitment and Its Role in Metabolic Expansion journal January 2014
High-throughput generation, optimization and analysis of genome-scale metabolic models journal August 2010
The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases journal November 2011
KAAS: an automatic genome annotation and pathway reconstruction server journal May 2007
Accelerated Profile HMM Searches journal October 2011
A survey of metabolic databases emphasizing the MetaCyc family journal April 2011
Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences journal August 2013
Estimating Divergence Times in Large Phylogenetic Trees journal October 2007
Improved scoring of functional groups from gene expression data by decorrelating GO graph structure journal April 2006
phangorn: phylogenetic analysis in R journal December 2010
KEGG: Kyoto Encyclopedia of Genes and Genomes journal January 2000
GenBank journal November 2012
Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology journal December 2009
The Retention Index and the Rescaled Consistency Index journal December 1989
Consistency of gene starts among Burkholderia genomes journal February 2011
BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions journal January 2010
FastTree 2 – Approximately Maximum-Likelihood Trees for Large Alignments journal March 2010
The Bacterial Species Challenge: Making Sense of Genetic and Ecological Diversity journal February 2009
Horizontal gene transfer in evolution: facts and challenges journal November 2009
The prokaryotic tree of life: past, present…and future? journal May 2008
The Pfam protein families database journal November 2011
Prodigal: prokaryotic gene recognition and translation initiation site identification journal March 2010
The COG database: an updated version includes eukaryotes journal January 2003
APE: Analyses of Phylogenetics and Evolution in R language journal January 2004
Genome Majority Vote Improves Gene Predictions journal November 2011
Lateral Gene Transfer and the Origins of Prokaryotic Groups journal December 2003
Ancient horizontal gene transfer journal February 2003
PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes journal August 2013
A systematic comparison of the MetaCyc and KEGG pathway databases journal January 2013
Reconstructing ancestral character states under Wagner parsimony journal December 1987
MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability journal January 2013
Origins and impact of constraints in evolution of gene families journal October 2006
Biased biological functions of horizontally transferred genes in prokaryotic genomes journal June 2004
Search and clustering orders of magnitude faster than BLAST journal August 2010
Computational tools for metabolic engineering journal May 2012
A Model Recognition Approach to the Prediction of All-Helical Membrane Protein Structure and Topology journal March 1994
Reactome: a database of reactions, pathways and biological processes journal November 2010
The RAST Server: Rapid Annotations using Subsystems Technology journal January 2008
Commensal Pseudomonas protect Arabidopsis thaliana from a coexisting pathogen via multiple lineage-dependent mechanisms journal December 2021
New globally distributed bacterial phyla within the FCB superphylum journal December 2022
Expanded diversity of Asgard archaea and their relationships with eukaryotes journal April 2021
Partitioning RNAs by length improves transcriptome reconstruction from short-read RNA-seq data journal January 2022
Red versus green leaves: transcriptomic comparison of foliar senescence between two Prunus cerasifera genotypes journal February 2020
A New Phylogenomic Approach For Quantifying Horizontal Gene Transfer Trends in Prokaryotes journal July 2020
Climatic oscillations in Quaternary have shaped the co-evolutionary patterns between the Norway spruce and its host-associated herbivore journal October 2020
Environmental factors shape the epiphytic bacterial communities of Gracilariopsis lemaneiformis journal April 2021
Full-length transcriptome analysis of multiple organs and identification of adaptive genes and pathways in Mikania micrantha journal February 2022
The Gene Ontology (GO) database and informatics resource journal January 2004
A simulation comparison of phylogeny algorithms under equal and unequal evolutionary rates. journal May 1994
Enzyme Recruitment and Its Role in Metabolic Expansion text January 2014

Cited By (8)

Microbial metaproteomics for characterizing the range of metabolic functions and activities of human gut microbiota journal May 2015
Uncovering carbohydrate metabolism through a genotype-phenotype association study of 56 lactic acid bacteria genomes journal March 2019
A genomic perspective on stoichiometric regulation of soil carbon cycling journal July 2017
Community proteogenomics reveals the systemic impact of phosphorus availability on microbial functions in tropical soil journal January 2018
Genome-Resolved Proteomic Stable Isotope Probing of Soil Microbial Communities Using 13CO2 and 13C-Methanol journal December 2019
Compact graphical representation of phylogenetic data and metadata with GraPhlAn journal January 2015
Acquisition of 1,000 eubacterial genes physiologically transformed a methanogen at the origin of Haloarchaea journal November 2012
Increasing Metagenomic Resolution of Microbiome Interactions Through Functional Phylogenomics and Bacterial Sub-Communities journal February 2016