
Title: On the Use of Gene Ontology Annotations to Assess Functional Similarity among Orthologs and Paralogs: A Short Report

Journal Article · PLoS Computational Biology (Online)
 [1];  [2];  [3];  [3];  [4]
  1. Univ. of California, Los Angeles, CA (United States). Dept. of Preventive Medicine. Division of Bioinformatics
  2. Univ. of Cambridge (United Kingdom). Dept. of Biochemistry. Cambridge Systems Biology Centre
  3. Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Genomics Division
  4. The Jackson Lab., Bar Harbor, ME (United States). Bioinformatics and Computational Biology

A recent paper (Nehrt et al., PLoS Comput. Biol. 7:e1002073, 2011) has proposed a metric for the “functional similarity” between two genes that uses only the Gene Ontology (GO) annotations directly derived from published experimental results. Applying this metric, the authors concluded that paralogous genes within the mouse genome or the human genome are more functionally similar on average than orthologous genes between these genomes, an unexpected result with broad implications if true. We suggest, based on both theoretical and empirical considerations, that this proposed metric should not be interpreted as a functional similarity, and therefore cannot be used to support any conclusions about the “ortholog conjecture” (or, more properly, the “ortholog functional conservation hypothesis”). First, we reexamine the case studies presented by Nehrt et al. as examples of orthologs with divergent functions, and come to a very different conclusion: they actually exemplify how GO annotations for orthologous genes provide complementary information about conserved biological functions. We then show that there is a global ascertainment bias in the experiment-based GO annotations for human and mouse genes: particular types of experiments tend to be performed in different model organisms. We conclude that the reported statistical differences in annotations between pairs of orthologous genes do not reflect differences in biological function, but rather complementarity in experimental approaches.
Our results underscore two general considerations for researchers proposing novel types of analysis based on the GO: 1) that GO annotations are often incomplete, potentially in a biased manner, and subject to an “open world assumption” (absence of an annotation does not imply absence of a function), and 2) that conclusions drawn from a novel, large-scale GO analysis should whenever possible be supported by careful, in-depth examination of examples, to help ensure the conclusions have a justifiable biological basis.
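
The ascertainment-bias argument above can be illustrated with a small toy sketch. Everything in it is an assumption made for illustration: the GO term labels and annotation sets are invented, and a plain Jaccard overlap stands in for an annotation-based similarity score (it is not the metric used by Nehrt et al.). The sketch only shows how two genes with identical underlying functions can score as dissimilar when each has been studied with a different type of experiment.

```python
# Toy illustration only: the GO term labels, the annotation sets, and the
# Jaccard score are assumptions for this sketch, not the metric or data
# from Nehrt et al.

def jaccard(a: set[str], b: set[str]) -> float:
    """Overlap of two annotation sets: |a & b| / |a | b| (0.0 if both empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

# Suppose the true functions of a human/mouse ortholog pair are identical.
true_functions = {"kinase activity", "DNA repair",
                  "heart development", "cell migration"}

# Hypothetical ascertainment bias: biochemical assays were run on the human
# protein, knockout/phenotype experiments on the mouse gene.
human_annotated = {"kinase activity", "DNA repair"}
mouse_annotated = {"heart development", "cell migration"}

# Open world assumption: both sets are correct but incomplete subsets.
assert human_annotated <= true_functions
assert mouse_annotated <= true_functions

ortholog_score = jaccard(human_annotated, mouse_annotated)

# Two hypothetical human paralogs characterized with the same assay type
# share annotations largely because the same experiment was run on both.
paralog_a = {"kinase activity", "DNA repair"}
paralog_b = {"kinase activity"}
paralog_score = jaccard(paralog_a, paralog_b)

print(f"ortholog pair: {ortholog_score:.2f}")
print(f"paralog pair:  {paralog_score:.2f}")
```

In this toy case the ortholog pair scores lower than the paralog pair even though the orthologs were defined to share all four functions, which is the kind of artifact the abstract attributes to complementary experimental approaches rather than to biological divergence.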

Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States). National Energy Research Scientific Computing Center (NERSC)
Sponsoring Organization:
USDOE Office of Science (SC), Biological and Environmental Research (BER). Biological Systems Science Division; National Institutes of Health (NIH)
Grant/Contract Number:
AC02-05CH11231; P41 HG002273; R01 GM081084
OSTI ID:
1627221
Journal Information:
PLoS Computational Biology (Online), Vol. 8, Issue 2; ISSN 1553-7358
Publisher:
Public Library of Science
Country of Publication:
United States
Language:
English

References (18)

The Mouse Genome Database (MGD): premier model organism resource for mammalian genomics and genetics (journal, November 2010)
The MAP Kinase Signaling Cascades: A System of Hundreds of Components Regulates a Diverse Array of Physiological Functions (book, January 2010)
Testing the Ortholog Conjecture with Comparative Functional Genomic Data from Mammals (journal, June 2011)
A novel DNA damage recognition protein in Schizosaccharomyces pombe (journal, April 2006)
How confident can we be that orthologs are similar, but paralogs differ? (journal, May 2009)
The 3′ Ends of Mature Transcripts Are Generated by a Processosome Complex in Fission Yeast Mitochondria (journal, April 2008)
Control of a Kinesin-Cargo Linkage Mechanism by JNK Pathway Kinases (journal, August 2007)
Physiological and Molecular Basis of Thyroid Hormone Action (journal, July 2001)
Gene Ontology: tool for the unification of biology (journal, May 2000)
The GOA database in 2009--an integrated Gene Ontology Annotation resource (journal, January 2009)
Gene Ontology annotations: what they mean and where they come from (journal, January 2008)
The Gene Ontology in 2010: extensions and refinements (journal, January 2010)
Protein Evolution by Molecular Tinkering: Diversification of the Nuclear Receptor Superfamily from a Ligand-Dependent Ancestor (journal, October 2010)
Evolution of Hormone-Receptor Complexity by Molecular Exploitation (journal, April 2006)
Distinguishing Homologous from Analogous Proteins (journal, June 1970)
When orthologs diverge between human and mouse (journal, June 2011)
Motor Proteins: Trafficking and Signaling Collide (journal, September 2007)
Estrogen receptors and human disease (journal, March 2006)

Cited By (40)

Genome-Wide Analysis of Protein Disorder in Arabidopsis thaliana: Implications for Plant Environmental Adaptation (journal, February 2013)
Identifying mouse developmental essential genes using machine learning (journal, December 2018)
Standardized benchmarking in the quest for orthologs (journal, April 2016)
ARTDeco: automatic readthrough transcription detection (journal, May 2020)
Biological interpretation of genome-wide association studies using predicted gene functions (journal, January 2015)
Functional and evolutionary implications of gene orthology (journal, April 2013)
Protein Function Prediction: Problems and Pitfalls (journal, September 2015)
An integrative data analysis platform for gene set analysis and knowledge discovery in a data warehouse framework (journal, January 2016)
Big data and other challenges in the quest for orthologs (journal, July 2014)
Interspecies gene function prediction using semantic similarity (journal, December 2016)
Semantic Similarity from Natural Language and Ontology Analysis (journal, May 2015)
Semantic Similarity from Natural Language and Ontology Analysis (text, January 2017)
A Tight Link between Orthologs and Bidirectional Best Hits in Bacterial and Archaeal Genomes (journal, November 2012)
Functional and structural profiles of GST gene family from three Populus species reveal the sequence–function decoupling of orthologous genes (journal, September 2018)
Gene ontology improves template selection in comparative protein docking (journal, December 2018)
Conserved syntenic clusters of protein coding genes are missing in birds (journal, December 2014)
The Ortholog Conjecture Revisited: the Value of Orthologs and Paralogs in Function Prediction (journal, December 2019)
Pairwise comparisons across species are problematic when analyzing functional genomic data (journal, January 2018)
Accurate prediction of orthologs in the presence of divergence after duplication (journal, June 2018)
Accurate prediction of orthologs in the presence of divergence after duplication (journal, April 2018)
Human Monogenic Disease Genes Have Frequently Functionally Redundant Paralogs (journal, May 2013)
The ortholog conjecture revisited: the value of orthologs and paralogs in function prediction (journal, July 2020)
OrtholugeDB: a bacterial and archaeal orthology resource for improved comparative genomic analysis (journal, November 2012)
OrthoList 2: A New Comparative Genomic Analysis of Human and Caenorhabditis elegans Genes (journal, August 2018)
Standardized benchmarking in the quest for orthologs (text, January 2016)
The Ortholog Conjecture Is Untestable by the Current Gene Ontology but Is Supported by RNA Sequencing Data (journal, November 2012)
Gene Family Level Comparative Analysis of Gene Expression in Mammals Validates the Ortholog Conjecture (journal, March 2014)
Protein Function Prediction Using Deep Restricted Boltzmann Machines (journal, January 2017)
The case of Iranian immigrants in the greater Toronto area: a qualitative study (journal, January 2012)
Progress and challenges in the computational prediction of gene function using networks (journal, September 2012)
Phyletic Profiling with Cliques of Orthologs Is Enhanced by Signatures of Paralogy Relationships (journal, January 2013)
Ten Quick Tips for Using the Gene Ontology (journal, November 2013)
WORMHOLE: Novel Least Diverged Ortholog Prediction through Machine Learning (journal, November 2016)
Quickly Finding Orthologs as Reciprocal Best Hits with BLAT, LAST, and UBLAST: How Much Do We Miss? (journal, July 2014)
Characterizing the state of the art in the computational assignment of gene function: lessons from the first critical assessment of functional annotation (CAFA) (text, January 2013)
In Silico Analysis and Experimental Validation of Active Compounds from Cichorium intybus L. Ameliorating Liver Injury (journal, September 2015)
Resolving the Ortholog Conjecture: Orthologs Tend to Be Weakly, but Significantly, More Similar in Function than Paralogs (text, January 2012)
Phylogenetic Profiling: How Much Input Data Is Enough? (text, January 2015)
Evaluating the adaptive evolutionary convergence of carnivorous plant taxa through functional genomics (journal, January 2018)