DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Measuring semantic similarities by combining gene ontology annotations and gene co-function networks

Journal Article · · BMC Bioinformatics
 [1];  [2];  [3];  [4];  [3];  [5]
  1. Harbin Institute of Technology, Harbin (China); Michigan State Univ., East Lansing, MI (United States); None
  2. Michigan State Univ., East Lansing, MI (United States)
  3. Carnegie Institution for Science, Stanford, CA (United States)
  4. Harbin Institute of Technology, Harbin (China)
  5. Michigan State University, East Lansing, MI (United States)

Background: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. Results: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstrate that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. Conclusions: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited.

Research Organization:
Michigan State Univ., East Lansing, MI (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Basic Energy Sciences (BES) (SC-22)
Grant/Contract Number:
FG02-91ER20021
OSTI ID:
1194164
Journal Information:
BMC Bioinformatics, Journal Name: BMC Bioinformatics Journal Issue: 1 Vol. 16; ISSN 1471-2105
Publisher:
BioMed CentralCopyright Statement
Country of Publication:
United States
Language:
English

References (70)

Globally predicting protein functions based on co-expressed protein–protein interaction networks and ontology taxonomy similarities journal April 2007
Towards revealing the functions of all genes in plants journal April 2014
Basic local alignment search tool journal October 1990
Cytochrome P450 and Chemical Toxicology journal January 2008
Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana journal January 2010
An integrated approach to characterize genetic interaction networks in yeast metabolism journal May 2011
Microarray data analysis: from disarray to consolidation and consensus journal January 2006
Use and misuse of the gene ontology annotations journal May 2008
A novel network pharmacology approach for leukaemia differentiation therapy using Mogrify® journal October 2022
Mitochondrial dysfunction induces ALK5-SMAD2-mediated hypovascularization and arteriovenous malformations in mouse retinas journal December 2022
Dietary palmitic acid promotes a prometastatic memory via Schwann cells journal November 2021
Defining genetic interaction journal February 2008
The Gene Ontology Categorizer journal July 2004
The MIPS mammalian protein-protein interaction database journal November 2004
A new method to measure the semantic similarity of GO terms journal March 2007
Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications journal May 2007
QuickGO: a web-based tool for Gene Ontology searching journal September 2009
Diverse Transcriptional Programs Associated with Environmental Stress and Hormones in the Arabidopsis Receptor-Like Kinase Gene Family journal January 2009
Saccharomyces Genome Database: the genomics resource of budding yeast journal November 2011
The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools journal December 2011
PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors journal October 2013
Prioritizing candidate disease genes by network-based boosting of genome-wide association data journal May 2011
Creation of a Genome-Wide Metabolic Pathway Database for Populus trichocarpa Using a New Approach for Reconstruction and Curation of Metabolic Pathways for Plants journal June 2010
Enhanced automated function prediction using distantly related sequences and contextual association by PFP journal June 2006
Arabidopsis Transcription Factors: Genome-Wide Comparative Analysis Among Eukaryotes journal December 2000
Diversification of P450 Genes During Land Plant Evolution journal June 2010
Semantic Similarity in Biomedical Ontologies journal July 2009
An Improved, Bias-Reduced Probabilistic Functional Gene Network of Baker's Yeast, Saccharomyces cerevisiae journal October 2007
Evaluation of high-throughput functional categorization of human disease genes text January 2007
Comparing partitions journal December 1985
Basic local alignment search tool journal October 1990
Globally predicting protein functions based on co-expressed protein–protein interaction networks and ontology taxonomy similarities journal April 2007
Towards revealing the functions of all genes in plants journal April 2014
Cytochrome P450 and Chemical Toxicology journal January 2008
Gene Ontology: tool for the unification of biology journal May 2000
Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana journal January 2010
An integrated approach to characterize genetic interaction networks in yeast metabolism journal May 2011
Microarray data analysis: from disarray to consolidation and consensus journal January 2006
Use and misuse of the gene ontology annotations journal May 2008
Defining genetic interaction journal February 2008
The Pathway Tools software journal July 2002
The Gene Ontology Categorizer journal July 2004
Predicting gene function through systematic analysis and quality assessment of high-throughput data journal November 2004
The MIPS mammalian protein-protein interaction database journal November 2004
Using GOstats to test gene lists for GO term association journal November 2006
A new method to measure the semantic similarity of GO terms journal March 2007
Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications journal May 2007
QuickGO: a web-based tool for Gene Ontology searching journal September 2009
Measuring gene functional similarity based on group-wise comparison of GO terms journal April 2013
Diverse Transcriptional Programs Associated with Environmental Stress and Hormones in the Arabidopsis Receptor-Like Kinase Gene Family journal January 2009
Saccharomyces Genome Database: the genomics resource of budding yeast journal November 2011
The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools journal December 2011
STRING v9.1: protein-protein interaction networks, with increased coverage and integration journal November 2012
PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors journal October 2013
The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of Pathway/Genome Databases journal November 2013
Prioritizing candidate disease genes by network-based boosting of genome-wide association data journal May 2011
MetaCyc and AraCyc. Metabolic Pathway Databases for Plant Research journal May 2005
Creation of a Genome-Wide Metabolic Pathway Database for Populus trichocarpa Using a New Approach for Reconstruction and Curation of Metabolic Pathways for Plants journal June 2010
Enhanced automated function prediction using distantly related sequences and contextual association by PFP journal June 2006
A categorization approach to automated ontological function annotation journal June 2006
Arabidopsis Transcription Factors: Genome-Wide Comparative Analysis Among Eukaryotes journal December 2000
Diversification of P450 Genes During Land Plant Evolution journal June 2010
A new measure for functional similarity of gene products based on Gene Ontology journal June 2006
Evaluation of high-throughput functional categorization of human disease genes journal January 2007
Functional classification of proteins for the prediction of cellular function from a protein-protein interaction network journal January 2003
Computational prediction of human metabolic pathways from the complete human genome journal January 2004
Classification journal June 1999
Semantic Similarity in Biomedical Ontologies journal July 2009
An Improved, Bias-Reduced Probabilistic Functional Gene Network of Baker's Yeast, Saccharomyces cerevisiae journal October 2007
Improving the Measurement of Semantic Similarity between Gene Ontology Terms and Gene Products: Insights from an Edge- and IC-Based Hybrid Method journal May 2013

Cited By (16)

Exploring Approaches for Detecting Protein Functional Similarity within an Orthology-based Framework journal March 2017
OAHG: an integrated resource for annotating human genes with multi-level ontologies journal October 2016
Constructing an integrated gene similarity network for the identification of disease genes conference December 2016
Predicting disease-related genes using integrated biomedical networks journal January 2017
Erratum to: InteGO2: a web tool for measuring and visualizing gene semantic similarities using Gene Ontology journal March 2017
An online tool for measuring and visualizing phenotype similarities using HPO journal August 2018
Improving the measurement of semantic similarity by combining gene ontology and co-functional network: a random walk based approach journal March 2018
Investigations on factors influencing HPO-based semantic similarity calculation journal September 2017
Constructing Networks of Organelle Functional Modules in Arabidopsis journal August 2016
OAHG: an integrated resource for annotating human genes with multi-level ontologies journal October 2016
Constructing an integrated gene similarity network for the identification of disease genes conference December 2016
InteGO2: a web tool for measuring and visualizing gene semantic similarities using Gene Ontology journal August 2016
Predicting disease-related genes using integrated biomedical networks journal January 2017
An online tool for measuring and visualizing phenotype similarities using HPO journal August 2018
Measuring disease similarity and predicting disease-related ncRNAs by a novel method journal December 2017
Constructing an integrated gene similarity network for the identification of disease genes journal September 2017