skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: InteGO2: A web tool for measuring and visualizing gene semantic similarities using Gene Ontology

Abstract

Here, the Gene Ontology (GO) has been used in high-throughput omics research as a major bioinformatics resource. The hierarchical structure of GO provides users a convenient platform for biological information abstraction and hypothesis testing. Computational methods have been developed to identify functionally similar genes. However, none of the existing measurements take into account all the rich information in GO. Similarly, using these existing methods, web-based applications have been constructed to compute gene functional similarities, and to provide pure text-based outputs. Without a graphical visualization interface, it is difficult for result interpretation. As a result, we present InteGO2, a web tool that allows researchers to calculate the GO-based gene semantic similarities using seven widely used GO-based similarity measurements. Also, we provide an integrative measurement that synergistically integrates all the individual measurements to improve the overall performance. Using HTML5 and cytoscape.js, we provide a graphical interface in InteGO2 to visualize the resulting gene functional association networks. In conclusion, InteGO2 is an easy-to-use HTML5 based web tool. With it, researchers can measure gene or gene product functional similarity conveniently, and visualize the network of functional interactions in a graphical interface.

Authors:
 [1];  [2];  [2];  [2];  [2];  [2];  [3]
  1. Northwestern Polytechnical Univ., Xi'an (China); Michigan State Univ., East Lansing, MI (United States)
  2. Harbin Institute of Technology, Harbin (China)
  3. Michigan State Univ., East Lansing, MI (United States)
Publication Date:
Research Org.:
Michigan State Univ., East Lansing, MI (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1362018
Grant/Contract Number:
FG02-91ER20021
Resource Type:
Journal Article: Accepted Manuscript
Journal Name:
BMC Genomics
Additional Journal Information:
Journal Volume: 17; Journal Issue: S5; Journal ID: ISSN 1471-2164
Publisher:
Springer
Country of Publication:
United States
Language:
English
Subject:
59 BASIC BIOLOGICAL SCIENCES; 60 APPLIED LIFE SCIENCES; gene ontology; semantic similarity; web tool

Citation Formats

Peng, Jiajie, Li, Hongxiang, Liu, Yongzhuang, Juan, Liran, Jiang, Qinghua, Wang, Yadong, and Chen, Jin. InteGO2: A web tool for measuring and visualizing gene semantic similarities using Gene Ontology. United States: N. p., 2016. Web. doi:10.1186/s12864-016-2828-6.
Peng, Jiajie, Li, Hongxiang, Liu, Yongzhuang, Juan, Liran, Jiang, Qinghua, Wang, Yadong, & Chen, Jin. InteGO2: A web tool for measuring and visualizing gene semantic similarities using Gene Ontology. United States. doi:10.1186/s12864-016-2828-6.
Peng, Jiajie, Li, Hongxiang, Liu, Yongzhuang, Juan, Liran, Jiang, Qinghua, Wang, Yadong, and Chen, Jin. 2016. "InteGO2: A web tool for measuring and visualizing gene semantic similarities using Gene Ontology". United States. doi:10.1186/s12864-016-2828-6. https://www.osti.gov/servlets/purl/1362018.
@article{osti_1362018,
title = {InteGO2: A web tool for measuring and visualizing gene semantic similarities using Gene Ontology},
author = {Peng, Jiajie and Li, Hongxiang and Liu, Yongzhuang and Juan, Liran and Jiang, Qinghua and Wang, Yadong and Chen, Jin},
abstractNote = {Here, the Gene Ontology (GO) has been used in high-throughput omics research as a major bioinformatics resource. The hierarchical structure of GO provides users a convenient platform for biological information abstraction and hypothesis testing. Computational methods have been developed to identify functionally similar genes. However, none of the existing measurements take into account all the rich information in GO. Similarly, using these existing methods, web-based applications have been constructed to compute gene functional similarities, and to provide pure text-based outputs. Without a graphical visualization interface, it is difficult for result interpretation. As a result, we present InteGO2, a web tool that allows researchers to calculate the GO-based gene semantic similarities using seven widely used GO-based similarity measurements. Also, we provide an integrative measurement that synergistically integrates all the individual measurements to improve the overall performance. Using HTML5 and cytoscape.js, we provide a graphical interface in InteGO2 to visualize the resulting gene functional association networks. In conclusion, InteGO2 is an easy-to-use HTML5 based web tool. With it, researchers can measure gene or gene product functional similarity conveniently, and visualize the network of functional interactions in a graphical interface.},
doi = {10.1186/s12864-016-2828-6},
journal = {BMC Genomics},
number = S5,
volume = 17,
place = {United States},
year = 2016,
month = 8
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 9works
Citation information provided by
Web of Science

Save / Share:
  • Background: Gene Ontology (GO) has been used widely to study functional relationships between genes. The current semantic similarity measures rely only on GO annotations and GO structure. This limits the power of GO-based similarity because of the limited proportion of genes that are annotated to GO in most organisms. Results: We introduce a novel approach called NETSIM (network-based similarity measure) that incorporates information from gene co-function networks in addition to using the GO structure and annotations. Using metabolic reaction maps of yeast, Arabidopsis, and human, we demonstrate that NETSIM can improve the accuracy of GO term similarities. We also demonstratemore » that NETSIM works well even for genomes with sparser gene annotation data. We applied NETSIM on large Arabidopsis gene families such as cytochrome P450 monooxygenases to group the members functionally and show that this grouping could facilitate functional characterization of genes in these families. Conclusions: Using NETSIM as an example, we demonstrated that the performance of a semantic similarity measure could be significantly improved after incorporating genome-specific information. NETSIM incorporates both GO annotations and gene co-function network data as a priori knowledge in the model. Therefore, functional similarities of GO terms that are not explicitly encoded in GO but are relevant in a taxon-specific manner become measurable when GO annotations are limited.« less
  • Most current approaches to automatic pathway generation are based on a reverse engineering approach in which pathway plausibility is solely derived from microarray gene expression data. These approaches tend to lack in generality and offer no independent validation as they are too reliant on the pathway observables that guide pathway generation. By contrast, alternative approaches that use prior biological knowledge to validate pathways inferred from gene expression data may err in the opposite direction as the prior knowledge is usually not sufficiently tuned to the pathology of focus. In this paper, we present a novel pathway generation approach that combinesmore » insights from the reverse engineering and knowledge-based approaches to increase the biological plausibility of automatically generated regulatory networks and describe an application of this approach to transcriptional data from a mouse model of neuroprotection during stroke.« less
  • Background: The Environment Ontology (ENVO; http://www.environmentontology.org/), first described in 2013, is a resource and research target for the semantically controlled description of environmental entities. The ontology's initial aim was the representation of the biomes, environmental features, and environmental materials pertinent to genomic and microbiome-related investigations. However, the need for environmental semantics is common to a multitude of fields, and ENVO's use has steadily grown since its initial description. We have thus expanded, enhanced, and generalised the ontology to support its increasingly diverse applications. Methods: We have updated our development suite to promote expressivity, consistency, and speed: we now develop ENVOmore » in the Web Ontology Language (OWL) and employ templating methods to accelerate class creation. We have also taken steps to better align ENVO with the Open Biological and Biomedical Ontologies (OBO) Foundry principles and interoperate with existing OBO ontologies. Further, we applied text-mining approaches to extract habitat information from the Encyclopedia of Life and automatically create experimental habitat classes within ENVO. Results: Relative to its state in 2013, ENVO's content, scope, and implementation have been enhanced and much of its existing content revised for improved semantic representation. ENVO now offers representations of habitats, environmental processes, anthropogenic environments, and entities relevant to environmental health initiatives and the global Sustainable Development Agenda for 2030. Several branches of ENVO have been used to incubate and seed new ontologies in previously unrepresented domains such as food and agronomy. The current release version of the ontology, in OWL format, is available at http://purl.obolibrary.org/obo/envo.owl. Conclusions: ENVO has been shaped into an ontology which bridges multiple domains including biomedicine, natural and anthropogenic ecology, 'omics, and socioeconomic development. Through continued interactions with our users and partners, particularly those performing data archiving and sythesis, we anticipate that ENVO's growth will accelerate in 2017. As always, we invite further contributions and collaboration to advance the semantic representation of the environment, ranging from geographic features and environmental materials, across habitats and ecosystems, to everyday objects in household settings.« less
  • The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million PubMed abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000more » rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available.« less
  • The Human Phenotype Ontology (HPO) is widely used in the rare disease community for differential diagnostics, phenotype-driven analysis of next-generation sequence-variation data, and translational research, but a comparable resource has not been available for common disease. Here, we have developed a concept-recognition procedure that analyzes the frequencies of HPO disease annotations as identified in over five million PubMed abstracts by employing an iterative procedure to optimize precision and recall of the identified terms. We derived disease models for 3,145 common human diseases comprising a total of 132,006 HPO annotations. The HPO now comprises over 250,000 phenotypic annotations for over 10,000more » rare and common diseases and can be used for examining the phenotypic overlap among common diseases that share risk alleles, as well as between Mendelian diseases and common diseases linked by genomic location. The annotations, as well as the HPO itself, are freely available.« less
    Cited by 43