Semantic similarity between ontologies at different scales
Journal Article
·
· IEEE/CAA Journal of Automatica Sinica
In the past decade, existing and new knowledge and datasets has been encoded in different ontologies for semantic web and biomedical research. The size of ontologies is often very large in terms of number of concepts and relationships, which makes the analysis of ontologies and the represented knowledge graph computational and time consuming. As the ontologies of various semantic web and biomedical applications usually show explicit hierarchical structures, it is interesting to explore the trade-offs between ontological scales and preservation/precision of results when we analyze ontologies. This paper presents the first effort of examining the capability of this idea via studying the relationship between scaling biomedical ontologies at different levels and the semantic similarity values. We evaluate the semantic similarity between three Gene Ontology slims (Plant, Yeast, and Candida, among which the latter two belong to the same kingdom—Fungi) using four popular measures commonly applied to biomedical ontologies (Resnik, Lin, Jiang-Conrath, and SimRel). The results of this study demonstrate that with proper selection of scaling levels and similarity measures, we can significantly reduce the size of ontologies without losing substantial detail. In particular, the performance of Jiang-Conrath and Lin are more reliable and stable than that of the other two in this experiment, as proven by (a) consistently showing that Yeast and Candida are more similar (as compared to Plant) at different scales, and (b) small deviations of the similarity values after excluding a majority of nodes from several lower scales. This study provides a deeper understanding of the application of semantic similarity to biomedical ontologies, and shed light on how to choose appropriate semantic similarity measures for biomedical engineering.
- Research Organization:
- Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 1337285
- Report Number(s):
- PNNL-SA--115549; 453040300
- Journal Information:
- IEEE/CAA Journal of Automatica Sinica, Journal Name: IEEE/CAA Journal of Automatica Sinica Journal Issue: 2 Vol. 3; ISSN 2329-9266
- Country of Publication:
- United States
- Language:
- English
Similar Records
Measuring semantic similarities by combining gene ontology annotations and gene co-function networks
Mapping between the OBO and OWL ontology languages
Structure Discovery in Large Semantic Graphs Using Extant Ontological Scaling and Descriptive Statistics
Journal Article
·
Fri Feb 13 19:00:00 EST 2015
· BMC Bioinformatics
·
OSTI ID:1194164
Mapping between the OBO and OWL ontology languages
Journal Article
·
Sun Mar 06 19:00:00 EST 2011
· Journal of Biomedical Semantics
·
OSTI ID:1629609
Structure Discovery in Large Semantic Graphs Using Extant Ontological Scaling and Descriptive Statistics
Conference
·
Mon Jul 18 00:00:00 EDT 2011
·
OSTI ID:1092681