Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Query Optimization for Graph Analytics on Linked Data Using SPARQL

Technical Report ·
DOI:https://doi.org/10.2172/1215587· OSTI ID:1215587

Triplestores that support query languages such as SPARQL are emerging as the preferred and scalable solution to represent data and meta-data as massive heterogeneous graphs using Semantic Web standards. With increasing adoption, the desire to conduct graph-theoretic mining and exploratory analysis has also increased. Addressing that desire, this paper presents a solution that is the marriage of Graph Theory and the Semantic Web. We present software that can analyze Linked Data using graph operations such as counting triangles, finding eccentricity, testing connectedness, and computing PageRank directly on triple stores via the SPARQL interface. We describe the process of optimizing performance of the SPARQL-based implementation of such popular graph algorithms by reducing the space-overhead, simplifying iterative complexity and removing redundant computations by understanding query plans. Our optimized approach shows significant performance gains on triplestores hosted on stand-alone workstations as well as hardware-optimized scalable supercomputers such as the Cray XMT.

Research Organization:
Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)
Sponsoring Organization:
USDOE Laboratory Directed Research and Development (LDRD) Program
DOE Contract Number:
AC05-00OR22725
OSTI ID:
1215587
Report Number(s):
ORNL/TM--2015/342
Country of Publication:
United States
Language:
English

Similar Records

Graph Mining Meets the Semantic Web
Conference · Wed Dec 31 23:00:00 EST 2014 · OSTI ID:1190754

EAGLE: 'EAGLE'Is an' Algorithmic Graph Library for Exploration
Software · Fri Jan 16 00:00:00 EST 2015 · OSTI ID:1232204

TripleGraph
Software · Mon Apr 06 20:00:00 EDT 2020 · OSTI ID:code-45844