skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Scaling Semantic Graph Databases in Size and Performance

Journal Article · · IEEE Micro, 34(4):16-26
DOI:https://doi.org/10.1109/MM.2014.39· OSTI ID:1170474

In this paper we present SGEM, a full software system for accelerating large-scale semantic graph databases on commodity clusters. Unlike current approaches, SGEM addresses semantic graph databases by only employing graph methods at all the levels of the stack. On one hand, this allows exploiting the space efficiency of graph data structures and the inherent parallelism of graph algorithms. These features adapt well to the increasing system memory and core counts of modern commodity clusters. On the other hand, however, these systems are optimized for regular computation and batched data transfers, while graph methods usually are irregular and generate fine-grained data accesses with poor spatial and temporal locality. Our framework comprises a SPARQL to data parallel C compiler, a library of parallel graph methods and a custom, multithreaded runtime system. We introduce our stack, motivate its advantages with respect to other solutions and show how we solved the challenges posed by irregular behaviors. We present the result of our software stack on the Berlin SPARQL benchmarks with datasets up to 10 billion triples (a triple corresponds to a graph edge), demonstrating scaling in dataset size and in performance as more nodes are added to the cluster.

Research Organization:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1170474
Report Number(s):
PNNL-SA-101644; 400470000
Journal Information:
IEEE Micro, 34(4):16-26, Journal Name: IEEE Micro, 34(4):16-26
Country of Publication:
United States
Language:
English

Similar Records

Accelerating semantic graph databases on commodity clusters
Conference · Sun Oct 06 00:00:00 EDT 2013 · OSTI ID:1170474

In-Memory Graph Databases for Web-Scale Data
Journal Article · Sun Mar 01 00:00:00 EST 2015 · Computer, 48(3):24-35 · OSTI ID:1170474

High Performance Semantic Factoring of Giga-Scale Semantic Graph Databases
Conference · Mon Oct 04 00:00:00 EDT 2010 · OSTI ID:1170474