skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: In-Memory Graph Databases for Web-Scale Data

Journal Article · · Computer, 48(3):24-35
DOI:https://doi.org/10.1109/MC.2015.74· OSTI ID:1208713

RDF databases have emerged as one of the most relevant way for organizing, integrating, and managing expo- nentially growing, often heterogeneous, and not rigidly structured data for a variety of scientific and commercial fields. In this paper we discuss the solutions integrated in GEMS (Graph database Engine for Multithreaded Systems), a software framework for implementing RDF databases on commodity, distributed-memory high-performance clusters. Unlike the majority of current RDF databases, GEMS has been designed from the ground up to primarily employ graph-based methods. This is reflected in all the layers of its stack. The GEMS framework is composed of: a SPARQL-to-C++ compiler, a library of data structures and related methods to access and modify them, and a custom runtime providing lightweight software multithreading, network messages aggregation and a partitioned global address space. We provide an overview of the framework, detailing its component and how they have been closely designed and customized to address issues of graph methods applied to large-scale datasets on clusters. We discuss in details the principles that enable automatic translation of the queries (expressed in SPARQL, the query language of choice for RDF databases) to graph methods, and identify differences with respect to other RDF databases.

Research Organization:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1208713
Report Number(s):
PNNL-SA-107315; 400470000
Journal Information:
Computer, 48(3):24-35, Journal Name: Computer, 48(3):24-35
Country of Publication:
United States
Language:
English

Similar Records

Accelerating semantic graph databases on commodity clusters
Conference · Sun Oct 06 00:00:00 EDT 2013 · OSTI ID:1208713

Scaling Semantic Graph Databases in Size and Performance
Journal Article · Wed Aug 06 00:00:00 EDT 2014 · IEEE Micro, 34(4):16-26 · OSTI ID:1208713

High Level Synthesis of RDF Queries for Graph Analytics
Conference · Mon Nov 02 00:00:00 EST 2015 · OSTI ID:1208713