skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Composing Data Parallel Code for a SPARQL Graph Engine

Abstract

Big data analytics process large amount of data to extract knowledge from them. Semantic databases are big data applications that adopt the Resource Description Framework (RDF) to structure metadata through a graph-based representation. The graph based representation provides several benefits, such as the possibility to perform in memory processing with large amounts of parallelism. SPARQL is a language used to perform queries on RDF-structured data through graph matching. In this paper we present a tool that automatically translates SPARQL queries to parallel graph crawling and graph matching operations. The tool also supports complex SPARQL constructs, which requires more than basic graph matching for their implementation. The tool generates parallel code annotated with OpenMP pragmas for x86 Shared-memory Multiprocessors (SMPs). With respect to commercial database systems such as Virtuoso, our approach reduces memory occupation due to join operations and provides higher performance. We show the scaling of the automatically generated graph-matching code on a 48-core SMP.

Authors:
; ; ; ;
Publication Date:
Research Org.:
Pacific Northwest National Lab. (PNNL), Richland, WA (United States)
Sponsoring Org.:
USDOE
OSTI Identifier:
1123246
Report Number(s):
PNNL-SA-96193
400470000
DOE Contract Number:  
AC05-76RL01830
Resource Type:
Conference
Resource Relation:
Conference: IEEE International Conference on Social Computing (SocialCom 2013), September 8-14, 2013, Alexandria, Virginia, 691-699
Country of Publication:
United States
Language:
English
Subject:
SPARQL; Big data; SPARQL-to-C

Citation Formats

Castellana, Vito G., Tumeo, Antonino, Villa, Oreste, Haglin, David J., and Feo, John. Composing Data Parallel Code for a SPARQL Graph Engine. United States: N. p., 2013. Web. doi:10.1109/SocialCom.2013.104.
Castellana, Vito G., Tumeo, Antonino, Villa, Oreste, Haglin, David J., & Feo, John. Composing Data Parallel Code for a SPARQL Graph Engine. United States. https://doi.org/10.1109/SocialCom.2013.104
Castellana, Vito G., Tumeo, Antonino, Villa, Oreste, Haglin, David J., and Feo, John. 2013. "Composing Data Parallel Code for a SPARQL Graph Engine". United States. https://doi.org/10.1109/SocialCom.2013.104.
@article{osti_1123246,
title = {Composing Data Parallel Code for a SPARQL Graph Engine},
author = {Castellana, Vito G. and Tumeo, Antonino and Villa, Oreste and Haglin, David J. and Feo, John},
abstractNote = {Big data analytics process large amount of data to extract knowledge from them. Semantic databases are big data applications that adopt the Resource Description Framework (RDF) to structure metadata through a graph-based representation. The graph based representation provides several benefits, such as the possibility to perform in memory processing with large amounts of parallelism. SPARQL is a language used to perform queries on RDF-structured data through graph matching. In this paper we present a tool that automatically translates SPARQL queries to parallel graph crawling and graph matching operations. The tool also supports complex SPARQL constructs, which requires more than basic graph matching for their implementation. The tool generates parallel code annotated with OpenMP pragmas for x86 Shared-memory Multiprocessors (SMPs). With respect to commercial database systems such as Virtuoso, our approach reduces memory occupation due to join operations and provides higher performance. We show the scaling of the automatically generated graph-matching code on a 48-core SMP.},
doi = {10.1109/SocialCom.2013.104},
url = {https://www.osti.gov/biblio/1123246}, journal = {},
number = ,
volume = ,
place = {United States},
year = {Sun Sep 08 00:00:00 EDT 2013},
month = {Sun Sep 08 00:00:00 EDT 2013}
}

Conference:
Other availability
Please see Document Availability for additional information on obtaining the full-text document. Library patrons may search WorldCat to identify libraries that hold this conference proceeding.

Save / Share: