Efficient Synthesis of Graph Methods: a Dynamically Scheduled Architecture

Minutoli, Marco; Castellana, Vito G.; Tumeo, Antonino; Lattuada, Marco; Ferrandi, Fabrizio

Conference · November 7, 2016

DOI: https://doi.org/10.1145/2966986.2967030 · OSTI ID: 1440701

RDF databases naturally map to a graph representation and employ languages, such as SPARQL, that implement queries as graph pattern matching routines. Graph methods exhibit irregular behavior: they present unpredictable, fine-grained data accesses and are synchronization intensive. Graph data structures expose large amounts of dynamic parallelism, but are difficult to partition without generating load imbalance. In this paper, we present a novel architecture to improve the synthesis of graph methods. Our design addresses the issues of these algorithms with two components: a Dynamic Task Scheduler (DTS), which reduces load imbalance and maximizes resource utilization, and a Hierarchical Memory Interface controller (HMI), which provides support for concurrent memory operations on multi-ported/multi-banked shared memories. We evaluate our approach by generating the accelerators for a set of SPARQL queries from the Lehigh University Benchmark (LUBM). We first analyze the load imbalance of these queries, showing that execution time among tasks can differ by orders of magnitude. We then synthesize the queries and compare the performance of the resulting accelerators against the current state of the art. Experimental results show that our solution provides a speedup over the serial implementation close to the theoretical maximum and a speedup of up to 3.45 over a baseline parallel implementation. We conclude our study by exploring the design space to achieve maximum memory channel utilization. The best design used at least three of the four memory channels for more than 90% of the execution time.
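
The load-imbalance argument in the abstract can be illustrated with a small software model. The sketch below is not the paper's DTS hardware design. It is a toy comparison, under assumed task durations, between a static block partition and a greedy dynamic scheduler that hands each task to whichever worker becomes free first. The function names, the durations, and the worker count are all hypothetical.

```python
import heapq


def static_makespan(durations, workers):
    """Block-partition tasks up front: each worker gets one contiguous chunk."""
    chunk = -(-len(durations) // workers)  # ceiling division
    loads = [sum(durations[i:i + chunk])
             for i in range(0, len(durations), chunk)]
    return max(loads)


def dynamic_makespan(durations, workers):
    """Greedy dynamic scheduling: the next task goes to the earliest-free worker."""
    free_at = [0] * workers          # time at which each worker becomes idle
    heapq.heapify(free_at)
    for d in durations:
        start = heapq.heappop(free_at)
        heapq.heappush(free_at, start + d)
    return max(free_at)


# Hypothetical workload: two tasks are orders of magnitude longer than the rest.
durations = [1000, 1000] + [1] * 6
static = static_makespan(durations, workers=4)
dynamic = dynamic_makespan(durations, workers=4)
print(f"static: {static}, dynamic: {dynamic}")
```

With this assumed workload the static partition places both long tasks on the same worker, while the dynamic policy spreads them across two workers, which is the kind of imbalance a run-time scheduler is meant to absorb.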


Research Organization: Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

Sponsoring Organization: USDOE

DOE Contract Number: AC05-76RL01830

OSTI ID: 1440701

Report Number(s): PNNL-SA-119594; 453040300

Resource Relation: Conference: Proceedings of the 35th International Conference on Computer-Aided Design (ICCAD 2016), November 7-10, 2016, Austin, Texas, Article No. 128

Country of Publication: United States

Language: English

Similar Records

Enabling the High Level Synthesis of Data Analytics Accelerators

Conference · October 1, 2016 · OSTI ID: 1440701

Minutoli, Marco; Castellana, Vito G.; Tumeo, Antonino; +2 more

Considerations on the Use of Custom Accelerators for Big Data Analytics

Book · May 19, 2017 · OSTI ID: 1440701

Castellana, Vito G.; Tumeo, Antonino; Minutoli, Marco; +2 more

High Level Synthesis of RDF Queries for Graph Analytics

Conference · November 2, 2015 · OSTI ID: 1440701

Castellana, Vito G.; Minutoli, Marco; Morari, Alessandro; +3 more

Related Subjects

FPGA implementations
Graph database Engine for Multithreaded Systems
architecture
Dynamic Scheduling
