SHMEMGraph: Efficient and Balanced Graph Processing Using One-Sided Communication

Fu, Huansong; Gorentla Venkata, Manjunath; Salman, Shaeke; Imam, Neena; Yu, Weikuan

doi:10.1109/CCGRID.2018.00078

Title: SHMEMGraph: Efficient and Balanced Graph Processing Using One-Sided Communication

Conference · Tue May 01 00:00:00 EDT 2018

DOI:https://doi.org/10.1109/CCGRID.2018.00078· OSTI ID:1468157

Fu, Huansong ^[1];

^[2]; Salman, Shaeke ^[1];

^[2]; Yu, Weikuan ^[1]

Florida State University, Tallahassee
ORNL

State-of-the-art synchronous graph processing frameworks face both inefficiency and imbalance issues that cause their performance to be suboptimal. These issues include the inefficiency of communication and the imbalanced graph computation/communication costs in an iteration. We propose to replace their conventional two-sided communication model with the one-sided counterpart. Accordingly, we design SHMEMGraph, an efficient and balanced graph processing framework that is formulated across a global memory space and takes advantage of the flexibility and efficiency of one-sided communication for graph processing. Through an efficient one-sided communication channel, SHMEMGraph utilizes the high-performance operations with RDMA while minimizing the resource contention within a computer node. In addition, SHMEMGraph synthesizes a number of optimizations to address both computation imbalance and communication imbalance. By using a graph of 1 billion edges, our evaluation shows that compared to the state-of-the-art Gemini framework, SHMEMGraph achieves an average improvement of 35.5% in terms of job completion time for five representative graph algorithms.

View Conference

Cite

Export

Save

Research Organization:: Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)

Sponsoring Organization:: USDOE Office of Science (SC)

DOE Contract Number:: AC05-00OR22725

OSTI ID:: 1468157

Resource Relation:: Conference: IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing - Washington D.C, District of Columbia, United States of America - 5/1/2018 12:00:00 PM-5/4/2018 12:00:00 PM

Country of Publication:: United States

Language:: English

References (18)

One trillion edges: graph processing at Facebook-scale Ching, Avery; Edunov, Sergey; Kabiljo, Maja Proceedings of the VLDB Endowment, Vol. 8, Issue 12 https://doi.org/10.14778/2824032.2824077	journal	August 2015
Mizan-RMA: Accelerating Mizan Graph Processing Framework with MPI RMA Li, Mingzhe; Lu, Xiaoyi; Hamidouche, Khaled 2016 IEEE 23rd International Conference on High Performance Computing (HiPC) https://doi.org/10.1109/HiPC.2016.015	conference	December 2016
Trinity: a distributed graph engine on a memory cloud Shao, Bin; Wang, Haixun; Li, Yatao Proceedings of the 2013 international conference on Management of data - SIGMOD '13 https://doi.org/10.1145/2463676.2467799	conference	January 2013
Scalable Graph500 design with MPI-3 RMA Li, Mingzhe; Lu, Xiaoyi; Potluri, Sreeram 2014 IEEE International Conference on Cluster Computing (CLUSTER) https://doi.org/10.1109/CLUSTER.2014.6968755	conference	September 2014
Mizan Khayyat, Zuhair; Awara, Karim; Alonazi, Amani Proceedings of the 8th ACM European Conference on Computer Systems https://doi.org/10.1145/2465351.2465369	conference	April 2013
Layered label propagation: a multiresolution coordinate-free ordering for compressing social networks Boldi, Paolo; Rosa, Marco; Santini, Massimo Proceedings of the 20th international conference on World wide web - WWW '11 https://doi.org/10.1145/1963405.1963488	conference	January 2011
Pregel: a system for large-scale graph processing Malewicz, Grzegorz; Austern, Matthew H.; Bik, Aart J. C. Proceedings of the 2010 international conference on Management of data - SIGMOD '10 https://doi.org/10.1145/1807167.1807184	conference	January 2010
SYNC or ASYNC: time to fuse for distributed graph-parallel computation Xie, Chenning; Chen, Rong; Guan, Haibing ACM SIGPLAN Notices, Vol. 50, Issue 8 https://doi.org/10.1145/2858788.2688508	journal	December 2015
High-Performance Key-Value Store on OpenSHMEM Fu, Huansong; Venkata, Manjunath Gorentla; Choudhury, Ahana Roy 2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) https://doi.org/10.1109/CCGRID.2017.49	conference	May 2017
X-Stream: edge-centric graph processing using streaming partitions Roy, Amitabha; Mihailovic, Ivo; Zwaenepoel, Willy Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles - SOSP '13 https://doi.org/10.1145/2517349.2522740	conference	January 2013
Designing Scalable Out-of-core Sorting with Hybrid MPI+PGAS Programming Models Jose, Jithin; Potluri, Sreeram; Subramoni, Hari Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models https://doi.org/10.1145/2676870.2676880	conference	October 2014
To Push or To Pull Besta, Maciej; Podstawski, Michał; Groner, Linus Proceedings of the 26th International Symposium on High-Performance Parallel and Distributed Computing https://doi.org/10.1145/3078597.3078616	conference	June 2017
Ligra Shun, Julian; Blelloch, Guy E. ACM SIGPLAN Notices, Vol. 48, Issue 8 https://doi.org/10.1145/2517327.2442530	journal	February 2013
An experimental comparison of pregel-like graph processing systems Han, Minyang; Daudjee, Khuzaima; Ammar, Khaled Proceedings of the VLDB Endowment, Vol. 7, Issue 12 https://doi.org/10.14778/2732977.2732980	journal	August 2014
G ra M Wu, Ming; Yang, Fan; Xue, Jilong Proceedings of the Sixth ACM Symposium on Cloud Computing https://doi.org/10.1145/2806777.2806849	conference	August 2015
Distributed GraphLab: a framework for machine learning and data mining in the cloud Low, Yucheng; Bickson, Danny; Gonzalez, Joseph Proceedings of the VLDB Endowment, Vol. 5, Issue 8 https://doi.org/10.14778/2212351.2212354	journal	April 2012
The webgraph framework I: compression techniques Boldi, P.; Vigna, S. Proceedings of the 13th conference on World Wide Web - WWW '04 https://doi.org/10.1145/988672.988752	conference	January 2004
Introducing OpenSHMEM: SHMEM for the PGAS community Chapman, Barbara; Curtis, Tony; Pophale, Swaroop Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model - PGAS '10 https://doi.org/10.1145/2020373.2020375	conference	January 2010

Similar Records

An asynchronous traversal engine for graph-based rich metadata management

Journal Article · Thu Jun 23 00:00:00 EDT 2016 · Parallel Computing · OSTI ID:1468157

Dai, Dong; Carns, Philip; Ross, Robert B.; +3 more

Final Report for Project DE-FC02-06ER25755 [Pmodels2]

Technical Report · Wed Mar 12 00:00:00 EDT 2014 · OSTI ID:1468157

Panda, Dhabaleswar; Sadayappan, P.

Decomposition of Large Scale Semantic Graphsvia an Efficient Communities Algorithm

Technical Report · Fri Feb 08 00:00:00 EST 2008 · OSTI ID:1468157

Yao, Y

Title: SHMEMGraph: Efficient and Balanced Graph Processing Using One-Sided Communication

Citation Formats

References (18)

Similar Records

Related Subjects