Distributed Multi-GPU Community Detection on Exascale Computing Platforms
- Oak Ridge National Laboratory
- BATTELLE (PACIFIC NW LAB)
Community detection is a fundamental operation in graph mining, and by uncovering hidden structures and patterns within complex systems it helps solve fundamental problems pertaining to social networks, such as information diffusion, epidemics, and recommender systems. Scaling graph algorithms for massive networks becomes challenging on modern distributed-memory multi-GPU (Graphics Processing Unit) systems due to limitations such as irregular memory access patterns, load imbalances, higher communication-computation ratios, and cross-platform support. We present a novel algorithm HiPDPL-GPU (distributed parallel Louvain) to address these challenges. We conduct experiments involving different partitioning techniques to achieve optimized performance of HiPDPL-GPU on the two largest supercomputers: Frontier and Summit. Remarkably, HiPDPL-GPU processes a graph with 4.2 billion edges in less than 3 minutes using 1024 GPUs. Qualitatively performance of HiPDPL-GPU is similar or better compared to other state-of-the-art CPU- and GPU-based implementations. While prior GPU implementations have predominantly employed CUDA, our first-of-its-kind implementation for community detection is cross-platform, accommodating both AMD and NVIDIA GPUs.
- Research Organization:
- Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
- Sponsoring Organization:
- USDOE
- DOE Contract Number:
- AC05-76RL01830
- OSTI ID:
- 2482258
- Report Number(s):
- PNNL-SA-195868
- Country of Publication:
- United States
- Language:
- English
Similar Records
Distributed Multi-GPU Community Detection on Exascale Computing Platforms
Towards scaling community detection on distributed-memory heterogeneous systems
GPU-accelerated DNS of compressible turbulent flows
Conference
·
Wed May 01 00:00:00 EDT 2024
·
OSTI ID:2372978
Towards scaling community detection on distributed-memory heterogeneous systems
Journal Article
·
Mon Feb 21 19:00:00 EST 2022
· Parallel Computing
·
OSTI ID:1854603
GPU-accelerated DNS of compressible turbulent flows
Journal Article
·
Sun Nov 27 19:00:00 EST 2022
· Computers and Fluids
·
OSTI ID:1959502